Popular repositories
-
minbpe (Public, forked from karpathy/minbpe)
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Python
-
PPO-PyTorch (Public, forked from nikhilbarhate99/PPO-PyTorch)
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Python
-
MOSS-RLHF (Public, forked from OpenLMLab/MOSS-RLHF)
Secrets of RLHF in Large Language Models Part I: PPO
Python