Here, we share our ideas and code for building LLMs – including Transformers, GPT-2, and training methods like SFT, DPO, and GRPO – entirely from scratch. We also provide simple mathematical derivations for algorithms such as DPO and GRPO, along with insights into recent research topics in LLMs, such as reasoning. We hope you find these resources helpful.
-
Notifications
You must be signed in to change notification settings - Fork 1
tianbingsz/LLM
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
About
Sharing LLM basic ideas and code
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published