Skip to content

Modified torch titan for customized size of model.

License

Notifications You must be signed in to change notification settings

qiziAI/torchtitan

Repository files navigation

torchtrain

Note: This repository is currently under heavy development.

torchtrain contains PyTorch native parallelisms, tools and utilities to train large models.

Installation

install PyTorch from source or install the latest pytorch nightly, then install requirements by

pip install -r requirements.txt

download tokenizer from HF This part is needed first time if there's no tokenizer locally by run:

python torchtrain/datasets/download_tokenizer.py --hf_token your_token

run the llama debug model locally to verify the setup is correct:

./run_llama_train.sh

About

Modified torch titan for customized size of model.

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 92.6%
  • Shell 6.7%
  • Dockerfile 0.7%