📚 Datasets and models for instruction-tuning

fsodogandji/txtinstruct

 
 

Datasets and models for instruction-tuning


txtinstruct is a framework for training instruction-tuned models.

[Figure: txtinstruct architecture]

The objective of this project is to support open data, open models and integration with your own data. One of the biggest problems today is the lack of licensing clarity around instruction-following datasets and large language models. txtinstruct makes it easy to build your own instruction-following datasets and use those datasets to train instruction-tuned models.

txtinstruct is built with Python 3.7+ and txtai.

Installation

The easiest way to install is via pip and PyPI:

pip install txtinstruct

You can also install txtinstruct directly from GitHub. Using a Python Virtual Environment is recommended.

pip install git+https://github.com/neuml/txtinstruct

Python 3.7+ is supported

See this link to help resolve environment-specific install issues.
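
As a quick sanity check after installing, the sketch below tests the two conditions stated above: that the interpreter is Python 3.7 or newer, and that a package named txtinstruct can be found in the active environment. The helper names `python_supported` and `package_installed` are hypothetical and not part of txtinstruct; the check only uses the standard library and does not import txtinstruct itself.

```python
import importlib.util
import sys

# Minimum version taken from the "Python 3.7+ is supported" note above.
MIN_PYTHON = (3, 7)


def python_supported(version_info=sys.version_info, minimum=MIN_PYTHON):
    """Return True if the (major, minor) version meets the minimum."""
    return tuple(version_info[:2]) >= tuple(minimum)


def package_installed(name="txtinstruct"):
    """Return True if a top-level package with this name can be located.

    Hypothetical helper: it only looks the package up, it does not import it.
    """
    return importlib.util.find_spec(name) is not None


print("Python version OK:", python_supported())
print("txtinstruct found:", package_installed())
```

If the second line prints `False` inside a virtual environment, the package was most likely installed into a different interpreter than the one running the check.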

Examples

The following example notebooks show how to build models with txtinstruct.

| Notebook | Description |
|---|---|
| Introducing txtinstruct | Build instruction-tuned datasets and models |

Further Reading
