![scikit-learn logo](https://raw.githubusercontent.com/github/explore/80688e429a7d4ef2fca1e82350fe8e3517d3494d/topics/scikit-learn/scikit-learn.png)
Starred repositories
My implementation of a GPT language model in PyTorch
BabyGPT: Build Your Own GPT Large Language Model from Scratch Pre-Training Generative Transformer Models: Building GPT from Scratch with a Step-by-Step Guide to Generative AI in PyTorch and Python
Building a GPT-like LLM from scratch with PyTorch.
Learning records for building a large language model from scratch
Repository for the free online book Machine Learning from Scratch (link below!)
Helpful tools and examples for working with flex-attention
PyTorch Implementation of "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
AI Roadmap:机器学习(Machine Learning)、深度学习(Deep Learning)、对抗神经网络(GAN),图神经网络(GNN),NLP,大数据相关的发展路书(roadmap), 并附海量源码(python,pytorch)带大家消化基本知识点,突破面试,完成从新手到合格工程师的跨越,其中深度学习相关论文附有tensorflow caffe官方源码,应用部分含推荐算法…
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.
Facenet implementation by Keras2
Trained a 114 million Parameter LLM from Scratch.
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Fine-tune mistral-7B on 3090s, a100s, h100s
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
PyTorch implementation of Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation
PyTorch 101 series covering everything from the basic building blocks all the way to building custom architectures.
LLM Finetuning with peft
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
🤖 Python examples of popular machine learning algorithms with interactive Jupyter demos and math being explained
Scratch Implementations of Major Machine Learning Algorithms