Skip to content
View 441041's full-sized avatar

Block or report 441041

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

coding an autograd from scratch

172 37 Updated Jan 12, 2019

My implementation of a GPT language model in PyTorch

Jupyter Notebook 4 Updated Aug 11, 2024

gpt from 0 -> 1

Python 3 Updated Aug 21, 2024

BabyGPT: Build Your Own GPT Large Language Model from Scratch Pre-Training Generative Transformer Models: Building GPT from Scratch with a Step-by-Step Guide to Generative AI in PyTorch and Python

Python 72 22 Updated Dec 5, 2023

Building a GPT-like LLM from scratch with PyTorch.

Python 60 30 Updated Dec 20, 2024

LLM-from-scratch

Python 7 1 Updated Jan 4, 2025

Learning records for building a large language model from scratch

Jupyter Notebook 49 Updated Jan 1, 2025

Repository for the free online book Machine Learning from Scratch (link below!)

Jupyter Notebook 1,178 214 Updated Aug 30, 2023

Helpful tools and examples for working with flex-attention

Python 626 34 Updated Feb 11, 2025

best way to save what you love

Svelte 27,621 2,223 Updated Feb 12, 2025

PyTorch Implementation of "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

Python 2 Updated Jun 9, 2021

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 40,174 4,925 Updated Feb 12, 2025

AI Roadmap:机器学习(Machine Learning)、深度学习(Deep Learning)、对抗神经网络(GAN),图神经网络(GNN),NLP,大数据相关的发展路书(roadmap), 并附海量源码(python,pytorch)带大家消化基本知识点,突破面试,完成从新手到合格工程师的跨越,其中深度学习相关论文附有tensorflow caffe官方源码,应用部分含推荐算法…

2,815 602 Updated Jan 20, 2025

Deep Learning for humans

Python 62,502 19,500 Updated Feb 11, 2025

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,461 649 Updated Feb 3, 2025

Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.

Jupyter Notebook 140 31 Updated May 12, 2024

Facenet implementation by Keras2

Jupyter Notebook 554 220 Updated Nov 28, 2018

Face recognition using Tensorflow

Python 13,938 4,812 Updated Jul 24, 2023

Trained a 114 million Parameter LLM from Scratch.

Python 17 1 Updated Jul 21, 2024

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,777 527 Updated Dec 14, 2024

Fine-tune mistral-7B on 3090s, a100s, h100s

Python 705 63 Updated Oct 11, 2023

AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库

Python 10,263 2,021 Updated Feb 12, 2025

PyTorch implementation of Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation

Python 27 3 Updated Dec 29, 2021
Jupyter Notebook 22 3 Updated May 7, 2023

PyTorch 101 series covering everything from the basic building blocks all the way to building custom architectures.

Jupyter Notebook 258 58 Updated Aug 19, 2020

LLM Finetuning with peft

Jupyter Notebook 2,305 633 Updated Jul 8, 2024

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,412 111 Updated Jan 24, 2025

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Python 1,444 75 Updated Mar 8, 2024

🤖 Python examples of popular machine learning algorithms with interactive Jupyter demos and math being explained

Jupyter Notebook 23,295 4,082 Updated Nov 12, 2024

Scratch Implementations of Major Machine Learning Algorithms

Jupyter Notebook 64 11 Updated Nov 29, 2018
Next