Skip to content
View nstylia's full-sized avatar
🚀
Accelerate
🚀
Accelerate

Block or report nstylia

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
41 stars written in Python
Clear filter

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 57,961 5,911 Updated Aug 24, 2024

Inference code for Llama models

Python 57,244 9,658 Updated Aug 18, 2024

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 37,175 3,255 Updated Aug 17, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,278 4,202 Updated Jan 18, 2025

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,822 6,447 Updated Jan 9, 2025

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 28,820 3,418 Updated Jan 9, 2025

DSPy: The framework for programming—not prompting—language models

Python 21,150 1,595 Updated Jan 17, 2025

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…

Python 14,217 1,446 Updated Jan 17, 2025

🦉 Data Versioning and ML Experiments

Python 14,093 1,195 Updated Jan 13, 2025

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 13,013 882 Updated Jan 10, 2025

Ongoing research training transformer models at scale

Python 11,122 2,488 Updated Jan 18, 2025

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,921 636 Updated Jan 16, 2025

Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.

Python 5,308 328 Updated Jan 18, 2025

Supercharge Your Model Training

Python 5,241 429 Updated Jan 13, 2025

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Python 4,852 431 Updated Nov 18, 2024

TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...

Python 4,342 1,559 Updated Jan 17, 2025

Machine learning glossary

Python 3,035 725 Updated Aug 8, 2024

Simple, safe way to store and distribute tensors

Python 3,012 206 Updated Jan 9, 2025

FFCV: Fast Forward Computer Vision (and other ML workloads!)

Python 2,881 179 Updated Jun 16, 2024

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Python 2,455 458 Updated Apr 29, 2024

A python tool for evaluating the quality of sentence embeddings.

Python 2,091 310 Updated Mar 19, 2024

The implementation of DeBERTa

Python 2,026 230 Updated Sep 29, 2023

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,354 220 Updated Mar 20, 2024

A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)

Python 1,105 130 Updated Aug 28, 2024

Curate better data for LLMs

Python 1,000 96 Updated Mar 19, 2024

A Python parser for MediaWiki wikicode

Python 773 78 Updated Jan 13, 2025

Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting has…

Python 664 90 Updated Feb 27, 2024

Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"

Python 439 61 Updated Sep 6, 2023

EsViT: Efficient self-supervised Vision Transformers

Python 409 44 Updated Aug 28, 2023
Next