-
coconut Public
Forked from facebookresearch/coconutTraining Large Language Model to Reason in a Continuous Latent Space
Python MIT License UpdatedJan 16, 2025 -
flash-linear-attention Public
Forked from fla-org/flash-linear-attention🚀 Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
Python MIT License UpdatedJan 7, 2025 -
big_vision Public
Forked from google-research/big_visionOfficial codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
Jupyter Notebook Apache License 2.0 UpdatedDec 20, 2024 -
torchglyph Public
Data Processor Combinators for Natural Language Processing
-
-
torchrua Public
Manipulate tensors with PackedSequence and CattedSequence
-
aku Public
An interactive annotation-driven ArgumentParser generator
-
-
SimPO Public
Forked from princeton-nlp/SimPOSimPO: Simple Preference Optimization with a Reference-Free Reward
Python MIT License UpdatedNov 12, 2024 -
-
-
Emu3 Public
Forked from baaivision/Emu3Next-Token Prediction is All You Need
Python Apache License 2.0 UpdatedOct 22, 2024 -
-
imagen-pytorch Public
Forked from lucidrains/imagen-pytorchImplementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
Python MIT License UpdatedOct 7, 2024 -
denoising-diffusion-pytorch Public
Forked from lucidrains/denoising-diffusion-pytorchImplementation of Denoising Diffusion Probabilistic Model in Pytorch
Python MIT License UpdatedSep 27, 2024 -
-
torchlatent Public
High Performance Structured Prediction in PyTorch
-
-
XLM Public
Forked from facebookresearch/XLMPyTorch original implementation of Cross-lingual Language Model Pretraining.
Python Other UpdatedAug 26, 2024 -
llama-recipes Public
Forked from meta-llama/llama-cookbookScripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…
Jupyter Notebook UpdatedAug 23, 2024 -
wikipedia2vec Public
Forked from wikipedia2vec/wikipedia2vecA tool for learning vector representations of words and entities from Wikipedia
Python Other UpdatedAug 19, 2024 -
-
ALMA Public
Forked from fe1ixxu/ALMAState-of-the-art LLM-based translation models.
Ruby MIT License UpdatedJul 27, 2024 -
llm2vec Public
Forked from McGill-NLP/llm2vecCode for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
Python MIT License UpdatedJul 11, 2024 -
mae Public
Forked from facebookresearch/maePyTorch implementation of MAE https//arxiv.org/abs/2111.06377
Python Other UpdatedJun 23, 2024 -
muse-maskgit-pytorch Public
Forked from lucidrains/muse-maskgit-pytorchImplementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch
Python MIT License UpdatedJun 15, 2024 -
orpo Public
Forked from xfactlab/orpoOfficial repository for ORPO
Python Apache License 2.0 UpdatedMay 31, 2024 -
mistral-v0.2-jax Public
Forked from yixiaoer/mistral-v0.2-jaxJAX implementation of the Mistral 7b v0.2 model
Python MIT License UpdatedApr 8, 2024 -
-
transformer-debugger Public
Forked from openai/transformer-debuggerPython MIT License UpdatedMar 12, 2024