- Nanjing
Stars
The simplest, fastest repository for training/finetuning medium-sized GPTs.
HumanML3D: A large and diverse 3d human motion-language dataset.
Latte: Latent Diffusion Transformer for Video Generation.
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)
The official repository for paper "Tora: Trajectory-oriented Diffusion Transformer for Video Generation"
Scaling Diffusion Transformers with Mixture of Experts
Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think (ICLR 2025)
Python library for designing and training your own Diffusion Models with PyTorch.
Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer
[NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"
A PyTorch implementation of the paper "ZigMa: A DiT-Style Mamba-based Diffusion Model" (ECCV 2024)
This repo contains the code for 1D tokenizer and generator
Implementation of Classifier Free Guidance in Pytorch, with emphasis on text conditioning, and flexibility to include multiple text embedding models
Official Code for Stable Cascade
Virtual whiteboard for sketching hand-drawn like diagrams
Code for "Cascaded Latent Diffusion Models for High-Resolution Chest X-ray Synthesis" @ PAKDD 2023
Stable Diffusion implemented from scratch in PyTorch
π₯π₯π₯A curated list of papers on recent diffusion-based high-resolution image and video synthesis works.
diffusion generative model
π§βπ« 60+ Implementations/tutorials of deep learning papers with side-by-side notes π; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gaβ¦
An LLM Chatbot that dynamically retrieves and processes resumes using RAG to perform resume screening.
MinImagen: A minimal implementation of the Imagen text-to-image model
Audio generation using diffusion models, in PyTorch.