Starred repositories
Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text
You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale
Official ComfyUI Node for Paper - MagicQuill: An Intelligent Interactive Image Editing System
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
HunyuanVideo: A Systematic Framework For Large Video Generation Model
📹 A more flexible CogVideoX that can generate videos at any resolution and create videos from images.
A minimal and universal controller for FLUX.1.
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
Official repository of In-Context LoRA for Diffusion Transformers
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
Unifying 3D Mesh Generation with Language Models
Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
A suite of image and video neural tokenizers
ComfyUI nodes to edit videos using Genmo Mochi
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
Official PyTorch implementation of "Framer: Interactive Frame Interpolation".
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Official implementations for paper: Dynamic Typography: Bringing Text to Life via Video Diffusion Prior
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
Memory optimized finetuning scripts for CogVideoX & Mochi using TorchAO and DeepSpeed
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance