Highlights
- Pro
Lists (5)
Sort Name ascending (A-Z)
Starred repositories
[ARXIV'25] GameFactory: Creating New Games with Generative Interactive Videos
🔥🔥🔥 This repository includes latest papers, projects and datasets on GenAI for Cel-Animation.
Official implementation of "MangaNinja: Line Art Colorization with Precise Reference Following"
Sky-T1: Train your own O1 preview model within $450
FastVideo is a lightweight framework for accelerating large video diffusion models.
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
An open-source tool-augmented conversational language model from Fudan University
The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"
Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)
[arXiv'25] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Code release for "SegLLM: Multi-round Reasoning Segmentation"
[Arxiv 2024] Edicho: Consistent Image Editing in the Wild
A minimal and universal controller for FLUX.1.
Memory-optimized training scripts for video models based on Diffusers
Official implementation of "DepthLab: From Partial to Complete"
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey
Official implementation for paper - LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis
A generative world for general-purpose robotics & embodied AI learning.
Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Official Implementations for Paper - AniDoc: Animation Creation Made Easier
NOVA: Autoregressive Video Generation without Vector Quantization
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text