Stars
A general fine-tuning kit geared toward diffusion models.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters
[CVPR'24 Best Student Paper] Mip-Splatting: Alias-free 3D Gaussian Splatting
Navigate dreamscapes with a click – your chosen point guides the drone’s flight in a thrilling visual journey.
A curated list for Efficient Large Language Models
(CVPR 2023) Official implementation of the paper "Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos"
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Official Code for DragGAN (SIGGRAPH 2023)
Official repo for consistency models.
General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX
A curated list of Composable AI methods: Building AI systems by composing modules.
[ICCV 2023] TM2D: Bimodality Driven 3D Dance Generation via Music-Text Integration
Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs
Transfer the ControlNet with any basemodel in diffusers🔥
Lossless Training Speed Up by Unbiased Dynamic Data Pruning
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Using Low-rank adaptation to quickly fine-tune diffusion models.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
A Unified Framework for Surface Reconstruction
[CVPR 2023] iQuery: Instruments as Queries for Audio-Visual Sound Separation
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)