A general fine-tuning kit geared toward diffusion models.

Python 2,029 195 Updated Jan 20, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 27,193 5,575 Updated Jan 21, 2025

Official repository for the paper PLLaVA

Python 634 48 Updated Jul 28, 2024

Accepted as [NeurIPS 2024] Spotlight Presentation Paper

Jupyter Notebook 6,124 616 Updated Sep 26, 2024

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

3,258 256 Updated Nov 1, 2024

We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters

Python 831 46 Updated Jan 3, 2025

[CVPR'24 Best Student Paper] Mip-Splatting: Alias-free 3D Gaussian Splatting

Python 1,170 77 Updated Dec 17, 2024

Navigate dreamscapes with a click – your chosen point guides the drone’s flight in a thrilling visual journey.

Python 44 3 Updated Dec 23, 2023

A curated list for Efficient Large Language Models

Python 1,394 104 Updated Dec 30, 2024

(CVPR 2023) Official implementation of the paper "Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos"

Python 29 4 Updated Apr 2, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 6,735 600 Updated May 31, 2024

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,837 3,459 Updated May 18, 2024

Official repo for consistency models.

Python 6,232 423 Updated Mar 22, 2024

General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX

1,747 97 Updated Nov 15, 2023

A curated list of Composable AI methods: Building AI system by composing modules.

192 5 Updated Nov 24, 2023

[ICCV 2023] TM2D: Bimodality Driven 3D Dance Generation via Music-Text Integration

Python 87 3 Updated Mar 4, 2024

Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs

Python 1,856 108 Updated Jan 12, 2025

Transfer the ControlNet with any basemodel in diffusers🔥

Python 821 48 Updated Apr 23, 2023

Lossless Training Speed Up by Unbiased Dynamic Data Pruning

Python 324 18 Updated Sep 24, 2024

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 8,192 1,015 Updated Jan 17, 2025

Let us control diffusion models!

Python 31,247 2,798 Updated Feb 25, 2024

A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.

393 25 Updated Sep 26, 2024

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 37,180 3,257 Updated Aug 17, 2024

Using Low-rank adaptation to quickly fine-tune diffusion models.

Jupyter Notebook 7,155 486 Updated Mar 22, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 38,598 6,259 Updated Dec 9, 2024

A Unified Framework for Surface Reconstruction

Python 2,000 190 Updated Jul 11, 2024

[CVPR 2023] iQuery: Instruments as Queries for Audio-Visual Sound Separation

Python 62 Updated Jul 25, 2023

[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)

Python 2,130 161 Updated Dec 22, 2022

MetaFormer Baselines for Vision (TPAMI 2024)

Python 438 28 Updated Jun 1, 2024