Stars
Implementation of Autoregressive Diffusion in Pytorch
Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
The paper collections for the autoregressive models in vision.
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
aider is AI pair programming in your terminal
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
An Pytorch implementation of the paper Key-Locked Rank One Editing for Text-to-Image Personalization
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
[TMLR] Official PyTorch implementation of "λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space"
Official pytorch implementation for SingleInsert
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
ScoreHMR: Score-Guided Diffusion for 3D Human Recovery (CVPR 2024)
COYO-700M: Large-scale Image-Text Pair Dataset
[CVPR 2024 Highlight] MIGC and [TPAMI 2024] MIGC++ (Official Implementation)
A Collection of Papers and Codes for CVPR2024/ECCV2024 AIGC
[CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition
LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models
Code release for our paper "Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation".
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Official implementations for paper: Anydoor: zero-shot object-level image customization
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusion: LMD, TMLR 2024)
[ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models
🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"
Official implementation of "DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization" (CVPR 2024)