Stars
PyTorch implementation of CIDER (How to exploit hyperspherical embeddings for out-of-distribution detection), ICLR 2023
InstantStyle-Plus: Style Transfer with Content-Preserving in Text-to-Image Generation 🔥
FreeStyle : Free Lunch for Text-guided Style Transfer using Diffusion Models
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
ying-fu / FreqFusion
Forked from Linwei-Chen/FreqFusion
TPAMI: Frequency-aware Feature Fusion for Dense Image Prediction
PyTorch implementation of RCG https://arxiv.org/abs/2312.03701
Curated collection of human fingerprint datasets suitable for research and evaluation of fingerprint recognition algorithms.
Code for the Image similarity challenge.
Testing adaptation of the DINOv2 encoder for vision tasks with Low-Rank Adaptation (LoRA)
Densely Captioned Images (DCI) dataset repository.
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
Evaluating Data Attribution for Text-to-Image Models: a visual data attribution benchmark for evaluating and learning training image influences.
Code for ACM MM'23 paper: LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation
S2D2R : Single-Stage Pipeline for Detected-to-Retrieval using Revisiting Google Landmark DataSets V2
Diffusion Model-Based Image Editing: A Survey (arXiv)
Collection of AWESOME vision-language models for vision tasks
A Collection of Papers and Codes for CVPR2024/ECCV2024 AIGC
🇫🇷 Oh my tmux! My self-contained, pretty & versatile tmux configuration made with ❤️
✨✨Latest Advances on Multimodal Large Language Models
ConvMAE: Masked Convolution Meets Masked Autoencoders
Official PyTorch implementation of "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" (ICML 2023)
The official implementation for "Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising".
DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support
Official implementation of the paper "AnyDoor: Zero-shot Object-level Image Customization"
Code for the paper "Training Diffusion Models with Reinforcement Learning"
Reproduction of DDPO paper (RLHF for diffusion)