Skip to content
View hcw0098's full-sized avatar

Highlights

  • Pro

Block or report hcw0098

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

More relighting!

Python 6,947 409 Updated Nov 28, 2024
Python 120 7 Updated Oct 28, 2024

PyTorch implementation of CIDER (How to exploit hyperspherical embeddings for out-of-distribution detection), ICLR 2023

Python 56 8 Updated Aug 13, 2023

InstantStyle-Plus: Style Transfer with Content-Preserving in Text-to-Image Generation 🔥

Python 72 3 Updated Jul 17, 2024

FreeStyle : Free Lunch for Text-guided Style Transfer using Diffusion Models

Python 114 7 Updated May 21, 2024

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 2,883 183 Updated Oct 31, 2024

TPAMI:Frequency-aware Feature Fusion for Dense Image Prediction

Jupyter Notebook 7 Updated Sep 26, 2024

PyTorch implementation of RCG https://arxiv.org/abs/2312.03701

Python 875 40 Updated Sep 27, 2024

Curated collection of human fingerprint datasets suitable for research and evaluation of fingerprint recognition algorithms.

95 20 Updated Mar 11, 2024

Code for the Image similarity challenge.

Python 194 41 Updated Sep 16, 2022

Testing adaptation of the DINOv2 encoder for vision tasks with Low-Rank Adaptation (LoRA)

Jupyter Notebook 87 9 Updated Aug 1, 2024

Densely Captioned Images (DCI) dataset repository.

Python 162 5 Updated Jul 1, 2024

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)

Jupyter Notebook 1,719 100 Updated Oct 10, 2024

Open-Set Grounded Text-to-Image Generation

Python 2,036 152 Updated Mar 6, 2024

Evaluating Data Attribution for Text-to-Image Models: a visual data attribution benchmark for evaluating and learning training image influences.

Python 69 4 Updated Jun 25, 2024

Code for ACM MM'23 paper: LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation

Python 40 Updated Aug 1, 2024

S2D2R : Single-Stage Pipeline for Detected-to-Retrieval using Revisiting Google Landmark DataSets V2

4 Updated Apr 24, 2024

Diffusion Model-Based Image Editing: A Survey (arXiv)

511 33 Updated Nov 18, 2024

Collection of AWESOME vision-language models for vision tasks

2,609 222 Updated Dec 3, 2024

A Collection of Papers and Codes for CVPR2024/ECCV2024 AIGC

457 12 Updated Nov 13, 2024

🇫🇷 Oh my tmux! My self-contained, pretty & versatile tmux configuration made with ❤️

Shell 22,189 3,379 Updated Nov 30, 2024

✨✨Latest Advances on Multimodal Large Language Models

13,066 835 Updated Dec 13, 2024

ConvMAE: Masked Convolution Meets Masked Autoencoders

Python 490 42 Updated Mar 14, 2023

Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)

Jupyter Notebook 1,005 60 Updated Sep 21, 2023

The official implementation for "Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising".

Jupyter Notebook 289 34 Updated Dec 31, 2023

DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support

Python 447 41 Updated Mar 22, 2024

Official implementations for paper: Anydoor: zero-shot object-level image customization

Python 4,033 368 Updated Apr 8, 2024

Code for the paper "Training Diffusion Models with Reinforcement Learning"

Python 367 26 Updated Jul 5, 2023

Reproduction of DDPO paper (RLHF for diffusion)

Jupyter Notebook 73 2 Updated Sep 20, 2023
Next