Text-image Alignment for Diffusion-based Perception (TADP)

Official implementation of the paper Text-Image Alignment for Diffusion-based Perception (CVPR 2024).

Neehar Kondapaneni*, Markus Marks*, Manuel Knott*, Rogerio Guimaraes, Pietro Perona

Setup

We provide two separate shell scripts for setting up the environment:

setup.sh for setting up the environment for Pascal VOC Semantic Segmentation and Watercolor2k and Comic2k Object Detection.
setup_mm.sh for setting up the environment for ADE20k Semantic Segmentation, NYUv2 Depth Estimation, Nighttime Driving, and Dark Zurich Semantic Segmentation (using MM libraries).

bash setup.sh
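Which script to run depends on the task. As a minimal sketch, the mapping from the list above can be written as a small shell helper. The function name `pick_setup_script` and the task labels (`pascal`, `ade20k`, and so on) are hypothetical and not part of the repository. Only the two script names come from this README.

```shell
# Hypothetical helper (not part of the repository): maps a task label to the
# setup script that this README associates with it. Task labels are assumptions.
pick_setup_script() {
  case "$1" in
    pascal|watercolor2k|comic2k)
      # Pascal VOC segmentation and cross-domain object detection
      echo "setup.sh" ;;
    ade20k|nyuv2|nighttime_driving|dark_zurich)
      # Tasks that rely on the MM libraries
      echo "setup_mm.sh" ;;
    *)
      echo "unknown task: $1" >&2
      return 1 ;;
  esac
}

# Print the script name for a task. To actually run it from the repository root:
#   bash "$(pick_setup_script ade20k)"
pick_setup_script ade20k
```

The helper only prints a script name, so it is safe to try outside the repository. Running the selected script is left as an explicit second step.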

Inference

If you want to use our models for inference, there are two options available:

Single image inference

We provide a simple interface to load our model checkpoints and run inference with custom image and text inputs. Please refer to the demo/ directory for examples.

export PYTHONPATH=$PYTHONPATH:$(pwd)
python demo/depth_inference.py
python demo/seg_inference.py
python demo/detection_inference.py
python demo/seg_inference_driving.py

Whole dataset testing

If you want to generate results for a whole dataset that was used in our study (e.g., ADE20k, NYUv2) using pre-generated captions, please refer to the test_tadp_mm.py and test_tadp_depth.py scripts.

Training

TODO

Experiments

All results that are reported in our paper can be reproduced using the scripts in the cvpr_experiments/ directory.

Acknowledgements

This code is based on VPD, diffusers, stable-diffusion, mmsegmentation, LAVT, and MIM-Depth-Estimation.

Citation

@article{kondapaneni2024tadp,
  title={Text-Image Alignment for Diffusion-Based Perception},
  author={Kondapaneni, Neehar and Marks, Markus and Knott, Manuel and Guimaraes, Rogerio and Perona, Pietro},
  journal={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2024},
  month={June},
  pages={13883-13893}
}
