Fork to train/test TryOnDiffusion on comsumer GPU (24G VRAM)

Note: this is not a faithful implementation of TryOnDiffusion, but an image-to-image translation diffusion model with a modified U-Net, as described in jolibrain#530 This repository adds some convenience scripts for training/testing this implementation.

This project is a WIP.

Installation

Install torch packages (2.x) using https://pytorch.org/get-started/locally/
Install from requirements.txt

Preprocess VITON-HD dataset for TryOnDiffusion

https://www.joligen.com/doc/datasets.html#datasets-with-bbox-and-reference-image-conditioning

At this point, we only prepare the shirts (orange part in image-parse-v3) for training.

python scripts/preprocess_viton.py --zip-file /path/to/zalando-hd-resized.zip --target-dir /save/processed/data/here --dilate 5 --save_conditions --padding "0 0 0 0"

Train TryOnDiffusion

Start visdom for visualization: (in venv) python -m visdom.server
Start training (you may need to adjust some arguments inside shell script): python train_tryondiffusion.sh
You can see training visualization at http://localhost:8097/env/tryondiffusion_viton

Generative AI Image Toolset with GANs, Diffusion and Consistency Models for Real-World Applications

JoliGEN is an integrated framework for training custom generative AI image-to-image models

Main Features:

JoliGEN implements both GAN, Diffusion and Consistency models for unpaired and paired image to image translation tasks, including domain and style adaptation with conservation of semantics such as image and object classes, masks, ...
JoliGEN generative AI capabilities are targeted at real world applications such as Controled Image Generation, Augmented Reality, Dataset Smart Augmentation and object insertion, Synthetic to Real transforms.
JoliGEN allows for fast and stable training with astonishing results. A server with REST API is provided that allows for simplified deployment and usage.
JoliGEN has a large scope of options and parameters. To not get overwhelmed, follow the simple Quickstarts. There are then links to more detailed documentation on models, dataset formats, and data augmentation.

Useful links

Use cases

AR and metaverse: replace any image element with super-realistic objects
Image manipulation: seamlessly insert or remove objects/elements in images
Image to image translation while preserving semantics, e.g. existing source dataset annotations
Simulation to reality translation while preserving elements, metrics, ...
Image generation to enrich datasets, e.g. counter dataset imbalance, increase test sets, ...

This is achieved by combining powerful and customized generator architectures, bags of discriminators, and configurable neural networks and losses that ensure conservation of fundamental elements between source and target images.

Example results

Image translation while preserving the class

Mario to Sonic while preserving the action (running, jumping, ...)

Object insertion

Virtual Try-On with Diffusion

Car insertion (BDD100K) with Diffusion

Glasses insertion (FFHQ) with Diffusion

Object removal

Glasses removal with GANs

Style transfer while preserving label boxes (e.g. cars, pedestrians, street signs, ...)

Day to night (BDD100K) with Transformers and GANs

Clear to snow (BDD100K) by applying a generator multiple times to add snow incrementally

Clear to overcast (BDD100K)

Clear to rainy (BDD100K)

Features

SoTA image to image translation
Semantic consistency: conservation of labels of many types: bounding boxes, masks, classes.
SoTA discriminator models: projected, vision_aided, custom transformers.
Advanced generators: real-time, transformers, hybrid transformers-CNN, Attention-based, UNet with attention, StyleGAN2
Multiple models based on adversarial and diffusion generation: CycleGAN, CyCADA, CUT, Palette
GAN data augmentation mechanisms: APA, discriminator noise injection, standard image augmentation, online augmentation through sampling around bounding boxes
Output quality metrics: FID, PSNR, KID, ...
Server with REST API
Support for both CPU and GPU
Dockerized server
Production-grade deployment in C++ via DeepDetect

Code format and Contribution

If you want to contribute please use black code format. Install:

pip install black

Usage :

black .

If you want to format the code automatically before every commit :

pip install pre-commit
pre-commit install

Authors

JoliGEN is created and developed by Jolibrain.

Code structure is inspired by pytorch-CycleGAN-and-pix2pix, CUT, AttentionGAN, MoNCE, Palette among others.

Elements from JoliGEN are supported by the French National AI program "Confiance.AI"

Contact: [email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 952 Commits
.github/workflows		.github/workflows
ci		ci
data		data
docker		docker
docs		docs
examples		examples
imgs		imgs
models		models
options		options
scripts		scripts
server		server
tests		tests
util		util
.gitignore		.gitignore
.gitmodules		.gitmodules
.pre-commit-config.yaml		.pre-commit-config.yaml
.readthedocs.yaml		.readthedocs.yaml
CHANGELOG.md		CHANGELOG.md
Jenkinsfile		Jenkinsfile
LICENSE		LICENSE
README.md		README.md
client.py		client.py
evaluate.py		evaluate.py
package.json		package.json
release.sh		release.sh
requirements.txt		requirements.txt
test.py		test.py
train.py		train.py
train_model2ghost.sh		train_model2ghost.sh
train_tryondiffusion.sh		train_tryondiffusion.sh
train_viton.sh		train_viton.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fork to train/test TryOnDiffusion on comsumer GPU (24G VRAM)

Installation

Preprocess VITON-HD dataset for TryOnDiffusion

Train TryOnDiffusion

Generative AI Image Toolset with GANs, Diffusion and Consistency Models for Real-World Applications

Useful links

Use cases

Example results

Image translation while preserving the class

Object insertion

Object removal

Style transfer while preserving label boxes (e.g. cars, pedestrians, street signs, ...)

Features

Code format and Contribution

Authors

About

Releases

Packages

Languages

License

mjsh34/joliGEN-TryOnDiffusion

Folders and files

Latest commit

History

Repository files navigation

Fork to train/test TryOnDiffusion on comsumer GPU (24G VRAM)

Installation

Preprocess VITON-HD dataset for TryOnDiffusion

Train TryOnDiffusion

Generative AI Image Toolset with GANs, Diffusion and Consistency Models for Real-World Applications

Useful links

Use cases

Example results

Image translation while preserving the class

Object insertion

Object removal

Style transfer while preserving label boxes (e.g. cars, pedestrians, street signs, ...)

Features

Code format and Contribution

Authors

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages