ControlNet For Stable Diffusion
Spring 2023
1 Introduction
The goal of this project is to train a ControlNet [2] to control Stable Diffusion
[1] on a new condition. ControlNet is a neural network architecture for
controlling image synthesis: it takes a control image and a text prompt, and
produces a synthesized image that matches the prompt while following the
constraints imposed by the control image. For example, ControlNet allows you
to generate an image based not only on a prompt, but also on a basic sketch
that defines the general shape and position of the objects in your image
(Figure 1).
ControlNet freezes the original Stable Diffusion UNet while instantiating
trainable copies of particular blocks. The trainable copies, connected through
"zero convolution" blocks (1x1 convolutions whose weights and biases are
initialized to zero), are trained to receive a condition and integrate that
information into the main model (Figure 2).

[Figure 2: Visualization of the ControlNet setup.]
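To make the mechanism concrete, here is a minimal PyTorch sketch of a single
controlled block. The module names (zero_conv, ControlledBlock) are
hypothetical simplifications, not the actual implementation from the ControlNet
repository, but the structure is the same: a frozen path plus a trainable copy
bracketed by zero-initialized convolutions.

# Minimal sketch of the ControlNet idea (hypothetical module names).
import copy
import torch
import torch.nn as nn

def zero_conv(channels: int) -> nn.Conv2d:
    """1x1 convolution whose weights and bias start at zero, so the
    trainable branch initially contributes nothing to the frozen model."""
    conv = nn.Conv2d(channels, channels, kernel_size=1)
    nn.init.zeros_(conv.weight)
    nn.init.zeros_(conv.bias)
    return conv

class ControlledBlock(nn.Module):
    def __init__(self, frozen_block: nn.Module, channels: int):
        super().__init__()
        self.frozen = frozen_block
        self.trainable = copy.deepcopy(frozen_block)  # trainable copy of the block
        for p in self.frozen.parameters():
            p.requires_grad_(False)                   # original UNet weights stay fixed
        self.zero_in = zero_conv(channels)            # injects the condition
        self.zero_out = zero_conv(channels)           # feeds the result back

    def forward(self, x: torch.Tensor, cond: torch.Tensor) -> torch.Tensor:
        # Frozen path plus a residual, condition-aware correction.
        # Assumes cond has already been mapped to the same shape as x.
        return self.frozen(x) + self.zero_out(self.trainable(x + self.zero_in(cond)))

Because both zero convolutions start at zero, the trainable branch adds nothing
at initialization, so training begins from the behavior of the unmodified
Stable Diffusion model and cannot catastrophically disturb it in early steps.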
This project asks you to train a ControlNet on a new condition and to
qualitatively analyze the results in terms of prompt fidelity, condition
fidelity, and quality of the resulting imagery. This can be done either through
the available toy dataset Fill50k, through a dataset that you might find online,
or through your own synthetically created dataset.
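Whatever dataset you choose, it ultimately needs to yield (target image,
prompt, condition image) triples. Below is a minimal sketch of such a dataset,
assuming the layout used by the Fill50k example in the ControlNet repository
(a prompt.json file whose lines each contain source, target, and prompt
fields); adapt the paths and normalization to your own data.

# Sketch of a (condition, target, prompt) dataset in the Fill50k layout.
import json
import cv2
import numpy as np
from torch.utils.data import Dataset

class ConditionDataset(Dataset):
    def __init__(self, root: str):
        self.root = root
        with open(f"{root}/prompt.json") as f:
            self.items = [json.loads(line) for line in f]

    def __len__(self):
        return len(self.items)

    def __getitem__(self, idx):
        item = self.items[idx]
        source = cv2.imread(f"{self.root}/{item['source']}")  # condition image
        target = cv2.imread(f"{self.root}/{item['target']}")  # ground-truth image
        source = cv2.cvtColor(source, cv2.COLOR_BGR2RGB) / 255.0        # [0, 1]
        target = cv2.cvtColor(target, cv2.COLOR_BGR2RGB) / 127.5 - 1.0  # [-1, 1]
        return dict(jpg=target.astype(np.float32),
                    txt=item["prompt"],
                    hint=source.astype(np.float32))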
You should document the training process, categorize the challenges you
encounter, and thoroughly analyze the resulting model in terms of quality and
condition fidelity.
2 Objectives
The main objectives of this project are:

1. Train a ControlNet on a new condition, using an existing dataset or one you
create yourself.
2. Document the training process and categorize the challenges encountered.
3. Qualitatively analyze the resulting model in terms of prompt fidelity,
condition fidelity, and image quality.
3 Methodology
3.1 Suggested steps
You are free to follow any strategy to achieve the aforementioned goals.
However, here is a set of steps that we suggest you follow:

1. Choose or build a dataset for your condition, and decide how to preprocess
and split it.
2. Set up a training environment with the ControlNet codebase and the
pretrained Stable Diffusion weights.
3. Train the ControlNet on your dataset (a minimal launch is sketched below).
4. Qualitatively evaluate the trained model on held-out conditions and prompts,
and document your findings.
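As an illustration of step 3, here is a sketch of a training launch in the
style of the ControlNet repository's tutorial_train.py. The paths, checkpoint
name, and hyperparameters follow that tutorial and will differ in your setup;
ConditionDataset is the dataset sketched earlier.

# Sketch of a ControlNet training launch (tutorial_train.py style).
import pytorch_lightning as pl
from torch.utils.data import DataLoader
from cldm.model import create_model, load_state_dict

model = create_model('./models/cldm_v15.yaml').cpu()
model.load_state_dict(load_state_dict('./models/control_sd15_ini.ckpt', location='cpu'))
model.learning_rate = 1e-5
model.sd_locked = True         # keep the original Stable Diffusion weights frozen
model.only_mid_control = False

dataset = ConditionDataset('./data/fill50k')   # placeholder path
dataloader = DataLoader(dataset, batch_size=4, shuffle=True, num_workers=4)

trainer = pl.Trainer(gpus=1, precision=32)
trainer.fit(model, dataloader)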
3.2 Resources
Here is a list of useful links and resources that can help you get started:
1. Main ControlNet GitHub repository: https://github.com/lllyasviel/ControlNet
3.3 Challenges
Training Stable Diffusion with ControlNet requires significant computational
resources. We recommend using Colab, Runpod, or cloud compute to facilitate
this work. Feel free to use resources such as
https://github.com/jehna/stable-diffusion-training-tutorial to guide your setup.
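If GPU memory is the bottleneck, two standard PyTorch Lightning options are
worth trying before scaling up hardware: mixed precision and gradient
accumulation. This is a sketch; the actual savings depend on your GPU and
model configuration.

# Common memory-saving options for the Lightning trainer.
import pytorch_lightning as pl

trainer = pl.Trainer(
    gpus=1,
    precision=16,               # mixed precision roughly halves activation memory
    accumulate_grad_batches=4,  # effective batch size = 4 x DataLoader batch size
)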
4 Expected Results
We expect to obtain a ControlNet that can effectively control generation under
the given condition, and a report that describes how the process was conducted.
You should describe your dataset decisions in terms of choice of data,
preprocessing, and splitting; you should explain your training process and the
challenges you encountered; and you should qualitatively analyze the model,
distill conclusions, and identify potential areas for improvement.
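One way to structure the qualitative analysis is to generate samples over a
fixed grid of prompts, conditions, and seeds, so that outputs are directly
comparable across checkpoints. Below is a sketch using the Hugging Face
diffusers library; it assumes the trained ControlNet has been converted to the
diffusers format, and the paths and prompts are placeholders.

# Sketch of a qualitative evaluation loop with diffusers.
import torch
from diffusers import ControlNetModel, StableDiffusionControlNetPipeline
from diffusers.utils import load_image

controlnet = ControlNetModel.from_pretrained("./my-controlnet",  # placeholder path
                                             torch_dtype=torch.float16)
pipe = StableDiffusionControlNetPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", controlnet=controlnet,
    torch_dtype=torch.float16,
).to("cuda")

condition = load_image("./eval/condition_00.png")            # placeholder condition
prompts = ["a red circle on a blue background",              # placeholder prompts
           "a photo of a house at dusk"]

for i, prompt in enumerate(prompts):
    # Fixed seed per cell so runs are directly comparable across checkpoints.
    generator = torch.Generator("cuda").manual_seed(0)
    image = pipe(prompt, image=condition, num_inference_steps=20,
                 generator=generator).images[0]
    image.save(f"./eval/sample_{i}.png")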
References
[1] Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, and Björn Ommer. High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 10684–10695, 2022.

[2] Lvmin Zhang and Maneesh Agrawala. Adding conditional control to text-to-image diffusion models. arXiv preprint arXiv:2302.05543, 2023.