0% found this document useful (0 votes)

74 views54 pages

Week8 WEB

The document discusses convolutional neural networks (CNNs) and the YOLO object detection model. It provides an overview of CNN architecture including convolution, activation, pooling, flattening, and fully connected layers. It explains how CNNs use shared weights and biases to detect features across image regions. The document also describes how YOLO improves on previous models by predicting bounding boxes and class probabilities simultaneously for real-time object detection in images.

Uploaded by

Ankit Shaw

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

74 views54 pages

Week8 WEB

Uploaded by

Ankit Shaw

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

TECHIN513 – Managing

Signal and Data Processing

Week 8
Today’s Agenda
• CNN
• YOLO
• ICTE
• FPDAWT
Today’s Agenda
• Convolutional Neural Network
• You Only Look Once
• In Class Team Exercise
• Final Project Discussion And Work Time
Announcement
• Purchasing supplies for final project
• Budget of $40 per team
• Requests must be made by Monday, February 26 at 9:59am

Link to Request Form:

TECHIN513 Final Project Supply Request Form - Google Sheets
What is a convolutional neural network?
• A network architecture for deep learning
• CNNs can have tens or hundreds of hidden layers
• Includes a typical artificial neural network architecture
• Useful for finding patterns in images to recognize objects
Stages of a CNN
• Input image
• Convolution
• Activation
• Pooling
• Flattening
• Fully Connected ANN
• Activation
image source

• Output

Convolutional Operations | Medium

pixel values range
Greyscale Image Data from 0 to 255

24x16 matrix

How Do Machines Read and Store Images? | Analytics Vidhya

Color Image Data

one image has

three matrices or
pixel values range “channels”
from 0 to 255

How Do Machines Read and Store Images? | Analytics Vidhya

CNN Overview
Feature Extraction

Feature Extraction with CNNs | Towards Data Science

Typical Artificial Neural Network
• Each neuron in the input layer
is connected to a neuron in the
hidden layer
• Each connection has a weight
value
• Each neuron has a bias value
• The model learns these values
during the training process
• Values are updated with each
new training example

Introduction to Deep Learning - MATLAB

Convolutional Neural Network
• The weights and bias values are
the same for all neurons in a
hidden layer
• All hidden layers are detecting
the same feature (e.g. edge) in
different regions of an image
• The network is better equipped
to detect the feature regardless
of its location in an image

Introduction to Deep Learning - MATLAB

Convolutional Operation

An operation on two functions

which produces a third
combined function

Convolution Integral | Statistics How To

Convolutional Operation
kernel types

• A convolutional kernal is a
small 2D matrix
• The kernal maps on to the
input image by matrix
multiplication and addition
• The output is a matrix of
lower dimensions
Sliding window protocol
where stride =1

Lower dimension matrix

(feature map) Convolutional Operations | Medium
Convoluting to Create Feature Maps

CNNs | simplilearn
45*0
+ 12*(-1)
+ 5*0
+ 22*(-1)
+ 10*5
+ 35*(-1)
+ 88*0
+ 26*(-1)
+ 51*0
= - 45
Activation Step Rectified
Linear
Unit
• Activation function takes the
output of a neuron and maps it
to the highest positive value
• If output is negative, the
function maps it to zero
• ReLU is a commonly used
activation function in deep
learning

Introduction to Deep Learning - MATLAB

ReLu activation retains only positive values

CNNs | simplilearn
CNN Overview
Pooling Step New
Feature
Map
• Pooling reduces dimensionality
of features map by using
different filters
• Condenses regions of neurons
into a single output
• Simplifies model by reducing
the number of parameters the
model needs to learn
• Pooling retains the most
important information but
lowers resolution

Introduction to Deep Learning - MATLAB

Pooling Applies Various Filters

CNNs | simplilearn
Pooling Enhances Edges Three iterations of
max pooling using a
(2, 2) kernel

Features (edges) are

enhanced, but
resolution is reduced

Pooling In Convolutional Neural Networks | paperspace

CNN Overview
Flattening
• The flatten layer lies
between the CNN and the
Softmax
ANN
• Converts the feature map
from the pooling layer into
an input that the ANN can
understand
• The ANN requires a one-
dimensional array as input
Artificial Neural Network

Feature Maps | educative.io , Dense layers | Pysource

Softmax Activation Step
Mathematical
representation
Last fully
• Often used as the last connected layer
activation function to
normalize the output of a
network to a probability
distribution over predicted
output classes
• The output of a Softmax is a
vector with probabilities of
each possible outcome.

Softmax Activation Function | Towards Data Science

CNN Output Layer
The final layer of the CNN architecture provides the final
classification output
A vector of length K
equal to the
number of classes

Introduction to Deep Learning - MATLAB

Classification, Detection, & Segmentation

or object localization

Object Segmentation vs. Object Detection | LinkedIn

You Only Look Once
• "You Only Look Once" (YOLO)
• YOLOv1 paper published May 2016
• Uses CNN as its backbone
network architecture
• YOLO predicts bounding boxes
and class probabilities for these
boxes simultaneously
• Improvement on previous model:
R-CNN

https://arxiv.org/abs/1506.02640
YOLO

https://pjreddie.com/darknet/yolo/

https://arxiv.org/abs/1506.02640
Previous Model for Image Detection: R-CNN
• Regions with CNN features
• Published Oct 2014
• link to article
• Splits an image into 2000
regions in boundary boxes
then classify each region
• Drawbacks:
• Long time to train – classify
2000 regions per image
• Detection not in real-time: 47
sec for test image
• Boundary box inaccuracies

R-CNN | Towards Data Science

How does YOLO work?
• Resizes the input image into YOLO Architecture
448x448
• A 1x1 convolution is first applied
to reduce the number of
channels
• 24 convolutional layers
• 4 max pooling layers
• The activation function is ReLU
• Two fully connected layers

https://arxiv.org/abs/1506.02640
What is Object
Detection?
First let’s talk about
object localization

36
What is object localization?
width (bw)
Object localization is
finding what and where a
(single) object exists in a
single image

height
(bh)

(bx, by)
How is object localization described
numerically in YOLO?
• The coordinates of a bounding x_train

box are described as a vector

y_train

Pc 1
Probability Bx 0.5
of class By 0.6
Bw 0.4
Bh 0.3
C1 1
C2 0
C1 = car class
C2 = motorcycle class
How is object localization described
numerically in YOLO? (0.5,0.6)
• The coordinates of a bounding (0,0) x_train

box are described as a vector

y_train

Pc 1 (bx,by)
Probability Bx 0.5 bh
of class By 0.6
0.3
Bw 0.4
Bh 0.3 bw
C1 1
C2 0 (1,1)
C1 = car class 0.4
C2 = motorcycle class
How is object localization described
numerically in YOLO? (0.5,0.6)
• The coordinates of a bounding (0,0)
box are described as a vector

Output of
Neural Network

Pc 1 (bx,by)
Probability Bx 0.5 bh
of class By 0.6
0.3
Bw 0.4
Bh 0.3 bw
C1 0.97
C2 0.03 (1,1)
C1 = car class 0.4
C2 = motorcycle class
How is object localization described
numerically in YOLO?
• The coordinates of a bounding x_train

box are described as a vector

y_train

Pc 0
Probability Bx -
of class By -
Bw -
Bh -
C1 -
C2 -
C1 = car class
C2 = motorcycle class
What about multiple objects?

YOLO algorithm | YouTube

What about multiple objects?

Pc 0
Bx -
By -
Bw -
Bh -
C1 -
C2 -

C1 = dog class
C2 = person class

YOLO algorithm | YouTube

What about multiple objects?
Person’s
object
belongs to
this cell

Pc 1
Bx 0.05
By 0.3
Bw 2
Bh 1.3
C1 1
C2 0

C1 = dog class
C2 = person class

YOLO algorithm | YouTube

What about multiple objects?

Pc 1
Bx 0.32
By 0.02
Bw 2.2
Bh 1.7
C1 0
C2 1

C1 = dog class
C2 = person class

YOLO algorithm | YouTube

What about multiple objects?

All other cells 4x4x7 matrix

Pc 0
Bx -
By -
Bw -
Bh -
C1 -
C2 -

C1 = dog class
C2 = person class

YOLO algorithm | YouTube

Training the YOLO Model

YOLO algorithm | YouTube

YOLO Prediction

YOLO algorithm | YouTube

Evaluating Image Detection Models
• Common Objects in Context
(COCO) dataset
• Published by Microsoft
• Used to evaluate algorithms’
performance of real-time
object detection
• 330,000 images
• 200,000 are labeled Pc 1

• 1.5 million object instances y_train

Bx
By
0.5
0.6
Bw 0.4
• 5 captions per image Bh
C1
0.3
1
C2 0

COCO Dataset | viso.ai

Evaluating Image Detection Models
Error Matrix

• Mean Average Precision (mAP)

• Benchmark metric used to
evaluate the robustness of
object detection models
• Incorporates mathematics image source

from:
• Error matrix
• Intersection over union (IoU)
ratio for bounding box

image source

Understanding Confusion Matrix | Towards Data Science

Best Object Detection Models

Object Detection | viso.ai

YOLOv8

YOLOv8 Tutorial - Colaboratory (google.com)

YOLOv8

Ultralytics YOLOv8 | GitHub

ICTE

Convolutional Networks
No ratings yet
Convolutional Networks
37 pages
Aiml Ece Unit-5
No ratings yet
Aiml Ece Unit-5
48 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
47 pages
Chap 2 DL
No ratings yet
Chap 2 DL
88 pages
Understanding of Convolutional Neural Network (CNN) - Deep Learning
No ratings yet
Understanding of Convolutional Neural Network (CNN) - Deep Learning
9 pages
CNN Notes Unit 3 Notes
No ratings yet
CNN Notes Unit 3 Notes
17 pages
Lecture - 07 (Convolutional Neural Networks)
No ratings yet
Lecture - 07 (Convolutional Neural Networks)
57 pages
Artificial Neural Networks
No ratings yet
Artificial Neural Networks
15 pages
CNN and Applications
No ratings yet
CNN and Applications
22 pages
Deep Learning Course Overview
No ratings yet
Deep Learning Course Overview
109 pages
Aiml Ece Unit-5
No ratings yet
Aiml Ece Unit-5
48 pages
Understanding of Convolutional Neural Network (CNN) - Deep Learning
No ratings yet
Understanding of Convolutional Neural Network (CNN) - Deep Learning
7 pages
CNN Basics for Computer Vision Students
No ratings yet
CNN Basics for Computer Vision Students
43 pages
4th Unit Aktu Machine Learning
No ratings yet
4th Unit Aktu Machine Learning
9 pages
E-Note 33951 Content Document 20250328020322PM
No ratings yet
E-Note 33951 Content Document 20250328020322PM
29 pages
CNN Cheatsheet for CS230 Students
No ratings yet
CNN Cheatsheet for CS230 Students
17 pages
Machine Learning (CSO851) - Lecture 10
No ratings yet
Machine Learning (CSO851) - Lecture 10
83 pages
CNNs for Image Recognition
No ratings yet
CNNs for Image Recognition
16 pages
CNNs for Machine Learning Experts
No ratings yet
CNNs for Machine Learning Experts
6 pages
CNN
No ratings yet
CNN
10 pages
Understanding CNNs in Deep Learning
No ratings yet
Understanding CNNs in Deep Learning
8 pages
DL Unit2
No ratings yet
DL Unit2
25 pages
DL Unit4
No ratings yet
DL Unit4
31 pages
Deep Learning Image Classification
No ratings yet
Deep Learning Image Classification
11 pages
Introduction to Convolutional Neural Networks
No ratings yet
Introduction to Convolutional Neural Networks
51 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
108 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
15 pages
Scan 30 Sep 23 18 20 44
No ratings yet
Scan 30 Sep 23 18 20 44
30 pages
DLCV Ch2 Neural Network
No ratings yet
DLCV Ch2 Neural Network
68 pages
Day8 (CNN)
No ratings yet
Day8 (CNN)
35 pages
Stage 424 June 2023
No ratings yet
Stage 424 June 2023
89 pages
Understanding of Convolutional Neural Network (CNN) - Deep Learning - by Prabhu Raghav - Medium
No ratings yet
Understanding of Convolutional Neural Network (CNN) - Deep Learning - by Prabhu Raghav - Medium
10 pages
Super VIP Cheatsheet - Deep Learning
No ratings yet
Super VIP Cheatsheet - Deep Learning
47 pages
Module 3 Notes
No ratings yet
Module 3 Notes
22 pages
CNN
No ratings yet
CNN
35 pages
Unit Iv - NNDL
No ratings yet
Unit Iv - NNDL
32 pages
CNN 3
No ratings yet
CNN 3
21 pages
Cnnbasics 171028092801
No ratings yet
Cnnbasics 171028092801
43 pages
Additional CNN
No ratings yet
Additional CNN
82 pages
Neural Networks and CNN Overview
No ratings yet
Neural Networks and CNN Overview
268 pages
Convolutional Nets
No ratings yet
Convolutional Nets
41 pages
UNIT-III DeepLearning Notes
No ratings yet
UNIT-III DeepLearning Notes
30 pages
L4 - Deep Learning
No ratings yet
L4 - Deep Learning
50 pages
What Is A Convolutional Neural Network-Unit3
No ratings yet
What Is A Convolutional Neural Network-Unit3
12 pages
Deep Learning & CNN Fundamentals
No ratings yet
Deep Learning & CNN Fundamentals
56 pages
Convolutional Neural Networks Overview
No ratings yet
Convolutional Neural Networks Overview
44 pages
Convolutional Networks Guide
No ratings yet
Convolutional Networks Guide
15 pages
Convolutional Neural Networks - Deeplearning-Notes
No ratings yet
Convolutional Neural Networks - Deeplearning-Notes
43 pages
Unit 4 (CNN and SOM)
No ratings yet
Unit 4 (CNN and SOM)
15 pages
CNN Applications in Computer Vision
No ratings yet
CNN Applications in Computer Vision
65 pages
Intro to Convolutional Networks
No ratings yet
Intro to Convolutional Networks
17 pages
CNN Basics for Image Classification
No ratings yet
CNN Basics for Image Classification
9 pages
An Introduction To Convolutional Neural Networks
No ratings yet
An Introduction To Convolutional Neural Networks
11 pages
Unit - 4 DL
No ratings yet
Unit - 4 DL
19 pages
Neural Network Brief Presentation
No ratings yet
Neural Network Brief Presentation
35 pages
Machine Learning for Image Classification
No ratings yet
Machine Learning for Image Classification
79 pages
Presented By:: SANTHOSH.K-927622BIT087 SIVA BHARAT.B-927622BIT099 NITHISH KUMAR.S-927622BIT066 SUNTHAR SHREE-927622BIT110
No ratings yet
Presented By:: SANTHOSH.K-927622BIT087 SIVA BHARAT.B-927622BIT099 NITHISH KUMAR.S-927622BIT066 SUNTHAR SHREE-927622BIT110
16 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
22 pages
Govt. of West Bengal E-Challan West Bengal Police: GRN: GRN Date: Payment Gateway
No ratings yet
Govt. of West Bengal E-Challan West Bengal Police: GRN: GRN Date: Payment Gateway
1 page
B.Tech Regular Exam Form 2019-20
No ratings yet
B.Tech Regular Exam Form 2019-20
1 page
Science Basics for Students
No ratings yet
Science Basics for Students
2 pages
Student Marks Summary Report
No ratings yet
Student Marks Summary Report
6 pages
WBJEE Info Brochure PDF
No ratings yet
WBJEE Info Brochure PDF
71 pages
C++ and Python Exam Paper
No ratings yet
C++ and Python Exam Paper
23 pages
HTML Executable Compilation Summary
No ratings yet
HTML Executable Compilation Summary
2 pages
Automatic Agarbatti Making Machine
No ratings yet
Automatic Agarbatti Making Machine
2 pages
Artificial Intelligence
No ratings yet
Artificial Intelligence
1 page
cp-1 Jozve Hoshmasnoei Matlabsitecom
No ratings yet
cp-1 Jozve Hoshmasnoei Matlabsitecom
16 pages
Calorie Detection and Alternate Food Recommendation System Using
No ratings yet
Calorie Detection and Alternate Food Recommendation System Using
12 pages
Nanotechnology - Teacher's Notes
No ratings yet
Nanotechnology - Teacher's Notes
4 pages
Artificial Intelligence
No ratings yet
Artificial Intelligence
1 page
ML Practice Questions
No ratings yet
ML Practice Questions
6 pages
Introduction to Neural Networks Basics
No ratings yet
Introduction to Neural Networks Basics
96 pages
Ai Module 3
No ratings yet
Ai Module 3
41 pages
Artificial Intelligence: B.E. (Computer Technology) Semester Seventh (C.B.S.)
No ratings yet
Artificial Intelligence: B.E. (Computer Technology) Semester Seventh (C.B.S.)
2 pages
AI & ML Learning Approaches
No ratings yet
AI & ML Learning Approaches
6 pages
Chapter 10: Artificial Neural Networks
No ratings yet
Chapter 10: Artificial Neural Networks
17 pages
4th AI
No ratings yet
4th AI
12 pages
Nanomaterials Course Overview
No ratings yet
Nanomaterials Course Overview
5 pages
Emerging Technologies
100% (1)
Emerging Technologies
8 pages
Accident Detection with Alert System
No ratings yet
Accident Detection with Alert System
20 pages
Global Green Nanotech Conclave 2015
No ratings yet
Global Green Nanotech Conclave 2015
5 pages
Flower Recog System
No ratings yet
Flower Recog System
11 pages
Affan Abbas: Computer Vision Engineer
No ratings yet
Affan Abbas: Computer Vision Engineer
1 page
Contextual CNN for Hyperspectral Image Classification
No ratings yet
Contextual CNN for Hyperspectral Image Classification
14 pages
Optical Neural Networks Review
No ratings yet
Optical Neural Networks Review
9 pages
Machine Learning Course Outline
No ratings yet
Machine Learning Course Outline
1 page
Deep Learning Plan
No ratings yet
Deep Learning Plan
1 page
AI and Machine Learning Courses List
No ratings yet
AI and Machine Learning Courses List
1 page
Digital Supply Chain Technologies
No ratings yet
Digital Supply Chain Technologies
9 pages
Top 10 Emerging Technologies 2024
No ratings yet
Top 10 Emerging Technologies 2024
46 pages
Philippine ALPR with InceptionV2
No ratings yet
Philippine ALPR with InceptionV2
5 pages
Deep Learning - Lesson Plan
No ratings yet
Deep Learning - Lesson Plan
5 pages
Seismic Facies Classification Using Supervised Convolutional Neural Networks and Semisupervised Generative Adversarial Networks
No ratings yet
Seismic Facies Classification Using Supervised Convolutional Neural Networks and Semisupervised Generative Adversarial Networks
12 pages
Class Note For Machine Learning at University
No ratings yet
Class Note For Machine Learning at University
58 pages
M.Tech Soft Computing Syllabus Kerala
No ratings yet
M.Tech Soft Computing Syllabus Kerala
2 pages

Week8 WEB

Uploaded by

Week8 WEB

Uploaded by

TECHIN513 – Managing

Signal and Data Processing

Link to Request Form:

Convolutional Operations | Medium

How Do Machines Read and Store Images? | Analytics Vidhya

one image has

How Do Machines Read and Store Images? | Analytics Vidhya

Feature Extraction with CNNs | Towards Data Science

Introduction to Deep Learning - MATLAB

Introduction to Deep Learning - MATLAB

Introduction to Deep Learning - MATLAB

Introduction to Deep Learning - MATLAB

An operation on two functions

Convolution Integral | Statistics How To

Lower dimension matrix

Introduction to Deep Learning - MATLAB

Introduction to Deep Learning - MATLAB

Features (edges) are

Pooling In Convolutional Neural Networks | paperspace

Feature Maps | educative.io , Dense layers | Pysource

Softmax Activation Function | Towards Data Science

Introduction to Deep Learning - MATLAB

Object Segmentation vs. Object Detection | LinkedIn

R-CNN | Towards Data Science

box are described as a vector

box are described as a vector

box are described as a vector

YOLO algorithm | YouTube

YOLO algorithm | YouTube

YOLO algorithm | YouTube

YOLO algorithm | YouTube

All other cells 4x4x7 matrix

YOLO algorithm | YouTube

YOLO algorithm | YouTube

YOLO algorithm | YouTube

• 1.5 million object instances y_train

COCO Dataset | viso.ai

• Mean Average Precision (mAP)

Understanding Confusion Matrix | Towards Data Science

Object Detection | viso.ai

YOLOv8 Tutorial - Colaboratory (google.com)

Ultralytics YOLOv8 | GitHub

You might also like