Deep Learning
Subject Code – EC37T
Course Pre-requisite: EC37P
Dr. Nayana Mahajan
Module II: Convolutional Neural Networks (CNNs)
Basics of CNNs (Convolution, Pooling, Padding, Stride)
Modern Deep Learning Architectures: LeNet Architecture,
AlexNet Architecture
Advanced Architectures: ResNet, DenseNet, EfficientNet
Transfer Learning and Fine-tuning CNNs
Applications: Image Classification, Object Detection
LeNet-5:
Purpose:
• Primarily designed for handwritten digit recognition,
specifically the MNIST dataset.
• The MNIST dataset (Modified National Institute of
Standards and Technology database) is a classic
benchmark in the field of machine learning and computer
vision.
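As a concrete starting point, here is a minimal sketch of loading MNIST with torchvision (the library choice is an assumption; MNIST images are 28×28 grayscale and are padded here to the 32×32 size LeNet-5 expects):

import torch
from torchvision import datasets, transforms

# Pad 28x28 MNIST digits to the 32x32 input size used by LeNet-5
transform = transforms.Compose([
    transforms.Pad(2),       # 28x28 -> 32x32
    transforms.ToTensor(),   # [0, 255] -> [0.0, 1.0] tensor of shape 1x32x32
])

train_set = datasets.MNIST(root="./data", train=True, download=True, transform=transform)
loader = torch.utils.data.DataLoader(train_set, batch_size=64, shuffle=True)

images, labels = next(iter(loader))
print(images.shape)  # torch.Size([64, 1, 32, 32])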
LeNet-5
LeNet-5, also known as the classic convolutional neural
network, was designed by Yann LeCun, Leon Bottou,
Yoshua Bengio, and Patrick Haffner in the 1990s for
handwritten and machine-printed character recognition.
The architecture was designed to identify handwritten
digits in the MNIST dataset.
LeNet-5
The architecture is straightforward and easy to
understand.
The input images are grayscale with dimensions of 32×32×1,
followed by two pairs of a convolution layer with stride 1 and
an average pooling layer with stride 2.
Finally, fully connected layers lead to a softmax-activated
output layer.
In total, the network has roughly 60,000 trainable parameters.
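A minimal PyTorch sketch of this layout (an assumption, not the original implementation; tanh activations stand in for the paper's squashing functions, and the softmax is typically folded into the loss during training):

import torch
import torch.nn as nn

class LeNet5(nn.Module):
    """Sketch of LeNet-5: two conv/avg-pool pairs, then fully connected layers."""
    def __init__(self, num_classes=10):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 6, kernel_size=5), nn.Tanh(),   # C1: 1x32x32 -> 6x28x28
            nn.AvgPool2d(kernel_size=2, stride=2),       # S2: 6x28x28 -> 6x14x14
            nn.Conv2d(6, 16, kernel_size=5), nn.Tanh(),  # C3: 6x14x14 -> 16x10x10
            nn.AvgPool2d(kernel_size=2, stride=2),       # S4: 16x10x10 -> 16x5x5
        )
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Linear(16 * 5 * 5, 120), nn.Tanh(),       # C5
            nn.Linear(120, 84), nn.Tanh(),               # F6
            nn.Linear(84, num_classes),                  # output: 10 digit classes
        )

    def forward(self, x):
        return self.classifier(self.features(x))

model = LeNet5()
print(sum(p.numel() for p in model.parameters()))  # 61706 -- about 60k, as stated above
print(model(torch.randn(1, 1, 32, 32)).shape)      # torch.Size([1, 10])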
LeNet-5:
Architecture:
A relatively shallow CNN with 7 layers (not counting the input).
• Input Layer: Accepts 32×32 pixel images.
• Convolutional Layers (C1, C3): Use 5×5 filters with 6 and 16 feature
maps, respectively.
• Pooling Layers (S2, S4): Employ 2×2 average pooling to reduce
spatial dimensions.
• Fully Connected Layers (C5, F6): Connect all neurons in the
preceding layer to each neuron in the current layer.
• Output Layer: 10 units; the original paper used Gaussian (RBF)
connections, while modern implementations use a softmax activation.
Number of Kernels (Filters)
• Each kernel learns a different feature (e.g., edge, texture,
shape).
• More kernels → more types of features learned.
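A short sketch illustrating this: the number of kernels in a convolutional layer directly sets the number of output feature maps (the kernel counts below are illustrative):

import torch
import torch.nn as nn

x = torch.randn(1, 1, 32, 32)           # one 32x32 grayscale image
few = nn.Conv2d(1, 6, kernel_size=5)    # 6 kernels -> 6 feature maps
many = nn.Conv2d(1, 16, kernel_size=5)  # 16 kernels -> 16 feature maps
print(few(x).shape)   # torch.Size([1, 6, 28, 28])
print(many(x).shape)  # torch.Size([1, 16, 28, 28])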
LeNet's Architecture
The LeNet architecture consists of several layers
that progressively extract and condense
information from input images.
Here is a description of each layer of the LeNet
architecture:
1. Input Layer: Accepts 32×32 pixel images, often
zero-padded if the original images are smaller.
2. First Convolutional Layer (C1): Consists of six 5×5
filters, producing six feature maps of 28×28 each.
3. First Pooling Layer (S2): Applies 2×2 average
pooling, reducing the feature maps to 14×14.
4. Second Convolutional Layer (C3): Uses sixteen
5×5 filters, but with sparse connections, outputting
sixteen 10×10 feature maps.
5. Second Pooling Layer (S4): Further reduces the
feature maps to 5×5 using 2×2 average pooling.
6. First Fully Connected Layer (C5): Fully connected with
120 nodes.
7. Second Fully Connected Layer (F6): Comprises 84
nodes.
8. Output Layer: Softmax or Gaussian (RBF) activation that
outputs probabilities across the 10 classes (digits 0-9).
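As a sanity check on the roughly 60,000-parameter figure quoted earlier, a quick count in Python (this assumes the common variant where C3 is fully connected to all S2 maps; the paper's sparse C3 has somewhat fewer parameters):

# parameters per layer = (weights per output unit + 1 bias) * number of output units
c1  = (5 * 5 * 1 + 1) * 6       # 156
c3  = (5 * 5 * 6 + 1) * 16      # 2,416 (fully connected C3 variant)
c5  = (5 * 5 * 16 + 1) * 120    # 48,120
f6  = (120 + 1) * 84            # 10,164
out = (84 + 1) * 10             # 850
print(c1 + c3 + c5 + f6 + out)  # 61706 -- about 60k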
AlexNet:
Purpose:
Won the 2012 ImageNet Large Scale Visual Recognition
Challenge (ILSVRC), demonstrating the potential of deep
learning for large-scale image recognition.
Architecture
A deeper CNN with 8 learnable layers (5 convolutional and
3 fully connected).
Convolutional Layers: Employs 5 convolutional layers with
varying filter sizes (11×11, 5×5, 3×3).
Pooling Layers: Uses max pooling to reduce spatial
dimensions.
Fully Connected Layers: Includes three fully connected layers,
with the final layer producing 1000 outputs (one per ImageNet
class).
AlexNet architecture (figure)
Input Layer
AlexNet takes RGB input images of size 227×227×3.
Convolutional Layers
• First Layer: The first layer uses 96 kernels of size
11×11 with a stride of 4, applies the ReLU
activation function, and then performs a max
pooling operation.
• Second Layer: The second layer takes the output of
the first layer as input, with 256 kernels of size
5×5×48.
• Third Layer: 384 kernels of size 3×3×256.
Convolutional Layers
• The third, fourth, and fifth convolutional layers are
connected to one another without any intervening
pooling or normalization layers.
• Fourth Layer: 384 kernels of size 3×3×192.
• Fifth Layer: 256 kernels of size 3×3×192.
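Putting the five convolutional layers together as a single stream (a sketch, not the original two-GPU code: the 48- and 192-channel kernel depths above come from the paper's GPU split, which merges to layer widths 96, 256, 384, 384, 256; the padding values are the commonly used ones and are assumptions here):

import torch
import torch.nn as nn

features = nn.Sequential(
    nn.Conv2d(3, 96, kernel_size=11, stride=4), nn.ReLU(),    # layer 1
    nn.MaxPool2d(kernel_size=3, stride=2),
    nn.Conv2d(96, 256, kernel_size=5, padding=2), nn.ReLU(),  # layer 2
    nn.MaxPool2d(kernel_size=3, stride=2),
    nn.Conv2d(256, 384, kernel_size=3, padding=1), nn.ReLU(), # layer 3
    nn.Conv2d(384, 384, kernel_size=3, padding=1), nn.ReLU(), # layer 4
    nn.Conv2d(384, 256, kernel_size=3, padding=1), nn.ReLU(), # layer 5
    nn.MaxPool2d(kernel_size=3, stride=2),
)

x = torch.randn(1, 3, 227, 227)
print(features(x).shape)  # torch.Size([1, 256, 6, 6])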
Fully Connected Layers
The first two fully connected layers have 4096 neurons each.
Output Layer
The output layer is a softmax layer that outputs
probabilities over the 1000 class labels.
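In practice AlexNet is rarely built by hand; here is a minimal sketch using the pretrained model shipped with torchvision (the weights enum assumes torchvision 0.13 or newer):

import torch
from torchvision import models

model = models.alexnet(weights=models.AlexNet_Weights.IMAGENET1K_V1)
model.eval()

x = torch.randn(1, 3, 227, 227)       # one dummy RGB image
with torch.no_grad():
    logits = model(x)                 # raw scores, shape [1, 1000]
    probs = logits.softmax(dim=1)     # softmax over the 1000 ImageNet classes
print(probs.shape)  # torch.Size([1, 1000])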
Example Calculation (without padding)
For an input of 227×227×3, applying:
Kernel size = 11
Stride = 4
No padding
Output size (per channel) = ((227 − 11) / 4) + 1 = 55
So the output feature map is 55×55×96 (AlexNet uses
96 filters in the first conv layer).
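The same arithmetic can be verified directly (a minimal sketch; the use of PyTorch here is an assumption):

import torch
import torch.nn as nn

conv1 = nn.Conv2d(3, 96, kernel_size=11, stride=4)  # no padding
x = torch.randn(1, 3, 227, 227)
print(conv1(x).shape)  # torch.Size([1, 96, 55, 55]) -- matches (227 - 11)/4 + 1 = 55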