STS5422
ARTIFICIAL
NEURAL
NETWORK 2
TOPIC 7
Multilayer Neural Networks
• A Multilayer Neural Network (MLNN) is a type of artificial neural network
composed of multiple layers of neurons, where each layer is fully or partially
connected to the next.
• MLNNs are the foundation of deep learning and are used to model complex
relationships in data.
• The network consists of an input layer of source neurons, at least one middle
or hidden layer of computational neurons, and an output layer of
computational neurons.
• The input signals are propagated in a forward direction on a layer-by-layer basis.
[Figure: a feed-forward network with an input layer, a first and a second hidden layer, and an output layer; input signals flow forward from left to right and output signals emerge from the output layer.]
What is Input Layer?
• The first layer in the network, responsible for receiving raw input data.
• Each neuron in the input layer corresponds to a feature of the input data.
o For example, if the input is an image with 28x28 pixels, the input layer will
have 784 neurons (one for each pixel).
o In a network processing tabular data, the input layer might represent
numerical features like age, income, or weight.
o For an image recognition task, the input layer accepts pixel values of an
image.
What Does The Middle Layer Hide?
• A hidden layer “hides” its desired output. Neurons in the hidden layer cannot
be observed through the input/output behaviour of the network.
• There is no obvious way to know what the desired output of the hidden layer
should be.
• For example:
o In an image classification task, early hidden layers might detect edges,
corners, and textures.
o Deeper hidden layers might identify shapes or patterns (e.g., eyes, wheels,
etc.).
How Hidden Layer Works
• Activation Functions:
o The transformation in hidden layers is controlled by activation functions
(e.g., ReLU, Sigmoid) to add non-linearity.
• Weights and Biases:
o Each hidden layer learns weights and biases during training to capture the
underlying patterns in the data.
• Commercial ANNs incorporate three and sometimes four layers, including one
or two hidden layers. Each layer can contain from 10 to 1000 neurons.
Experimental neural networks may have five or even six layers, including
three or four hidden layers, and utilise millions of neurons.
• Training multilayer neural networks can involve a number of different
algorithms, but the most popular is the back-propagation algorithm, also
known as the generalised delta rule.
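• As a minimal illustration (not taken from the slides) of the hidden-layer computation described above, the sketch below applies learned weights and biases and then an activation function; the NumPy code and the names relu, sigmoid and hidden_layer are illustrative assumptions.

```python
import numpy as np

# Common activation functions used in hidden layers.
def relu(z):
    return np.maximum(0.0, z)          # rectified linear unit

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))    # logistic function

# One hidden layer: the weights W and biases b are learned during training;
# the activation function supplies the non-linearity described above.
def hidden_layer(x, W, b, activation=relu):
    return activation(W @ x + b)

# Example: 3 input features feeding a hidden layer of 4 neurons.
x = np.array([0.5, -1.2, 3.0])
W = np.random.randn(4, 3) * 0.1        # 4 x 3 weight matrix
b = np.zeros(4)                        # one bias per hidden neuron
print(hidden_layer(x, W, b))
```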
Multi-layer Perceptrons
• This multi-layer network has different names: multi-layer perceptron (MLP),
feed-forward neural network, artificial neural network (ANN), backprop
network.
• Recall the simple neuron-like unit, which computes a weighted sum of its inputs and passes it through an activation function.
• These units are much more powerful if we connect many of them into a neural
network.
• We can connect lots of units
together into a directed acyclic
graph.
• This gives a feed-forward neural
network. That’s in contrast to
recurrent neural networks, which
can have cycles.
• Typically, units are grouped
together into layers.
• Each layer connects N input units to M
output units.
• In the simplest case, all input units are
connected to all output units. We call this a
fully connected layer. We'll consider other
layer types later.
• Note: the inputs and outputs for a layer are
distinct from the inputs and outputs to the
network.
• Recall from multiway logistic regression: connecting N input units to M
output units means we need an M × N weight matrix.
• The output units are a function of the input
units: $\mathbf{y} = f(\mathbf{W}\mathbf{x} + \mathbf{b})$, where $f$ is the activation function applied element-wise.
• Some activation functions: the hard threshold, the logistic (sigmoid) function, and the rectified linear unit (ReLU).
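• A hedged sketch of how a fully connected layer and a small feed-forward network can be composed, assuming NumPy; the helper name fully_connected and the layer sizes are illustrative assumptions.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# A fully connected layer mapping N input units to M output units
# needs an M x N weight matrix W and an M-dimensional bias b.
def fully_connected(x, W, b, activation=sigmoid):
    return activation(W @ x + b)       # y = f(Wx + b)

# A feed-forward network is just layers applied in sequence
# (a directed acyclic graph with no cycles, unlike a recurrent network).
rng = np.random.default_rng(0)
W1, b1 = rng.normal(scale=0.5, size=(3, 2)), np.zeros(3)   # layer 1: 2 -> 3
W2, b2 = rng.normal(scale=0.5, size=(1, 3)), np.zeros(1)   # layer 2: 3 -> 1

x = np.array([1.0, 0.0])               # network input
h = fully_connected(x, W1, b1)         # hidden layer output
y = fully_connected(h, W2, b2)         # network output
print(h, y)
```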
Designing a network to compute XOR
• XOR is a Boolean function that is true for two variables if and only if one of the
variables is true and the other is false.
• In other words, XOR is a logical operation that outputs true (1) if the inputs are different and false (0)
if the inputs are the same.
• XOR is a fundamental concept in digital logic and serves as a classic example
in machine learning for demonstrating the need for non-linear models like
neural networks.
• Assume a hard threshold activation function: each unit outputs 1 if its net input exceeds the threshold, and 0 otherwise.
Example
• We want to classify a single data point into one of two classes.
o Input Layer: 2 neurons
o Hidden Layer: 2 neurons
o Output Layer: 1 neuron
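• The weights below are one possible hand-designed choice (an assumption, not necessarily the values on the slide) for a 2-2-1 network with hard threshold units that computes XOR: the first hidden unit behaves like AND, the second like OR, and the output fires only when OR is true and AND is false.

```python
def hard_threshold(z):
    # hard threshold / step activation: fire (1) only if the net input is positive
    return 1 if z > 0 else 0

def xor_net(x1, x2):
    # Hidden layer (2 neurons): h1 behaves like AND, h2 like OR
    h1 = hard_threshold(1.0 * x1 + 1.0 * x2 - 1.5)
    h2 = hard_threshold(1.0 * x1 + 1.0 * x2 - 0.5)
    # Output layer (1 neuron): true when OR holds but AND does not, i.e. XOR
    return hard_threshold(-1.0 * h1 + 1.0 * h2 - 0.5)

for a in (0, 1):
    for b in (0, 1):
        print(a, b, "->", xor_net(a, b))   # 0 0 -> 0, 0 1 -> 1, 1 0 -> 1, 1 1 -> 0
```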
Back Propagation Algorithm
• Learning in a multilayer network proceeds the same way as for a perceptron.
• A training set of input patterns is presented to the network.
• The network computes its output pattern, and if there is an error - or in other
words a difference between actual and desired output patterns - the weights
are adjusted to reduce this error.
• In a back-propagation neural network, the learning algorithm has two phases.
o First, a training input pattern is presented to the network input layer. The
network propagates the input pattern from layer to layer until the output
pattern is generated by the output layer.
o If this pattern is different from the desired output, an error is calculated
and then propagated backwards through the network from the output layer
to the input layer. The weights are modified as the error is propagated.
Three-layer back-propagation neural network
[Figure: a three-layer back-propagation network. Input neurons x1 ... xi ... xn feed hidden neurons j through weights wij; hidden neurons feed output neurons y1 ... yk ... yl through weights wjk. Input signals propagate forward from the input layer through the hidden layer to the output layer, while error signals propagate backward from the output layer.]
The Back-propagation Training Algorithm
• Step 1: Initialisation
o Set all the weights and threshold levels of the network to random numbers
uniformly distributed inside a small range:
$\left( -\dfrac{2.4}{F_i},\; +\dfrac{2.4}{F_i} \right)$
o where $F_i$ is the total number of inputs of neuron $i$ in the network. The weight
initialisation is done on a neuron-by-neuron basis.
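• A minimal sketch of Step 1 in Python/NumPy, assuming each row of a weight matrix holds the incoming weights of one neuron with $F_i$ inputs; the function name init_weights and the matrix layout are assumptions.

```python
import numpy as np

def init_weights(n_inputs, n_neurons, rng=np.random.default_rng()):
    # Each neuron has F_i = n_inputs incoming connections, so its weights
    # (and its threshold) are drawn uniformly from (-2.4/F_i, +2.4/F_i).
    limit = 2.4 / n_inputs
    weights = rng.uniform(-limit, limit, size=(n_neurons, n_inputs))
    thresholds = rng.uniform(-limit, limit, size=n_neurons)
    return weights, thresholds

# Example: the 2-2-1 network used for the XOR problem.
W_hidden, theta_hidden = init_weights(n_inputs=2, n_neurons=2)
W_output, theta_output = init_weights(n_inputs=2, n_neurons=1)
```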
• Step 2: Activation
o Activate the back-propagation neural network by applying inputs $x_1(p), x_2(p), \ldots, x_n(p)$ and
desired outputs $y_{d,1}(p), y_{d,2}(p), \ldots, y_{d,n}(p)$.
o (a) Calculate the actual outputs of the neurons in the hidden layer:
$y_j(p) = \mathrm{sigmoid}\!\left[ \sum_{i=1}^{n} x_i(p)\, w_{ij}(p) - \theta_j \right]$
o where $n$ is the number of inputs of neuron $j$ in the hidden layer, and sigmoid
is the sigmoid activation function.
o (b) Calculate the actual outputs of the neurons in the output layer:
$y_k(p) = \mathrm{sigmoid}\!\left[ \sum_{j=1}^{m} x_{jk}(p)\, w_{jk}(p) - \theta_k \right]$
o where $m$ is the number of inputs of neuron $k$ in the output layer.
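• Step 2 expressed as a short NumPy sketch, continuing the illustrative layout assumed above (one row of weights per neuron, thresholds subtracted as in the formulas):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def forward(x, W_hidden, theta_hidden, W_output, theta_output):
    # Hidden layer: y_j = sigmoid( sum_i x_i * w_ij - theta_j )
    y_hidden = sigmoid(W_hidden @ x - theta_hidden)
    # Output layer: y_k = sigmoid( sum_j y_j * w_jk - theta_k )
    y_output = sigmoid(W_output @ y_hidden - theta_output)
    return y_hidden, y_output
```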
• Step 3: Weight training
o Update the weights in the back-propagation network, propagating
backward the errors associated with the output neurons.
o (a) Calculate the error gradient for the neurons in the output layer:
$\delta_k(p) = y_k(p)\,[1 - y_k(p)]\; e_k(p)$
o where:
$e_k(p) = y_{d,k}(p) - y_k(p)$
o Calculate the weight corrections:
$\Delta w_{jk}(p) = \alpha\, y_j(p)\, \delta_k(p)$
o Update the weights at the output neurons:
$w_{jk}(p+1) = w_{jk}(p) + \Delta w_{jk}(p)$
o (b) Calculate the error gradient for the neurons in the hidden layer:
$\delta_j(p) = y_j(p)\,[1 - y_j(p)] \sum_{k=1}^{l} \delta_k(p)\, w_{jk}(p)$
o Calculate the weight corrections:
$\Delta w_{ij}(p) = \alpha\, x_i(p)\, \delta_j(p)$
o Update the weights at the hidden neurons:
$w_{ij}(p+1) = w_{ij}(p) + \Delta w_{ij}(p)$
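• Step 3 as a continuation of the same sketch, with learning rate $\alpha$; the variable names carry on the assumptions made in the previous snippets.

```python
import numpy as np

def backward_update(x, y_d, y_hidden, y_output,
                    W_hidden, theta_hidden, W_output, theta_output, alpha=0.1):
    # (a) Output-layer error gradient: delta_k = y_k (1 - y_k) e_k, with e_k = y_dk - y_k
    e = y_d - y_output
    delta_out = y_output * (1.0 - y_output) * e
    # (b) Hidden-layer error gradient uses the current output weights w_jk(p):
    #     delta_j = y_j (1 - y_j) * sum_k delta_k w_jk
    delta_hid = y_hidden * (1.0 - y_hidden) * (W_output.T @ delta_out)
    # Weight and threshold corrections (thresholds act as weights on a fixed -1 input)
    W_output += alpha * np.outer(delta_out, y_hidden)
    theta_output += alpha * (-1.0) * delta_out
    W_hidden += alpha * np.outer(delta_hid, x)
    theta_hidden += alpha * (-1.0) * delta_hid
    return W_hidden, theta_hidden, W_output, theta_output
```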
• Step 4: Iteration
o Increase iteration p by one, go back to Step 2 and repeat the process until
the selected error criterion is satisfied.
• As an example, we may consider the three-layer back-propagation network.
Suppose that the network is required to perform logical operation Exclusive-
OR. Recall that a single-layer perceptron could not do this operation. Now
we will apply the three-layer net.
[Figure: the three-layer network applied to the Exclusive-OR operation. Input neurons 1 and 2 receive x1 and x2 and feed hidden neurons 3 and 4 through weights w13, w23, w14 and w24; hidden neurons 3 and 4 feed output neuron 5 (output y5) through weights w35 and w45. Neurons 3, 4 and 5 each receive an additional input fixed at -1, weighted by thresholds θ3, θ4 and θ5.]
• The effect of the threshold applied to a neuron in the hidden or output layer is
represented by its weight, $\theta$, connected to a fixed input equal to -1.
• The initial weights and threshold levels are set randomly as follows:
$w_{13} = 0.5$, $w_{14} = 0.9$, $w_{23} = 0.4$, $w_{24} = 1.0$, $w_{35} = -1.2$, $w_{45} = 1.1$, $\theta_3 = 0.8$, $\theta_4 = -0.1$ and $\theta_5 = 0.3$.
• We consider a training set where inputs $x_1$ and $x_2$ are equal to 1 and the desired
output $y_{d,5}$ is 0. The actual outputs of neurons 3 and 4 in the hidden layer are
calculated as
$y_3 = \mathrm{sigmoid}(x_1 w_{13} + x_2 w_{23} - \theta_3) = 1/\left[1 + e^{-(1 \cdot 0.5 + 1 \cdot 0.4 - 1 \cdot 0.8)}\right] = 0.5250$
$y_4 = \mathrm{sigmoid}(x_1 w_{14} + x_2 w_{24} - \theta_4) = 1/\left[1 + e^{-(1 \cdot 0.9 + 1 \cdot 1.0 + 1 \cdot 0.1)}\right] = 0.8808$
• Now the actual output of neuron 5 in the output layer is determined as:
$y_5 = \mathrm{sigmoid}(y_3 w_{35} + y_4 w_{45} - \theta_5) = 1/\left[1 + e^{-(-0.5250 \cdot 1.2 + 0.8808 \cdot 1.1 - 1 \cdot 0.3)}\right] = 0.5097$
• Thus, the following error is obtained:
$e = y_{d,5} - y_5 = 0 - 0.5097 = -0.5097$
• The next step is weight training. To update the weights and threshold levels in
our network, we propagate the error, $e$, from the output layer backward to the
input layer.
• First, we calculate the error gradient for neuron 5 in the output layer:
$\delta_5 = y_5 (1 - y_5)\, e = 0.5097 \cdot (1 - 0.5097) \cdot (-0.5097) = -0.1274$
• Then we determine the weight corrections assuming that the learning rate
parameter, $\alpha$, is equal to 0.1:
$\Delta w_{35} = \alpha\, y_3\, \delta_5 = 0.1 \cdot 0.5250 \cdot (-0.1274) = -0.0067$
$\Delta w_{45} = \alpha\, y_4\, \delta_5 = 0.1 \cdot 0.8808 \cdot (-0.1274) = -0.0112$
$\Delta \theta_5 = \alpha \cdot (-1) \cdot \delta_5 = 0.1 \cdot (-1) \cdot (-0.1274) = 0.0127$
• Next we calculate the error gradients for neurons 3 and 4 in the hidden layer:
$\delta_3 = y_3 (1 - y_3)\, \delta_5\, w_{35} = 0.5250 \cdot (1 - 0.5250) \cdot (-0.1274) \cdot (-1.2) = 0.0381$
$\delta_4 = y_4 (1 - y_4)\, \delta_5\, w_{45} = 0.8808 \cdot (1 - 0.8808) \cdot (-0.1274) \cdot 1.1 = -0.0147$
• We then determine the weight corrections:
$\Delta w_{13} = \alpha\, x_1\, \delta_3 = 0.1 \cdot 1 \cdot 0.0381 = 0.0038$
$\Delta w_{23} = \alpha\, x_2\, \delta_3 = 0.1 \cdot 1 \cdot 0.0381 = 0.0038$
$\Delta \theta_3 = \alpha \cdot (-1) \cdot \delta_3 = 0.1 \cdot (-1) \cdot 0.0381 = -0.0038$
$\Delta w_{14} = \alpha\, x_1\, \delta_4 = 0.1 \cdot 1 \cdot (-0.0147) = -0.0015$
$\Delta w_{24} = \alpha\, x_2\, \delta_4 = 0.1 \cdot 1 \cdot (-0.0147) = -0.0015$
$\Delta \theta_4 = \alpha \cdot (-1) \cdot \delta_4 = 0.1 \cdot (-1) \cdot (-0.0147) = 0.0015$
• At last, we update all weights and thresholds:
$w_{13} = w_{13} + \Delta w_{13} = 0.5 + 0.0038 = 0.5038$
$w_{14} = w_{14} + \Delta w_{14} = 0.9 - 0.0015 = 0.8985$
$w_{23} = w_{23} + \Delta w_{23} = 0.4 + 0.0038 = 0.4038$
$w_{24} = w_{24} + \Delta w_{24} = 1.0 - 0.0015 = 0.9985$
$w_{35} = w_{35} + \Delta w_{35} = -1.2 - 0.0067 = -1.2067$
$w_{45} = w_{45} + \Delta w_{45} = 1.1 - 0.0112 = 1.0888$
$\theta_3 = \theta_3 + \Delta \theta_3 = 0.8 - 0.0038 = 0.7962$
$\theta_4 = \theta_4 + \Delta \theta_4 = -0.1 + 0.0015 = -0.0985$
$\theta_5 = \theta_5 + \Delta \theta_5 = 0.3 + 0.0127 = 0.3127$
• The training process is repeated until the sum of squared errors is less than 0.001.
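• The short script below (an illustrative sketch, using the initial weights and the learning rate $\alpha = 0.1$ from the example) reproduces one back-propagation iteration for the input $x_1 = x_2 = 1$ with desired output 0; the printed values should match the hand calculation above up to rounding.

```python
import numpy as np

sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

# Initial weights and thresholds from the example
w13, w14, w23, w24 = 0.5, 0.9, 0.4, 1.0
w35, w45 = -1.2, 1.1
t3, t4, t5 = 0.8, -0.1, 0.3
alpha = 0.1

x1, x2, yd5 = 1.0, 1.0, 0.0

# Forward pass
y3 = sigmoid(x1 * w13 + x2 * w23 - t3)          # 0.5250
y4 = sigmoid(x1 * w14 + x2 * w24 - t4)          # 0.8808
y5 = sigmoid(y3 * w35 + y4 * w45 - t5)          # 0.5097
e = yd5 - y5                                     # -0.5097

# Error gradients
d5 = y5 * (1 - y5) * e                           # -0.1274
d3 = y3 * (1 - y3) * d5 * w35                    #  0.0381
d4 = y4 * (1 - y4) * d5 * w45                    # -0.0147

# Weight and threshold updates
w35 += alpha * y3 * d5;  w45 += alpha * y4 * d5;  t5 += alpha * -1 * d5
w13 += alpha * x1 * d3;  w23 += alpha * x2 * d3;  t3 += alpha * -1 * d3
w14 += alpha * x1 * d4;  w24 += alpha * x2 * d4;  t4 += alpha * -1 * d4

print(round(w13, 4), round(w35, 4), round(t5, 4))  # 0.5038 -1.2067 0.3127
```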
Learning curve for operation Exclusive-OR
[Figure: sum-squared network error plotted against epoch on a logarithmic scale (from about 10^1 down to 10^-4); the error decreases over 224 epochs until the 10^-3 error criterion is reached.]
• Final results of three-layer network learning:

Inputs (x1, x2) | Desired output (yd) | Actual output (y5) | Error (e) | Sum of squared errors
1, 1            | 0                   | 0.0155             | -0.0155   | 0.0010
0, 1            | 1                   | 0.9849             |  0.0151   |
1, 0            | 1                   | 0.9849             |  0.0151   |
0, 0            | 0                   | 0.0175             | -0.0175   |