
ACTIVATION FUNCTION IN NEURAL NETWORK

Elements of a Neural Network


Input Layer: This layer accepts the input features. It provides information from the outside
world to the network; no computation is performed at this layer, and its nodes simply pass
the information (features) on to the hidden layer.
Hidden Layer: Nodes of this layer are not exposed to the outer world; they are part of the
abstraction provided by any neural network. The hidden layer performs computation on the
features entered through the input layer and transfers the result to the output layer.
Output Layer: This layer brings the information learned by the network up to the outer
world.
What is an activation function and why use one?
The activation function decides whether a neuron should be activated or not by computing
the weighted sum of the neuron's inputs and adding a bias to it. The purpose of the
activation function is to introduce non-linearity into the output of a neuron.
Explanation: A neural network is made up of neurons, each governed by its weights, bias,
and activation function. During training, we update the weights and biases of the neurons
on the basis of the error at the output. This process is known as back-propagation.
Activation functions make back-propagation possible because their gradients are supplied
along with the error to update the weights and biases.
Why do we need a non-linear activation function?
A neural network without an activation function is essentially just a linear regression
model. The activation function applies a non-linear transformation to the input, making the
network capable of learning and performing more complex tasks.
Mathematical proof
Suppose we have a neural net with two input features (i1 and i2), a hidden layer of two
neurons, and one output neuron.

Elements of the network are as follows:


Hidden layer i.e. layer 1:
z(1) = W(1)X + b(1)
a(1) = z(1)
Here,
• z(1) is the vectorized output of layer 1
• W(1) is the vectorized weights assigned to the neurons of the hidden layer, i.e. w1,
w2, w3 and w4
• X is the vectorized input features, i.e. i1 and i2
• b(1) is the vectorized bias assigned to the neurons of the hidden layer, i.e. b1 and b2
• a(1) is the vectorized activation of layer 1; with a linear (identity) activation it is
simply z(1)
(Note: We are not considering a non-linear activation function here)

Layer 2 i.e. output layer :-
Note : Input for layer 2 is the output from layer 1
z(2) = W(2)a(1) + b(2)
a(2) = z(2)
Calculation at Output layer
z(2) = (W(2) * [W(1)X + b(1)]) + b(2)
z(2) = [W(2) * W(1)] * X + [W(2)*b(1) + b(2)]
Let,
[W(2) * W(1)] = W
[W(2)*b(1) + b(2)] = b
Final output : z(2) = W*X + b
which is again a linear function
The output is again a linear function even after applying a hidden layer. Hence we can
conclude that no matter how many hidden layers we attach to the neural net, all layers will
behave the same way, because the composition of two linear functions is itself a linear
function. A neuron cannot learn with just a linear function attached to it; a non-linear
activation function lets it learn according to the difference with respect to the error.
Hence we need a non-linear activation function. The sketch below verifies this collapse
numerically.
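A minimal NumPy sketch of the derivation above (the random weights and the 2-2-1 layout are illustrative assumptions matching the diagram's i1, i2, w1..w4, b1, b2): two stacked linear layers collapse to the single linear map W = W(2)W(1), b = W(2)b(1) + b(2).

```python
import numpy as np

rng = np.random.default_rng(0)

# Hidden layer: 2 neurons, 2 inputs (w1..w4 as a 2x2 matrix, biases b1, b2).
W1, b1 = rng.normal(size=(2, 2)), rng.normal(size=2)
# Output layer: 1 neuron.
W2, b2 = rng.normal(size=(1, 2)), rng.normal(size=1)

x = rng.normal(size=2)  # input features i1, i2

# Layer-by-layer forward pass with the identity activation a(z) = z.
z1 = W1 @ x + b1
z2 = W2 @ z1 + b2

# Collapsed single linear layer: W = W(2)W(1), b = W(2)b(1) + b(2).
W = W2 @ W1
b = W2 @ b1 + b2

print(np.allclose(z2, W @ x + b))  # True: the two-layer net is one linear map
```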
Variants of Activation Function
Linear Function
• Equation : A linear function has the equation of a straight line, i.e. y = x
• No matter how many layers we have, if all of them are linear in nature, the final
output of the last layer is nothing but a linear function of the input of the first
layer.
• Range : -inf to +inf
• Uses : The linear activation function is used in just one place, i.e. the output
layer.
• Issues : The derivative of a linear function is a constant, so the gradient no
longer depends on the input "x"; back-propagation then provides the same update
regardless of the input, and the function cannot introduce any useful non-linear
behavior to our algorithm.
For example : Calculating the price of a house is a regression problem. A house price may
take any large or small value, so we can apply a linear activation at the output layer. Even
in this case the neural net must have a non-linear activation function at its hidden layers,
as the sketch below illustrates.
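A hypothetical sketch of such a regressor (layer sizes, the three features, and the untrained random weights are all assumptions for illustration): a ReLU hidden layer supplies the non-linearity, while the linear output leaves the predicted price unbounded.

```python
import numpy as np

def relu(z):
    return np.maximum(0.0, z)

rng = np.random.default_rng(1)
W1, b1 = rng.normal(size=(8, 3)), np.zeros(8)   # hidden layer, 3 input features
W2, b2 = rng.normal(size=(1, 8)), np.zeros(1)   # output layer

def predict(x):
    h = relu(W1 @ x + b1)  # non-linear hidden layer
    return W2 @ h + b2     # linear activation at the output: unbounded value

# Hypothetical features: [square footage, bedrooms, bathrooms]. The weights are
# untrained, so the printed number is meaningless; only the architecture matters.
print(predict(np.array([1200.0, 3.0, 2.0])))
```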
Sigmoid Function

• It is a function which is plotted as an 'S'-shaped graph.
• Equation : A = 1/(1 + e^(-x))
• Nature : Non-linear. Notice that for x values between -2 and 2 the curve is very
steep; small changes in x bring about large changes in the value of y.
• Value Range : 0 to 1
• Uses : Usually used in the output layer of a binary classifier, where the result is
either 0 or 1. Since the value of the sigmoid function lies between 0 and 1 only, the
result can easily be predicted to be 1 if the value is greater than 0.5 and 0
otherwise (see the sketch below).
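A minimal sketch of the sigmoid and the 0.5 threshold described above (the sample pre-activation values are made up for illustration):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

z = np.array([-2.0, 0.0, 2.0])   # example pre-activations
p = sigmoid(z)

print(p)                          # values squashed into (0, 1)
print((p > 0.5).astype(int))      # predict 1 if value > 0.5, else 0
```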
Tanh Function
• The activation that almost always works better than the sigmoid function is the
tanh function, also known as the hyperbolic tangent function. It is a mathematically
shifted version of the sigmoid function; the two are similar and can be derived from
each other.
• Equation :- A = tanh(x) = (e^x - e^(-x)) / (e^x + e^(-x)) = 2 × sigmoid(2x) - 1
• Value Range :- -1 to +1
• Nature :- non-linear
• Uses :- Usually used in the hidden layers of a neural network. Since its values lie
between -1 and 1, the mean of a hidden layer's outputs comes out to be 0 or very
close to it, which helps center the data by bringing the mean close to 0. This makes
learning for the next layer much easier (see the sketch below).
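A short check of the relationship stated in the equation above, tanh(x) = 2 × sigmoid(2x) - 1, and of tanh's zero-centred outputs (the sample input range is an assumption):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

z = np.linspace(-3.0, 3.0, 7)

# tanh is a shifted and scaled sigmoid: tanh(z) = 2*sigmoid(2z) - 1
print(np.allclose(np.tanh(z), 2.0 * sigmoid(2.0 * z) - 1.0))  # True

print(np.tanh(z))  # outputs lie in (-1, 1) and are centred around 0
```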
ReLU Function

• It stands for Rectified Linear Unit. It is the most widely used activation function,
chiefly implemented in the hidden layers of neural networks.
• Equation :- A(x) = max(0, x). It gives an output of x if x is positive and 0
otherwise.
• Value Range :- [0, inf)
• Nature :- non-linear, which means we can easily backpropagate the errors and
have multiple layers of neurons being activated by the ReLU function.
• Uses :- ReLU is less computationally expensive than tanh and sigmoid because it
involves simpler mathematical operations. At any given time only a few neurons are
activated, making the network sparse and therefore efficient and easy to compute
(see the sketch below).
In simple words, ReLU learns much faster than the sigmoid and tanh functions.
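A minimal sketch of the sparsity described above: with zero-centred pre-activations (an assumption for illustration), roughly half the units output exactly 0 after ReLU.

```python
import numpy as np

def relu(z):
    return np.maximum(0.0, z)

rng = np.random.default_rng(2)
z = rng.normal(size=1000)   # zero-centred pre-activations
a = relu(z)

# About half the activations are exactly zero: a sparse, cheap representation.
print(f"fraction of inactive units: {(a == 0.0).mean():.2f}")
```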
