DeepLearning L1 Intro

Deep learning introduces neural networks that can learn representations of data directly from large datasets. This overcomes limitations of hand-engineered features. Recent progress is due to large datasets, powerful GPUs, and improved techniques like backpropagation for training networks. The basic building block of neural networks is the perceptron, which performs a weighted sum of its inputs and applies an activation function. Networks are trained by minimizing a loss function using gradient descent and backpropagation to update weights. Techniques like dropout and early stopping help prevent overfitting during training.


Deep Learning: Introduction

Pr. Tarik Fissaa


DATA – INE2

Academic Year: 2022/2023
What is Deep Learning?
Why Deep Learning and Why Now?
Why Deep Learning?
Hand-engineered features are time-consuming, brittle, and not scalable in practice.

Can we learn the underlying features directly from data?


Why Now?

Neural networks date back decades, so why the resurgence?

1. Big Data: larger datasets, easier collection & storage
2. Hardware: Graphics Processing Units (GPUs), massively parallelizable
3. Software: improved techniques, new models, toolboxes
The Perceptron
The structural building block for deep learning
The perceptron: Forward Propagation
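In the forward pass, the perceptron takes the weighted sum of its inputs plus a bias and passes it through a nonlinear activation function g, producing ŷ = g(w0 + Σ xi wi). A minimal NumPy sketch, with a sigmoid activation and purely illustrative weights (none of these values come from the slides):

    import numpy as np

    def sigmoid(z):
        # squash any real number into the interval (0, 1)
        return 1.0 / (1.0 + np.exp(-z))

    def perceptron_forward(x, w, b):
        # weighted sum of the inputs plus the bias, then the nonlinearity
        z = np.dot(w, x) + b
        return sigmoid(z)

    # illustrative values only
    x = np.array([-1.0, 2.0])    # inputs x1, x2
    w = np.array([3.0, -2.0])    # weights w1, w2
    b = 1.0                      # bias w0
    print(perceptron_forward(x, w, b))   # sigmoid(-6) ~ 0.002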
Common Activation Functions
Importance of Activation Functions
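Without a nonlinear activation, any stack of layers collapses into a single linear function, so the activation is what lets the network represent nonlinear decision boundaries. Three activations that are standard choices here (the selection is an assumption, not read off the slide) can be sketched as:

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))   # output in (0, 1)

    def tanh(z):
        return np.tanh(z)                 # output in (-1, 1), zero-centered

    def relu(z):
        return np.maximum(0.0, z)         # zero for negative inputs, identity otherwise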
The Perceptron: Example
Building neural networks with Perceptrons
The Perceptron: simplified
Multi Output Perceptron
Because all inputs are densely connected to all outputs, these layers are called Dense layers
Single Layer Neural Network
Deep Neural Network
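A dense layer connects every input to every output through a weight matrix and a bias vector; a deep network is simply several such layers stacked with nonlinearities in between. A minimal sketch, assuming sigmoid activations and random, untrained parameters:

    import numpy as np

    rng = np.random.default_rng(0)

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def dense(x, W, b):
        # every input connected to every output: z = W x + b
        return W @ x + b

    # a 2 -> 3 -> 3 -> 1 architecture with random (untrained) parameters
    sizes = [2, 3, 3, 1]
    params = [(rng.standard_normal((n_out, n_in)), np.zeros(n_out))
              for n_in, n_out in zip(sizes[:-1], sizes[1:])]

    def forward(x):
        h = x
        for W, b in params:
            h = sigmoid(dense(h, W, b))   # hidden layers and output layer
        return h

    print(forward(np.array([4.0, 5.0])))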
Applying Neural Networks
Example

Will I pass this class?

Let's start with a simple two-feature model:

𝑥1 = Number of lectures you attend.

𝑥2 = Hours spent on the final project


Example problem: Will I pass this class?

[Figure: training data plotted with x1 = number of lectures you attend and x2 = hours spent on the final project as the two features.]
Quantifying Loss
The loss of our network measures the cost incurred from incorrect predictions
Empirical Loss
The empirical loss measures the total loss over our entire dataset
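Written out, the empirical loss is the average of the per-example loss over the n training pairs (x^(i), y^(i)); this is the standard formulation:

    J(W) = \frac{1}{n} \sum_{i=1}^{n} \mathcal{L}\left( f(x^{(i)}; W),\, y^{(i)} \right)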
Binary Cross Entropy Loss
Cross entropy loss can be used with models that output a probability between 0 and 1.
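In its standard form, with f(x^(i); W) the predicted probability and y^(i) ∈ {0, 1} the true label:

    J(W) = -\frac{1}{n} \sum_{i=1}^{n} \left[ y^{(i)} \log f(x^{(i)}; W) + \left(1 - y^{(i)}\right) \log\left(1 - f(x^{(i)}; W)\right) \right]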
Mean Squared Error (MSE) Loss
Mean squared error loss can be used with regression models that output continuous real numbers.
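Its standard form averages the squared difference between the true value and the prediction:

    J(W) = \frac{1}{n} \sum_{i=1}^{n} \left( y^{(i)} - f(x^{(i)}; W) \right)^{2}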
Training Neural Networks
Loss Optimization
We want to find the network weights that achieve the lowest loss
Gradient Descent
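In outline, gradient descent initializes the weights randomly, then repeatedly computes the gradient of the loss and takes a small step in the opposite direction, W ← W − η ∂J(W)/∂W. A minimal sketch, where compute_gradient is a hypothetical placeholder for whatever routine (e.g. backpropagation) returns ∂J/∂W:

    import numpy as np

    def gradient_descent(compute_gradient, n_weights, lr=0.01, n_steps=1000):
        # 1. initialize weights randomly
        W = np.random.normal(size=n_weights)
        # 2. loop until convergence (here, a fixed number of steps)
        for _ in range(n_steps):
            grad = compute_gradient(W)   # 3. compute the gradient dJ(W)/dW
            W = W - lr * grad            # 4. update weights against the gradient
        return W                         # 5. return the learned weights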
Computing gradients: Backpropagation

How does a small change in one weight (e.g. w2) affect the final loss J(W)?
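Backpropagation answers this with the chain rule. For a weight w2 that feeds the output ŷ directly, and a weight w1 deeper in the network that acts through a hidden activation z1 (notation assumed from the standard presentation):

    \frac{\partial J(W)}{\partial w_2} = \frac{\partial J(W)}{\partial \hat{y}} \cdot \frac{\partial \hat{y}}{\partial w_2}
    \qquad
    \frac{\partial J(W)}{\partial w_1} = \frac{\partial J(W)}{\partial \hat{y}} \cdot \frac{\partial \hat{y}}{\partial z_1} \cdot \frac{\partial z_1}{\partial w_1}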

Repeat this for every weight in the network using gradients from later layers
Neural Networks in Practice:
Optimization
Training Neural Networks is difficult

"Visualizing the Loss Landscape of Neural Nets", Hao Li et al., Dec 2017
Loss functions can be difficult to optimize
Setting the Learning Rate
How do we deal with this?

Idea 1:
Try lots of different learning rates and see what works "just right".

Idea 2:
Do something smarter!
Design an adaptive learning rate that adapts to the landscape.
Adaptive Learning Rates

• Learning rates are no longer fixed


• Can be made larger or smaller depending on:
• How large the gradient is
• How fast learning is happening
• Size of particular weights
• Etc...
Gradient Descent Algorithms
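As one concrete example of an adaptive scheme, the Adam update rescales each parameter's step by running estimates of the gradient's mean and (uncentered) variance. A minimal sketch with the commonly used default hyperparameters (the choice of Adam here is an assumption, not taken from the slide):

    import numpy as np

    def adam_step(W, grad, m, v, t, lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
        # running estimates of the gradient's mean and uncentered variance
        m = beta1 * m + (1 - beta1) * grad
        v = beta2 * v + (1 - beta2) * grad ** 2
        # bias correction, important during the first steps (t starts at 1)
        m_hat = m / (1 - beta1 ** t)
        v_hat = v / (1 - beta2 ** t)
        # per-parameter step: larger where gradients have been small and steady
        W = W - lr * m_hat / (np.sqrt(v_hat) + eps)
        return W, m, v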
Neural Networks in Practice:
Mini-batches
Mini-batches while training

More accurate estimation of gradient


Smoother convergence

Allows for larger learning rates



Mini-batches lead to faster training


Can parallelize computation and achieve significant speed increases on GPUs
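A sketch of what mini-batching typically looks like in a training loop: shuffle the data each epoch, slice out a batch, and update the weights with the gradient averaged over that batch (the batch size and the compute_gradient helper are assumptions):

    import numpy as np

    def train_minibatch_sgd(X, y, W, compute_gradient, lr=0.1,
                            batch_size=32, n_epochs=10):
        n = X.shape[0]
        for _ in range(n_epochs):
            order = np.random.permutation(n)          # shuffle once per epoch
            for start in range(0, n, batch_size):
                idx = order[start:start + batch_size]
                # gradient averaged over the mini-batch, not a single example
                grad = compute_gradient(W, X[idx], y[idx])
                W = W - lr * grad
        return W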
Neural Networks in Practice:
Overfitting
Regularization

What is it?
Technique that constrains our optimization problem to discourage complex models

Why?
To improve the generalization of our model on unseen data
Regularization 1: Dropout
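Dropout randomly zeroes a fraction of a layer's activations at each training step, so the network cannot rely too heavily on any single unit; at test time all units are kept. A minimal sketch of the inverted-dropout variant (the 0.5 rate is a typical default, assumed here):

    import numpy as np

    def dropout(h, rate=0.5, training=True):
        if not training:
            return h                       # keep every unit at test time
        # keep each activation with probability (1 - rate), zero the rest
        mask = (np.random.rand(*h.shape) >= rate).astype(h.dtype)
        # rescale so the expected activation magnitude is unchanged
        return h * mask / (1.0 - rate)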
Regularization 2: Early Stopping
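Early stopping monitors the loss on a held-out validation set and stops training once it stops improving, keeping the weights from the best epoch. A minimal sketch; train_one_epoch, validation_loss, and the patience value are hypothetical names chosen for illustration:

    import copy

    def fit_with_early_stopping(model, train_one_epoch, validation_loss,
                                max_epochs=100, patience=5):
        best_loss = float("inf")
        best_model = copy.deepcopy(model)
        stale_epochs = 0
        for _ in range(max_epochs):
            train_one_epoch(model)
            val_loss = validation_loss(model)
            if val_loss < best_loss:
                best_loss, best_model = val_loss, copy.deepcopy(model)
                stale_epochs = 0
            else:
                stale_epochs += 1
                if stale_epochs >= patience:
                    break                  # validation loss stopped improving
        return best_model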
Summary: The Core Foundations
