Module 1: Lesson 3

Hyperparameter Optimization (HPO)


Outline
▶ Hyperparameter Optimization (HPO): why, what, and how?

▶ Which hyperparameters?

▶ Optimization algorithms

Hyperparameter Optimization (HPO)
▶ So far, we have used neural networks in a brute-force fashion: by trial and error, we tried different
combinations of settings (e.g., number of hidden layers, learning rate, optimizer) and kept whichever
performed best.
▶ This is quite inefficient, however, because it requires spending a lot of time manually searching for
the best choices for a model.
▶ Solution: Hyperparameter Optimization (HPO)

Hyperparameter Optimization (HPO)
▶ Solution: Hyperparameter Optimization (HPO)

- Hyperparameters are parameters that are not learned during training; they must be set before the model is trained.
- Hyperparameters can be structure-related (e.g., the number of hidden layers) or training-related
(e.g., the learning rate or minibatch size)
- HPO is the final step of model design and the first step of training a neural network

▶ Purpose of HPO:
1. Reduce costly menial work by the researcher
2. Improve accuracy and efficiency of training
3. Make the choice of hyperparameters more convincing and reproducible

▶ How?
- We’ll be using a series of optimization techniques
- These can be classified as search algorithms (e.g., grid search) and trial algorithms (e.g., curve
fitting).
Which hyperparameters?
Given that HPO consumes considerable computational resources, it is important to understand which
hyperparameters deserve priority in tuning. Here, we review the major hyperparameters based on previous
researchers' experience; a short code sketch after the lists shows how the design choices enter a model in practice.

▶ Training-related hyperparameters:

- Learning rate (constant, linear/exponential decay; the schedules are written out after this list)
- Optimizer (Mini-batch SGD, RMSprop, Adam)
- ...
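
Written out (a common parameterization, not taken from the slides), with initial rate \eta_0, training step t, horizon T, and decay constant k, the decay options above are:

\eta_t = \eta_0 \quad (\text{constant}), \qquad
\eta_t = \eta_0\left(1 - \frac{t}{T}\right) \quad (\text{linear decay}), \qquad
\eta_t = \eta_0\, e^{-kt} \quad (\text{exponential decay})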

▶ Model-design hyperparameters:

- Number of hidden layers
- Width (number of neurons) of hidden layers
- Regularization in cost function (ℓ1 vs. ℓ2 norms)
- Dropout (which rate?)
- Activation function (ReLU, Sigmoid, Tanh?)
- ...
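
To make these concrete, here is a minimal sketch (our illustration, not code from the lesson) of a Keras model-building function that exposes the design hyperparameters above as arguments; the input width and default values are assumptions:

import tensorflow as tf

def build_model(n_hidden=2, width=64, l2_strength=1e-4,
                dropout_rate=0.2, activation="relu", learning_rate=1e-3):
    # Feed-forward network whose design choices are all hyperparameters.
    model = tf.keras.Sequential()
    model.add(tf.keras.Input(shape=(10,)))  # 10 input features (placeholder)
    for _ in range(n_hidden):  # number of hidden layers
        model.add(tf.keras.layers.Dense(
            width, activation=activation,  # width and activation function
            kernel_regularizer=tf.keras.regularizers.l2(l2_strength)))  # l2 penalty
        model.add(tf.keras.layers.Dropout(dropout_rate))  # dropout rate
    model.add(tf.keras.layers.Dense(1))  # single regression output
    model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate), loss="mse")
    return model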
Optimization algorithms
- Mathematically, HPO consists of finding the set of hyperparameters that achieves the minimum loss (or
maximum accuracy) of a network/model; this is written out formally below.
- Computationally, HPO is a complex problem tackled by state-of-the-art algorithms of mainly two types:
search and trial.
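
Written out formally (a standard formulation; the notation is ours): if A_\lambda denotes the model trained with hyperparameter configuration \lambda from a search space \Lambda, and \mathcal{L} is the loss on held-out validation data, HPO solves

\lambda^{*} = \operatorname*{arg\,min}_{\lambda \in \Lambda} \; \mathcal{L}\bigl(A_{\lambda}(D_{\mathrm{train}}),\, D_{\mathrm{valid}}\bigr)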

▶ Search algorithms

- Grid search: straightforward, but computationally costly (remember the curse of dimensionality!)
- Random search: like grid search, but samples configurations at random (less time-consuming); a sketch of both follows this list
- Bayesian optimization: selects the next hyperparameter configuration based on the results of previous trials
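
To illustrate the first two in code, here is a minimal sketch (ours; the two-parameter search space and the toy loss function stand in for a real train-and-validate step):

import itertools, random

# Hypothetical search space over two hyperparameters (values are illustrative).
space = {
    "learning_rate": [1e-2, 1e-3, 1e-4],
    "width": [32, 64, 128],
}

def validation_loss(config):
    # Stand-in for training a model with `config` and returning its validation loss.
    return (config["learning_rate"] - 1e-3) ** 2 + (config["width"] - 64) ** 2

# Grid search: evaluate every combination (3 x 3 = 9 trials here).
grid = [dict(zip(space, values)) for values in itertools.product(*space.values())]
best_grid = min(grid, key=validation_loss)

# Random search: evaluate only a fixed budget of sampled combinations.
random.seed(0)
sampled = [{name: random.choice(values) for name, values in space.items()}
           for _ in range(5)]
best_random = min(sampled, key=validation_loss)

print(best_grid, best_random)

Note how random search caps the number of trials regardless of how many hyperparameters the space has, which is why it scales better than the full grid.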

▶ (Trial) Early-stopping algorithms

- Curve fitting: the LPA algorithm (Learn, Predict, Assess)
- Successive Halving (SHA) and Hyperband (HB): a random-search sampling method combined with a
bandit-based early-stopping policy (a toy sketch follows this list)
- Extensions: Asynchronous Successive Halving (ASHA) and Bayesian Optimization Hyperband (BOHB)
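
To make the early-stopping idea concrete, here is a toy sketch of Successive Halving (ours; partial_train_loss is a synthetic stand-in for training a configuration on a small budget):

import random

def partial_train_loss(config, budget):
    # Stand-in for training `config` for `budget` epochs; lower is better.
    return config["lr_error"] / budget + random.random() * 0.01

def successive_halving(configs, eta=2, initial_budget=1):
    # Evaluate all configs on a small budget, keep the best 1/eta fraction,
    # multiply the budget by eta, and repeat until one config survives.
    budget = initial_budget
    while len(configs) > 1:
        scored = sorted(configs, key=lambda c: partial_train_loss(c, budget))
        configs = scored[: max(1, len(configs) // eta)]  # early-stop the rest
        budget *= eta
    return configs[0]

random.seed(0)
candidates = [{"lr_error": random.random()} for _ in range(8)]  # random sampling
print(successive_halving(candidates))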
Summary of Lesson 3
In Lesson 3, we have looked at:

▶ What is HPO, why we need it, and the basics of how to perform it
▶ Main hyperparameters we can tune
▶ Basic review of algorithms for HPO

⇒ References for this lesson:


Yu, Tong, and Hong Zhu. "Hyperparameter Optimization: A Review of Algorithms and Applications." arXiv,
2020, https://arxiv.org/abs/2003.05689.

TO-DO NEXT: There is no associated Jupyter Notebook for this lesson, but it is extremely important that
you go over the required readings for this lesson thoroughly.

In the next lesson, we will return once more to our stock-timing example to see how a regression model
based on a fairly simple neural network can benefit from hyperparameter tuning with the Keras Tuner; a preview sketch follows.
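
As a preview, here is a minimal sketch of that workflow using the public Keras Tuner API (the search space, the data names x_train/y_train/x_val/y_val, and the trial budget are illustrative assumptions, not the next lesson's actual code):

import keras_tuner as kt
import tensorflow as tf

def build_model(hp):
    # Keras Tuner calls this with a HyperParameters object `hp`.
    model = tf.keras.Sequential([
        tf.keras.layers.Dense(
            hp.Int("units", min_value=32, max_value=128, step=32),  # layer width
            activation=hp.Choice("activation", ["relu", "tanh"])),
        tf.keras.layers.Dense(1),  # single regression output
    ])
    model.compile(
        optimizer=tf.keras.optimizers.Adam(
            hp.Choice("learning_rate", [1e-2, 1e-3, 1e-4])),
        loss="mse")
    return model

tuner = kt.RandomSearch(build_model, objective="val_loss", max_trials=10)
# tuner.search(x_train, y_train, validation_data=(x_val, y_val), epochs=20)
# best_hp = tuner.get_best_hyperparameters(num_trials=1)[0]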
