Lecture 7: Linear Regression

The document provides an overview of linear regression, including its application in predicting prices, such as gold and housing prices. It discusses the concept of cost functions, specifically L1 and L2 cost functions, and methods for optimizing these functions, such as gradient descent. Additionally, it touches on feature scaling and non-linear cases, suggesting the use of feature mapping to handle non-linear relationships.


Data Science Boot Camp
Sibt ul Hussain
http://sites.google.com/SibtulHussain

Linear Regression

The majority of the content is borrowed from multiple online resources.

Predict Gold Prices Over the Next Day

[Line chart: gold price (y-axis, 35,000–43,000) against day number (x-axis, 0–200)]
Housing Prices

[Scatter plot: Price in 1000s of dollars on the y-axis vs. Size in feet² (500–3000) on the x-axis]

Regression Problem
Predict real-valued output

Training set of housing prices:

Size in feet² (x)    Price ($) in 1000s (y)
2104                 460
1416                 232
1534                 315
852                  178
…                    …

Notation:
m = number of training examples
x's = "input" variable / features
y's = "output" variable / "target" variable

Training Set:

Size in feet² (x)    Price ($) in 1000s (y)
2104                 460
1416                 232
1534                 315
852                  178
…                    …

Hypothesis: $h_\theta(x) = \theta_0 + \theta_1 x$

$\theta_0, \theta_1$: Parameters

How to choose the $\theta$'s?

[Three example plots of $h_\theta(x)$ for different choices of $\theta_0, \theta_1$; both axes run 0–3 in each panel]

Idea: Choose $\theta_0, \theta_1$ so that $h_\theta(x)$ is close to $y$ for our training examples $(x, y)$.

Score Function (or Hypothesis)

• The score function, or hypothesis, is used to generate the output for a given input x. In linear regression our hypothesis is a linear function of the input: $h_\theta(x) = \theta_0 + \theta_1 x$.

Cost Function
• The cost function is used to evaluate our hypothesis, i.e., how good our chosen hypothesis is.
▫ For instance, in the case of linear regression the cost function can be the L1 or L2 cost defined on the next slides.

• The goal of learning thus reduces to searching the hypothesis space for the best possible hypothesis (a hypothesis that optimizes our cost function).
• In other words, the cost function specifies the purpose of the learning algorithm.

Hypothesis: $h_\theta(x) = \theta_0 + \theta_1 x$

Parameters: $\theta_0, \theta_1$

Cost Function: $J(\theta_0, \theta_1) = \frac{1}{2m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right)^2$

Goal: $\min_{\theta_0, \theta_1} J(\theta_0, \theta_1)$
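The slides contain no code; as an illustration, here is a minimal NumPy sketch of this cost function, using the four training examples from the table above (the 1/(2m) scaling matches the definition above):

```python
import numpy as np

def cost_l2(theta0, theta1, x, y):
    """J(theta0, theta1) = (1 / 2m) * sum((h_theta(x_i) - y_i)^2)."""
    m = len(y)
    residuals = theta0 + theta1 * x - y   # h_theta(x) - y for every example
    return (residuals ** 2).sum() / (2 * m)

# Training set from the slide: size in feet^2 vs. price in $1000s.
x = np.array([2104.0, 1416.0, 1534.0, 852.0])
y = np.array([460.0, 232.0, 315.0, 178.0])
print(cost_l2(0.0, 0.2, x, y))   # cost of the hypothesis h(x) = 0.2 * x
```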

Possible Cost Functions for LR

• Absolute (or L1) Cost Function: $J(\theta) = \frac{1}{m} \sum_{i=1}^{m} \left| h_\theta(x^{(i)}) - y^{(i)} \right|$

• Properties:
▫ Penalty for positive and negative deviations is the same.
▫ Penalty grows only linearly with the deviation: the marginal penalty is the same whether the error is small or large.
▫ Difficult to differentiate (non-differentiable at zero).
▫ Convex

Possible Cost Functions for LR

• L2 Cost Function: $J(\theta) = \frac{1}{2m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right)^2$

• Properties:
▫ Penalty for positive and negative deviations is the same.
▫ Penalty for large deviations is large compared to small deviations (quadratic growth).
▫ Easy to differentiate.
▫ Convex
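To make the contrast concrete, this small sketch (not from the slides) compares how the two costs penalize individual deviations of different sizes:

```python
import numpy as np

deviations = np.array([0.1, 1.0, 10.0])
print(np.abs(deviations))   # L1 penalty grows linearly:      [ 0.1   1.   10. ]
print(deviations ** 2)      # L2 penalty grows quadratically: [ 0.01  1.  100. ]
```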

L2 Cost Function

How to Optimize Cost Function

• Random Search
• Define a finite interval for the values of $\theta$.
• Iterate over the interval:
▫ Evaluate the cost function, $J(\theta)$.
▫ Cache the results.
• Choose the values of the parameters that give the optimum value of the cost function (see the sketch below).
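A minimal sketch of this random-search procedure, assuming the L2 cost and a single input feature; the interval [-1, 1] and the trial count are arbitrary choices for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)

def cost_l2(theta, x, y):
    """L2 cost for the hypothesis h(x) = theta[0] + theta[1] * x."""
    residuals = theta[0] + theta[1] * x - y
    return (residuals ** 2).sum() / (2 * len(y))

def random_search(x, y, n_trials=10_000, low=-1.0, high=1.0):
    """Sample random parameter pairs in [low, high] and keep the best one."""
    best_theta, best_cost = None, np.inf
    for _ in range(n_trials):
        theta = rng.uniform(low, high, size=2)   # a random (theta0, theta1)
        c = cost_l2(theta, x, y)                 # evaluate the cost function
        if c < best_cost:                        # cache the best result so far
            best_theta, best_cost = theta, c
    return best_theta, best_cost
```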

How to Optimize Cost Function

• Derivation Approach:
▫ Compute the derivative of J w.r.t. each variable.
▫ Set all the derivatives equal to zero, i.e. $\frac{\partial J}{\partial \theta_j} = 0$ for all $j$, and solve the resulting system of linear equations to obtain the optimum values of the parameters.
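For the L2 cost this system has a closed-form solution, the normal equations $X^\top X \theta = X^\top y$; a minimal NumPy sketch using the housing data from the table above:

```python
import numpy as np

x = np.array([2104.0, 1416.0, 1534.0, 852.0])
y = np.array([460.0, 232.0, 315.0, 178.0])

# Prepend a column of ones so theta[0] plays the role of the intercept.
X = np.column_stack([np.ones_like(x), x])

# Setting dJ/dtheta_j = 0 for the L2 cost yields X^T X theta = X^T y.
theta = np.linalg.solve(X.T @ X, X.T @ y)
print(theta)   # [theta0, theta1]
```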

How to Optimize Cost Function

• Gradient Descent

How?

How to Optimize Cost Function

• Gradient Descent: repeat until convergence
$\theta_j := \theta_j - \alpha \frac{\partial}{\partial \theta_j} J(\theta)$ (simultaneously for all $j$), where $\alpha$ is the learning rate.
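A minimal sketch of batch gradient descent on the L2 cost for a single feature; the learning rate and iteration count are illustrative, and the feature is scaled first (as recommended later in the deck) so that alpha = 0.1 converges:

```python
import numpy as np

def gradient_descent(x, y, alpha=0.1, n_iters=1000):
    """Batch gradient descent on the L2 cost for h(x) = theta0 + theta1 * x."""
    m = len(y)
    theta0, theta1 = 0.0, 0.0
    for _ in range(n_iters):
        err = theta0 + theta1 * x - y        # h_theta(x) - y for all examples
        grad0 = err.sum() / m                # dJ/dtheta0
        grad1 = (err * x).sum() / m          # dJ/dtheta1
        theta0 -= alpha * grad0              # simultaneous update of
        theta1 -= alpha * grad1              # both parameters
    return theta0, theta1

x = np.array([2104.0, 1416.0, 1534.0, 852.0])
y = np.array([460.0, 232.0, 315.0, 178.0])
x = (x - x.mean()) / x.std()                 # scale the feature first
print(gradient_descent(x, y))
```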

Using Multiple Input Features
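The body of these slides did not survive extraction; the standard extension is a hypothesis that is linear in several features, $h_\theta(x) = \theta_0 + \theta_1 x_1 + \dots + \theta_n x_n = \theta^\top x$ (with $x_0 = 1$). A vectorized sketch, where the second feature (number of bedrooms) is a hypothetical addition for illustration:

```python
import numpy as np

def gradient_descent_multi(X, y, alpha=0.1, n_iters=1000):
    """Vectorized gradient descent; X must carry a leading column of 1s."""
    m, n = X.shape
    theta = np.zeros(n)
    for _ in range(n_iters):
        grad = X.T @ (X @ theta - y) / m     # gradient of the L2 cost
        theta -= alpha * grad                # update all thetas at once
    return theta

# Housing data with a hypothetical bedrooms feature added for illustration.
X = np.array([[1.0, 2104.0, 5.0],
              [1.0, 1416.0, 3.0],
              [1.0, 1534.0, 3.0],
              [1.0,  852.0, 2.0]])
y = np.array([460.0, 232.0, 315.0, 178.0])
X[:, 1:] = (X[:, 1:] - X[:, 1:].mean(axis=0)) / X[:, 1:].std(axis=0)  # scale
print(gradient_descent_multi(X, y))
```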

Remember: Debugging Trick

• Always plot your cost function J(θ) against the number of iterations inside your gradient-descent loop. If gradient descent is working, J(θ) should decrease after every iteration.
• If the value of J(θ) is increasing, you probably need a smaller α: you are overshooting the minimum.
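A minimal sketch of this diagnostic, recording J(θ) at every iteration and plotting it against the iteration count (matplotlib assumed available):

```python
import numpy as np
import matplotlib.pyplot as plt

def gd_with_history(X, y, alpha, n_iters=200):
    """Gradient descent that records J(theta) at every iteration."""
    m, n = X.shape
    theta, history = np.zeros(n), []
    for _ in range(n_iters):
        err = X @ theta - y
        history.append((err ** 2).sum() / (2 * m))   # J(theta) this iteration
        theta -= alpha * (X.T @ err) / m
    return theta, history

x = np.array([2104.0, 1416.0, 1534.0, 852.0])
X = np.column_stack([np.ones_like(x), (x - x.mean()) / x.std()])
y = np.array([460.0, 232.0, 315.0, 178.0])

_, history = gd_with_history(X, y, alpha=0.1)
plt.plot(history)                 # should decrease after every iteration
plt.xlabel("iteration")
plt.ylabel("J(theta)")
plt.show()
```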

Feature Scaling

Always remember: feature scaling can make your convergence faster.
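The slides do not say which scaling scheme is used; a common choice is standardization (mean normalization), sketched below. The bedrooms column is again a hypothetical example feature:

```python
import numpy as np

def standardize(X):
    """Scale each feature: x_j := (x_j - mu_j) / sigma_j."""
    mu, sigma = X.mean(axis=0), X.std(axis=0)
    return (X - mu) / sigma, mu, sigma    # keep mu/sigma to scale new inputs

X = np.array([[2104.0, 5.0],
              [1416.0, 3.0],
              [1534.0, 3.0],
              [ 852.0, 2.0]])
X_scaled, mu, sigma = standardize(X)
print(X_scaled.mean(axis=0))   # ~[0, 0] after scaling
print(X_scaled.std(axis=0))    # ~[1, 1] after scaling
```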

What about Non-Linear Cases ?



Feature Mapping (A Simple Trick)

• We will map our features to higher dimensions using a simple trick.
• For example, you are given only the feature x, but you can expand it by including higher-order polynomials of x, i.e. $x, x^2, x^3, \ldots$ (see the sketch below).
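A minimal sketch of this mapping for a single feature:

```python
import numpy as np

def poly_features(x, degree):
    """Map a single feature x to the columns [x, x^2, ..., x^degree]."""
    return np.column_stack([x ** d for d in range(1, degree + 1)])

x = np.array([1.0, 2.0, 3.0])
print(poly_features(x, 3))
# [[ 1.  1.  1.]
#  [ 2.  4.  8.]
#  [ 3.  9. 27.]]
```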

Different Mappings can be used

Non-Linear Case
• Algorithm (an end-to-end sketch follows below):
▫ Expand each feature to include the non-linear mapping.
▫ Learn the set of parameters using gradient descent.

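An end-to-end sketch of this algorithm on toy data (the quadratic ground truth and all hyperparameters are illustrative assumptions): expand the feature, scale the expanded features, then run gradient descent.

```python
import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(-3.0, 3.0, 50)
y = 1.0 + 2.0 * x - 0.5 * x ** 2 + rng.normal(0.0, 0.3, size=x.shape)

# Step 1: expand the single feature with a polynomial mapping (x, x^2, x^3).
P = np.column_stack([x ** d for d in range(1, 4)])

# Step 2: scale the expanded features so gradient descent converges quickly.
P = (P - P.mean(axis=0)) / P.std(axis=0)
X = np.column_stack([np.ones_like(x), P])    # add the intercept column

# Step 3: learn the parameters with gradient descent on the L2 cost.
theta = np.zeros(X.shape[1])
for _ in range(2000):
    theta -= 0.1 * X.T @ (X @ theta - y) / len(y)
print(theta)   # parameters of the hypothesis, now non-linear in x
```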