Lecture 05 - Linear Regression
Statistics
Lecture 5: Linear Regression
Recap
normal distribution
• arises when adding more and more discrete events
– example: measuring a physical quantity n times
• has a mean, median and mode
• symmetric: 50% of the values are higher than the mean and 50% are lower
https://www.mathsisfun.com/data/standard-normal-distribution.html
N(x) = \frac{1}{\sigma \sqrt{2\pi}} \, e^{-\frac{(x - \mu)^2}{2\sigma^2}}
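As a quick, minimal sketch (not from the original slides), the density above can be evaluated directly in Python; the function name normal_pdf and the example values are illustrative only.

```python
import numpy as np

def normal_pdf(x, mu=0.0, sigma=1.0):
    # N(x) = 1 / (sigma * sqrt(2*pi)) * exp(-(x - mu)^2 / (2*sigma^2))
    return np.exp(-((x - mu) ** 2) / (2.0 * sigma ** 2)) / (sigma * np.sqrt(2.0 * np.pi))

# Example: the standard normal density peaks at x = mu with value ~0.3989
print(normal_pdf(0.0))
print(normal_pdf(1.0, mu=0.0, sigma=2.0))
```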
The Standard Normal distribution
shifting and rescaling the normal distribution so that the mean = 0 and the standard deviation = 1
• Standardize normal distribution:
– subtract the mean
– divide by the standard deviation
• Standardize by z
z = \frac{x - \mu}{\sigma}
https://www.mathsisfun.com/data/standard-normal-distribution.html
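As a small sketch (not part of the slides), standardizing a hypothetical set of measurements with the z-score above; the sample values are made up for illustration.

```python
import numpy as np

# Hypothetical repeated measurements of a physical quantity (made-up values)
x = np.array([9.8, 10.1, 10.4, 9.9, 10.3])

# Standardize: subtract the mean, divide by the standard deviation
z = (x - x.mean()) / x.std()

print(z)                  # z-score of each measurement
print(z.mean(), z.std())  # approximately 0 and 1 after standardizing
```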
The Standard Normal distribution
In more detail
The central limit theorem
The CLT http://www.value-at-risk.net/central-limit-theorem/
Linear Regression
• Goal:
– error reduction
– predicting/forecasting
– calibration
y = ax + b (figure: Wikipedia)
Linear Regression
Finding linear trends in data – how to
• plotting/fitting a line to data that are linearly related:
y = ax + b
• most common method:
– least squares: minimizing the sum of the squared differences between the fitted line and the actual values
R^2 = \sum_i \left[ y_i - f(x_i, a_1, a_2, \ldots, a_n) \right]^2, \qquad \frac{\partial R^2}{\partial a_i} = 0
http://onlinestatbook.com/2/regression/intro.html
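To illustrate the criterion above, here is a minimal sketch (made-up data, not from the slides) that minimizes the sum of squared residuals numerically with scipy.optimize.minimize; the closed-form solution on the next slide gives the same answer.

```python
import numpy as np
from scipy.optimize import minimize

# Hypothetical, roughly linear data (made-up values)
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])

# Sum of squared residuals for the straight-line model f(x) = a*x + b
def rss(params):
    a, b = params
    return np.sum((y - (a * x + b)) ** 2)

# Minimizing numerically is equivalent to setting the partial derivatives to zero
result = minimize(rss, x0=[0.0, 0.0])
a_fit, b_fit = result.x
print(a_fit, b_fit)
```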
Linear Regression
Least squares
• After a bit of math, the equations for a least-squares straight-line fit are:
– slope: a = \frac{\sum_i (x_i - \bar{x})(y_i - \bar{y})}{\sum_i (x_i - \bar{x})^2}
– intercept: b = \bar{y} - a \bar{x}
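A minimal sketch (made-up data, not from the slides) applying the closed-form slope and intercept above, with numpy.polyfit as an independent cross-check.

```python
import numpy as np

# Hypothetical, roughly linear data (made-up values)
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])

# Closed-form least-squares solution from the slide
a = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
b = y.mean() - a * x.mean()
print(f"slope a = {a:.3f}, intercept b = {b:.3f}")

# Cross-check with numpy's built-in degree-1 polynomial fit: returns [slope, intercept]
print(np.polyfit(x, y, 1))
```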
Linear Regression
Least Squares
• coefficient of determination: R^2 gives an idea of how good the fit is (see the numerical sketch after the bullets below):
R^2 = \frac{SS_{\mathrm{regression}}}{SS_{\mathrm{total}}} = \left( \frac{1}{n} \, \frac{\sum_i (x_i - \bar{x})(y_i - \bar{y})}{\sigma_x \sigma_y} \right)^2
• values range: 0 ≤ R^2 ≤ 1
• R^2 = 0: the dependent variable cannot be predicted by the model
• R^2 = 1: the dependent variable can be predicted without error
• R^2 between 0 and 1 indicates to what extent the dependent variable can be predicted
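A short numerical sketch (same made-up data as above, not from the slides) computing R^2 both from the correlation form on this slide and from 1 − SS_residual/SS_total; for a least-squares line the two agree.

```python
import numpy as np

# Hypothetical data and the least-squares fit from the previous sketch
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])
a = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
b = y.mean() - a * x.mean()

# R^2 via the correlation form on the slide (np.std gives the population sigma)
r = np.sum((x - x.mean()) * (y - y.mean())) / (len(x) * x.std() * y.std())
r_squared = r ** 2

# Cross-check: R^2 = 1 - SS_residual / SS_total
ss_res = np.sum((y - (a * x + b)) ** 2)
ss_tot = np.sum((y - y.mean()) ** 2)
print(r_squared, 1.0 - ss_res / ss_tot)  # both close to 1 for these data
```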
Linear Regression
Least Squares – some things to note
• Excel can calculate a trendline for you, BUT make sure the data are plotted as a scatter (XY) plot!
Linear Regression
Example
The sales of a company (in million dollars) for each year are shown in the
table below.