0% found this document useful (0 votes)

39 views6 pages

Introduction To Linear Regression

This document introduces linear regression analysis. It defines linear regression as predicting scores on a criterion variable (Y) from a predictor variable (X) using a best-fitting straight line. The best-fitting line, called the regression line, minimizes the sum of the squared errors of prediction between the data points and the line. The formula for a simple linear regression line is Y' = bX + A, where b is the slope and A is the Y-intercept. The document provides an example of predicting university GPA from high school GPA and calculates the regression equation.

Uploaded by

gmujtaba

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

Download as docx, pdf, or txt

0% found this document useful (0 votes)

39 views6 pages

Introduction To Linear Regression

Uploaded by

gmujtaba

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

Download as docx, pdf, or txt

You are on page 1/ 6

Introduction to Linear Regression

Author(s)

David M. Lane

Prerequisites

Measures of Variability, Describing Bivariate Data

Learning Objectives

1. Define linear regression

2. Identify errors of prediction in a scatter plot with a regression line

In simple linear regression, we predict scores on one variable from the scores on a second
variable. The variable we are predicting is called the criterion variable and is referred to as Y.
The variable we are basing our predictions on is called the predictor variable and is referred to as
X. When there is only one predictor variable, the prediction method is called simple regression.
In simple linear regression, the topic of this section, the predictions of Y when plotted as a
function of X form a straight line.

The example data in Table 1 are plotted in Figure 1. You can see that there is a positive
relationship between X and Y. If you were going to predict Y from X, the higher the value of X,
the higher your prediction of Y.

Table 1. Example data.

X Y
1.00 1.00
2.00 2.00
3.00 1.30
4.00 3.75
5.00 2.25
Figure 1. A scatter plot of the example data.

Linear regression consists of finding the best-fitting straight line through the points. The best-
fitting line is called a regression line. The black diagonal line in Figure 2 is the regression line
and consists of the predicted score on Y for each possible value of X. The vertical lines from the
points to the regression line represent the errors of prediction. As you can see, the red point is
very near the regression line; its error of prediction is small. By contrast, the yellow point is
much higher than the regression line and therefore its error of prediction is large.
Figure 2. A scatter plot of the example data. The black line consists of the predictions, the points
are the actual data, and the vertical lines between the points and the black line represent errors of
prediction.

The error of prediction for a point is the value of the point minus the predicted value (the value
on the line). Table 2 shows the predicted values (Y') and the errors of prediction (Y-Y'). For
example, the first point has a Y of 1.00 and a predicted Y (called Y') of 1.21. Therefore, its error
of prediction is -0.21.

Table 2. Example data.

X Y Y' Y-Y' (Y-Y')2

1.00 1.00 1.210 -0.210 0.044
2.00 2.00 1.635 0.365 0.133
3.00 1.30 2.060 -0.760 0.578
4.00 3.75 2.485 1.265 1.600
5.00 2.25 2.910 -0.660 0.436

You may have noticed that we did not specify what is meant by "best-fitting line." By far, the
most commonly-used criterion for the best-fitting line is the line that minimizes the sum of the
squared errors of prediction. That is the criterion that was used to find the line in Figure 2. The
last column in Table 2 shows the squared errors of prediction. The sum of the squared errors of
prediction shown in Table 2 is lower than it would be for any other regression line.
The formula for a regression line is

Y' = bX + A

where Y' is the predicted score, b is the slope of the line, and A is the Y intercept. The equation
for the line in Figure 2 is

Y' = 0.425X + 0.785

For X = 1,

Y' = (0.425)(1) + 0.785 = 1.21.

For X = 2,

Y' = (0.425)(2) + 0.785 = 1.64.

Computing the Regression Line

In the age of computers, the regression line is typically computed with statistical software.
However, the calculations are relatively easy, and are given here for anyone who is interested.
The calculations are based on the statistics shown in Table 3. MX is the mean of X, MY is the
mean of Y, sX is the standard deviation of X, sY is the standard deviation of Y, and r is the
correlation between X and Y.

Formula for standard deviation

Formula for correlation

Table 3. Statistics for computing the regression line.

MX MY sX sY r
3 2.06 1.581 1.072 0.627

The slope (b) can be calculated as follows:

b = r sY/sX

and the intercept (A) can be calculated as

A = MY - bMX.

For these data,

b = (0.627)(1.072)/1.581 = 0.425

A = 2.06 - (0.425)(3) = 0.785

Note that the calculations have all been shown in terms of sample statistics rather than
population parameters. The formulas are the same; simply use the parameter values for means,
standard deviations, and the correlation.

Standardized Variables

The regression equation is simpler if variables are standardized so that their means are equal to 0
and standard deviations are equal to 1, for then b = r and A = 0. This makes the regression line:

ZY' = (r)(ZX)

where ZY' is the predicted standard score for Y, r is the correlation, and ZX is the standardized
score for X. Note that the slope of the regression equation for standardized variables is r.

A Real Example

The case study "SAT and College GPA" contains high school and university grades for 105
computer science majors at a local state school. We now consider how we could predict a
student's university GPA if we knew his or her high school GPA.

Figure 3 shows a scatter plot of University GPA as a function of High School GPA. You can see
from the figure that there is a strong positive relationship. The correlation is 0.78. The regression
equation is

University GPA' = (0.675)(High School GPA) + 1.097

Therefore, a student with a high school GPA of 3 would be predicted to have a university GPA
of

University GPA' = (0.675)(3) + 1.097 = 3.12.

Figure 3. University GPA as a

function of High School GPA.

Assumptions

It may surprise you, but the

calculations shown in this section
are assumption-free. Of course, if
the relationship between X and Y
were not linear, a different shaped
function could fit the data better.
Inferential statistics in regression are based on several assumptions, and these assumptions are
presented in a later section of this chapter.

Atv DVWK M - 368e
No ratings yet
Atv DVWK M - 368e
36 pages
Unit 9 Simple Linear Regression: Structure
No ratings yet
Unit 9 Simple Linear Regression: Structure
22 pages
ZXSDR R8852E Product Description
No ratings yet
ZXSDR R8852E Product Description
19 pages
Introduction To Linear Regression
No ratings yet
Introduction To Linear Regression
6 pages
Unit 3 Notes
100% (1)
Unit 3 Notes
32 pages
Standard Error of The Estimate
No ratings yet
Standard Error of The Estimate
3 pages
Tugas Ridho 2.100-2.102
No ratings yet
Tugas Ridho 2.100-2.102
9 pages
Unit Regression Analysis: Objectives
No ratings yet
Unit Regression Analysis: Objectives
18 pages
R-programming - Unit 5
No ratings yet
R-programming - Unit 5
43 pages
Regression and Correlation
No ratings yet
Regression and Correlation
14 pages
Simple Linear Regression and Multiple Linear Regression: MAST 6474 Introduction To Data Analysis I
No ratings yet
Simple Linear Regression and Multiple Linear Regression: MAST 6474 Introduction To Data Analysis I
15 pages
Topic 6 Mte3105
No ratings yet
Topic 6 Mte3105
9 pages
Unit 3 notes
No ratings yet
Unit 3 notes
35 pages
Multiple Regression
No ratings yet
Multiple Regression
4 pages
Linear Regression
100% (1)
Linear Regression
56 pages
Linear Regression Analysis: BX A y
No ratings yet
Linear Regression Analysis: BX A y
6 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
20 pages
Functions and Applications
No ratings yet
Functions and Applications
30 pages
Linear Regression
No ratings yet
Linear Regression
19 pages
Linear Regression
No ratings yet
Linear Regression
15 pages
Simple Regression Model: Erbil Technology Institute
No ratings yet
Simple Regression Model: Erbil Technology Institute
9 pages
STATS 4
No ratings yet
STATS 4
23 pages
Linear Regression
No ratings yet
Linear Regression
15 pages
Halifean Rentap 2020836444 Tutorial 2
No ratings yet
Halifean Rentap 2020836444 Tutorial 2
9 pages
LINEST Function
No ratings yet
LINEST Function
8 pages
Analysis of Experimental Data
No ratings yet
Analysis of Experimental Data
9 pages
OpenStax Chapter 12 Power Point
No ratings yet
OpenStax Chapter 12 Power Point
81 pages
06 Simple Linear Regression Part1
No ratings yet
06 Simple Linear Regression Part1
8 pages
Regression Analysis: Basic Concepts: 1 The Simple Linear Model
No ratings yet
Regression Analysis: Basic Concepts: 1 The Simple Linear Model
4 pages
Module 2 Transcripts_v3
No ratings yet
Module 2 Transcripts_v3
103 pages
Regresión y Calibración
No ratings yet
Regresión y Calibración
6 pages
The Scalar Algebra of Means, Covariances, and Correlations
No ratings yet
The Scalar Algebra of Means, Covariances, and Correlations
21 pages
Regression Primer
No ratings yet
Regression Primer
4 pages
Chapter 6: How To Do Forecasting by Regression Analysis
No ratings yet
Chapter 6: How To Do Forecasting by Regression Analysis
7 pages
Chapter 2 Econometrics
No ratings yet
Chapter 2 Econometrics
5 pages
A Tutorial On How To Run A Simple Linear Regression in Excel
No ratings yet
A Tutorial On How To Run A Simple Linear Regression in Excel
19 pages
UNIT I Notes
No ratings yet
UNIT I Notes
23 pages
UNIT I Notes-1
No ratings yet
UNIT I Notes-1
18 pages
13 - Data Mining Within A Regression Framework
No ratings yet
13 - Data Mining Within A Regression Framework
27 pages
STA2100-Regression Analysis
No ratings yet
STA2100-Regression Analysis
15 pages
Engineering Analysis & Statistics: Lect. # 11
No ratings yet
Engineering Analysis & Statistics: Lect. # 11
22 pages
12 Gen Ch3 Least Squares Regression Notes 2024
No ratings yet
12 Gen Ch3 Least Squares Regression Notes 2024
17 pages
Simple Linear Regression Analysis
No ratings yet
Simple Linear Regression Analysis
7 pages
Multiple Correlation
No ratings yet
Multiple Correlation
5 pages
Regression 2006-03-01
No ratings yet
Regression 2006-03-01
53 pages
Simple Linear Regression: Coefficient of Determination
No ratings yet
Simple Linear Regression: Coefficient of Determination
21 pages
Regression and Multiple Regression Analysis
100% (1)
Regression and Multiple Regression Analysis
21 pages
Best-Fit Curves: x+1 Is Zero When X 1. The Sine of X Is 1 When X Is
No ratings yet
Best-Fit Curves: x+1 Is Zero When X 1. The Sine of X Is 1 When X Is
3 pages
Module 3 EDA
No ratings yet
Module 3 EDA
14 pages
Econometrics: Two Variable Regression: The Problem of Estimation
No ratings yet
Econometrics: Two Variable Regression: The Problem of Estimation
28 pages
Chapter 12 Notes
No ratings yet
Chapter 12 Notes
60 pages
Short - Notes - Econometric Methods
No ratings yet
Short - Notes - Econometric Methods
22 pages
Maths Project 2
No ratings yet
Maths Project 2
6 pages
Regression Analysis (Simple)
100% (1)
Regression Analysis (Simple)
8 pages
Making Sense of Methods and Measurements: Simple Linear Regression
No ratings yet
Making Sense of Methods and Measurements: Simple Linear Regression
2 pages
Chapter2
No ratings yet
Chapter2
20 pages
1.10 Simple Linear Regression - Answers
No ratings yet
1.10 Simple Linear Regression - Answers
22 pages
Statistical Models in R
No ratings yet
Statistical Models in R
18 pages
Econometrics - Functional Forms
No ratings yet
Econometrics - Functional Forms
22 pages
Calculus III Essentials
From Everand
Calculus III Essentials
Editors of REA
1/5 (2)
Born in the year 1959: Astrological character profiles for every day of the year
From Everand
Born in the year 1959: Astrological character profiles for every day of the year
Christoph Däppen
No ratings yet
Master the Fundamentals of Electromagnetism and EM-Induction
From Everand
Master the Fundamentals of Electromagnetism and EM-Induction
Space Learn
No ratings yet
Course Outline Probability Methods
No ratings yet
Course Outline Probability Methods
5 pages
BEE4 Assignment 2
No ratings yet
BEE4 Assignment 2
1 page
Q: Solve No 75 and 85 of Chapter 2 of 9 Edition Walpole and Myers. 10 Marks
No ratings yet
Q: Solve No 75 and 85 of Chapter 2 of 9 Edition Walpole and Myers. 10 Marks
1 page
Probability Methods in Engineering BEE4r Assignment 1: Question: Answer The Following. Due Date: 4/10/2018 Before 7:30 PM
No ratings yet
Probability Methods in Engineering BEE4r Assignment 1: Question: Answer The Following. Due Date: 4/10/2018 Before 7:30 PM
2 pages
Lec 10 Req Validation
No ratings yet
Lec 10 Req Validation
17 pages
8086 Datasheet
0% (1)
8086 Datasheet
30 pages
RC-Design-U-2022-v12.6
No ratings yet
RC-Design-U-2022-v12.6
69 pages
Datasheet Cable Plug Play Automotive
No ratings yet
Datasheet Cable Plug Play Automotive
19 pages
Myxedema
No ratings yet
Myxedema
16 pages
Nursing Education Lesson Plan On Intradermal Injection
100% (2)
Nursing Education Lesson Plan On Intradermal Injection
12 pages
Crm Yaaranaholidays Com Client Default Aspx Package 7lmVxSCg1hlbkFPg3OmLEA
No ratings yet
Crm Yaaranaholidays Com Client Default Aspx Package 7lmVxSCg1hlbkFPg3OmLEA
1 page
General Appliance Corporation
No ratings yet
General Appliance Corporation
3 pages
Honey and Water Therapy by T Blends
No ratings yet
Honey and Water Therapy by T Blends
17 pages
Lesson Plans PDF
No ratings yet
Lesson Plans PDF
11 pages
HUBCO Power Plant Final
No ratings yet
HUBCO Power Plant Final
13 pages
DZ Catalogue Spreads LR Final PDF
No ratings yet
DZ Catalogue Spreads LR Final PDF
80 pages
Complete Download Essentials of Neonatal Ventilation, 1st edition Rajiv Pk (Editor) - eBook PDF PDF All Chapters
100% (1)
Complete Download Essentials of Neonatal Ventilation, 1st edition Rajiv Pk (Editor) - eBook PDF PDF All Chapters
62 pages
Alloy Steel
No ratings yet
Alloy Steel
1 page
Cube Complex_Apart_3_Constructive_1.06.24-2
No ratings yet
Cube Complex_Apart_3_Constructive_1.06.24-2
46 pages
CHAPTER - 1 - Differential Diagnoses - 2011 - Small Animal Dermatology PDF
No ratings yet
CHAPTER - 1 - Differential Diagnoses - 2011 - Small Animal Dermatology PDF
21 pages
Logan ZPE Orgone 0
No ratings yet
Logan ZPE Orgone 0
210 pages
CMTSE Test 3 Question Paper II EM
No ratings yet
CMTSE Test 3 Question Paper II EM
4 pages
Product Code Revision: Outokumpu Mintec Oy
No ratings yet
Product Code Revision: Outokumpu Mintec Oy
23 pages
HSS Connections PDF
No ratings yet
HSS Connections PDF
29 pages
Chemistry Practical Report - Naphthalene
43% (7)
Chemistry Practical Report - Naphthalene
3 pages
RM032 - Mark Tan_NCCS - Risk Assessment on use of traditional curtains
No ratings yet
RM032 - Mark Tan_NCCS - Risk Assessment on use of traditional curtains
1 page
Module 1
No ratings yet
Module 1
10 pages
Barangay Profile
No ratings yet
Barangay Profile
13 pages
GB 2012-2013 E01 - Web-Versjon
100% (1)
GB 2012-2013 E01 - Web-Versjon
252 pages
Color of Aura
No ratings yet
Color of Aura
2 pages
DIN 6935 Soğuk Bükme
No ratings yet
DIN 6935 Soğuk Bükme
13 pages
DE On Thi HK2 ANH 11 GLOBAL SUCCESS de 3
No ratings yet
DE On Thi HK2 ANH 11 GLOBAL SUCCESS de 3
13 pages
Process Flow Chart Description: AM RCS Plant Manager Cast Iron Foundry GM Plant
No ratings yet
Process Flow Chart Description: AM RCS Plant Manager Cast Iron Foundry GM Plant
1 page
Aswini Gold Emporium: Manufacture of Gold and Silver Ornaments Charichara Bazar, Nabadwip, Nadia, West Bengal 741302
No ratings yet
Aswini Gold Emporium: Manufacture of Gold and Silver Ornaments Charichara Bazar, Nabadwip, Nadia, West Bengal 741302
2 pages