0% found this document useful (0 votes)

79 views

05 Logistic - Regression

Logistic regression is a classification algorithm used when the response variable is categorical. It finds a relationship between features and the probability of a particular outcome class. The sigmoid function is used in logistic regression since its range is between 0 and 1, making it suitable for calculating probabilities. Logistic regression models the log odds (log(p/1-p)) as a linear combination of features to classify observations into binary categories based on a probability threshold of 0.5.

Uploaded by

adalina

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

79 views

05 Logistic - Regression

Uploaded by

adalina

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Logistic Regression

It’s a classification algorithm, that is used where the response variable is categorical. The idea of Logistic
Regression is to find a relationship between features and probability of particular outcome.
E.g. When we have to predict if a student passes or fails in an exam when the number of hours spent
studying is given as a feature, the response variable has two values, pass and fail.
If the probability is more than 50%, it assigns the value in that particular class else if the probability is less
than 50%, the value is assigned to the other class. Therefore, we can say that logistic regression acts as a
binary classifier.

Working of a Logistic Model

For linear regression, the model is defined by: 𝑦 = 𝛽0 + 𝛽1 𝑥 - (i)

and for logistic regression, we calculate probability, i.e. y is the probability of a given variable x belonging to a
certain class. Thus, it is obvious that the value of y should lie between 0 and 1.

But, when we use equation(i) to calculate probability, we would get values less than 0 as well as greater than 1.
That doesn’t make any sense . So, we need to use such an equation which always gives values between 0 and
1, as we desire while calculating the probability.

So here we Use Sigmoid Function

Sigmoid function

We use the sigmoid function as the underlying function in Logistic regression. Mathematically and graphically, it
is shown as:

Why do we use the Sigmoid Function?

1) The sigmoid function’s range is bounded between 0 and 1. Thus it’s useful in calculating the probability for
the Logistic function.

2) It’s derivative is easy to calculate than other functions which is useful during gradient descent calculation.

3) It is a simple way of introducing non-linearity to the model.

Now Logistic function On Sigmoid Function

Logit Function
Logistic regression can be expressed as:

where, the left hand side is called the logit or log-odds function, and p(x)/(1-p(x)) is called odds.
The odds signifies the ratio of probability of success to probability of failure. Therefore, in Logistic
Regression, linear combination of inputs are mapped to the log(odds) - the output being equal to 1.
The cost function for the whole training set is given as :

Logistic And Linear Model

Pratical Demonstrate Of Logistic Regression

Importing the libraries

In [1]:

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns

import warnings
warnings.filterwarnings("ignore")

Importing the dataset

In [2]:

dataset = pd.read_csv('Social_Network_Ads.csv')

In [3]:

dataset.head()

Out[3]:

User ID Gender Age EstimatedSalary Purchased

0 15624510 Male 19 19000 0

1 15810944 Male 35 20000 0

2 15668575 Female 26 43000 0

3 15603246 Female 27 57000 0

4 15804002 Male 19 76000 0

In [4]:

X = dataset.drop(['Purchased','User ID','Gender'],axis=1)
y = dataset['Purchased']

In [5]:

X.shape,y.shape

Out[5]:

((400, 2), (400,))

Splitting the dataset into the Training set and Test set
In [6]:

from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size = 0.25, random_state =

Feature Scaling
In [7]:

from sklearn.preprocessing import StandardScaler

sc = StandardScaler()
X_train = sc.fit_transform(X_train)
X_test = sc.transform(X_test)

Training the Logistic Regression model on the Training set

In [8]:

from sklearn.linear_model import LogisticRegression

classifier = LogisticRegression(C=1.0)
classifier.fit(X_train, y_train)

Out[8]:

LogisticRegression()

Predicting the Test set results

In [9]:

y_pred = classifier.predict(X_test)
In [10]:

calculation = pd.DataFrame(np.c_[y_test,y_pred], columns = ["Original Purchased","Predict P

calculation

Out[10]:

Original Purchased Predict Purchased

0 0 0

1 0 0

2 0 0

3 0 0

4 0 0

... ... ...

95 1 0

96 0 0

97 1 0

98 1 1

99 1 1

100 rows × 2 columns

Visualising the Training set results

In [11]:

from matplotlib.colors import ListedColormap

X_set, y_set = X_train, y_train
X1, X2 = np.meshgrid(np.arange(start = X_set[:, 0].min() - 1, stop = X_set[:, 0].max() + 1,
np.arange(start = X_set[:, 1].min() - 1, stop = X_set[:, 1].max() + 1,
plt.contourf(X1, X2, classifier.predict(np.array([X1.ravel(), X2.ravel()]).T).reshape(X1.sh
alpha = 0.75, cmap = ListedColormap(('red', 'green')))
plt.xlim(X1.min(), X1.max())
plt.ylim(X2.min(), X2.max())
for i, j in enumerate(np.unique(y_set)):
plt.scatter(X_set[y_set == j, 0], X_set[y_set == j, 1],
c = ListedColormap(('red', 'green'))(i), label = j)
plt.title('Logistic Regression (Training set)')
plt.xlabel('Age')
plt.ylabel('Estimated Salary')
plt.legend()
plt.show()

'c' argument looks like a single numeric RGB or RGBA sequence, which should
be avoided as value-mapping will have precedence in case its length matches
with 'x' & 'y'. Please use a 2-D array with a single row if you really want
to specify the same RGB or RGBA value for all points.
'c' argument looks like a single numeric RGB or RGBA sequence, which should
be avoided as value-mapping will have precedence in case its length matches
with 'x' & 'y'. Please use a 2-D array with a single row if you really want
to specify the same RGB or RGBA value for all points.

Visualising the Test set results

In [12]:

from matplotlib.colors import ListedColormap

X_set, y_set = X_test, y_test
X1, X2 = np.meshgrid(np.arange(start = X_set[:, 0].min() - 1, stop = X_set[:, 0].max() + 1,
np.arange(start = X_set[:, 1].min() - 1, stop = X_set[:, 1].max() + 1,
plt.contourf(X1, X2, classifier.predict(np.array([X1.ravel(), X2.ravel()]).T).reshape(X1.sh
alpha = 0.75, cmap = ListedColormap(('red', 'green')))
plt.xlim(X1.min(), X1.max())
plt.ylim(X2.min(), X2.max())
for i, j in enumerate(np.unique(y_set)):
plt.scatter(X_set[y_set == j, 0], X_set[y_set == j, 1],
c = ListedColormap(('red', 'green'))(i), label = j)
plt.title('Logistic Regression (Test set)')
plt.xlabel('Age')
plt.ylabel('Estimated Salary')
plt.legend()
plt.show()

Complete Download Fundamentals of Machine Learning for Predictive Data Analytics: Algorithms, PDF All Chapters
100% (4)
Complete Download Fundamentals of Machine Learning for Predictive Data Analytics: Algorithms, PDF All Chapters
55 pages
2nd Exam Question Paper 2
No ratings yet
2nd Exam Question Paper 2
16 pages
Scan To BIM - Presentation
No ratings yet
Scan To BIM - Presentation
61 pages
Standard Specifications For Construction Works 2019 - Module 19
No ratings yet
Standard Specifications For Construction Works 2019 - Module 19
964 pages
Falha Hyster R1 .6
100% (2)
Falha Hyster R1 .6
22 pages
ST2195 Programming For Data Science
No ratings yet
ST2195 Programming For Data Science
11 pages
Pattern Recognition Presenation
100% (1)
Pattern Recognition Presenation
83 pages
What Is A Support Vector Machine?: Primer
No ratings yet
What Is A Support Vector Machine?: Primer
3 pages
6 XG Boost - Jupyter Notebook
100% (1)
6 XG Boost - Jupyter Notebook
3 pages
Project Report: CS 574 - Computer Vision Using Machine Learning
No ratings yet
Project Report: CS 574 - Computer Vision Using Machine Learning
38 pages
Career Plans For Next 2 Years
No ratings yet
Career Plans For Next 2 Years
11 pages
ENG 202: Computers and Engineering Object Oriented Programming in PYTHON
No ratings yet
ENG 202: Computers and Engineering Object Oriented Programming in PYTHON
56 pages
Machine Learning Guide Line
No ratings yet
Machine Learning Guide Line
10 pages
Super Study Guide: Data Science Tools: Afshine Amidi and Shervine Amidi August 21, 2020
No ratings yet
Super Study Guide: Data Science Tools: Afshine Amidi and Shervine Amidi August 21, 2020
23 pages
Pandas
100% (1)
Pandas
1,131 pages
Data Science Interview Questions 2019
No ratings yet
Data Science Interview Questions 2019
16 pages
Building A Career in Data Science - The Overview
No ratings yet
Building A Career in Data Science - The Overview
2 pages
Math For Data Science
100% (1)
Math For Data Science
554 pages
SAS Presentation
No ratings yet
SAS Presentation
49 pages
Pattern Classification
100% (1)
Pattern Classification
42 pages
Data Science Analytics For Ordinary People PDF
No ratings yet
Data Science Analytics For Ordinary People PDF
199 pages
Machine Learning Introduction
No ratings yet
Machine Learning Introduction
20 pages
Essentials of Machine Learning Algorithms (With Python and R Codes) PDF
100% (1)
Essentials of Machine Learning Algorithms (With Python and R Codes) PDF
20 pages
MACHINE LEARNING ALGORITHM Unit-II
No ratings yet
MACHINE LEARNING ALGORITHM Unit-II
115 pages
Mastering Machine Learning With Scikit-Learn: Chapter No. 5 "Nonlinear Classification and Regression With Decision Trees"
No ratings yet
Mastering Machine Learning With Scikit-Learn: Chapter No. 5 "Nonlinear Classification and Regression With Decision Trees"
23 pages
Module 2
No ratings yet
Module 2
20 pages
U02Lecture07 Classification
100% (1)
U02Lecture07 Classification
56 pages
Get Applied Machine Learning and AI for Engineers Jeff Prosise free all chapters
100% (3)
Get Applied Machine Learning and AI for Engineers Jeff Prosise free all chapters
40 pages
Face Detection & Emotion Recognition
No ratings yet
Face Detection & Emotion Recognition
26 pages
Simple Libraries in Python
No ratings yet
Simple Libraries in Python
12 pages
ML Unit 1 Notes
100% (1)
ML Unit 1 Notes
19 pages
Data Science With Python Training in Bangalore - Python Training Institutes in Bangalore, Marathahalli, Jayanagar
100% (1)
Data Science With Python Training in Bangalore - Python Training Institutes in Bangalore, Marathahalli, Jayanagar
8 pages
ST2195 Complete
No ratings yet
ST2195 Complete
430 pages
CS7641 Machine Learning Midterm Notes PDF
No ratings yet
CS7641 Machine Learning Midterm Notes PDF
239 pages
Stock Price Prediction Using Machine Learning With Python
No ratings yet
Stock Price Prediction Using Machine Learning With Python
10 pages
Data Science in Business
No ratings yet
Data Science in Business
9 pages
Ensemble Classifiers
100% (1)
Ensemble Classifiers
37 pages
Figure Style and Scale: Darkgrid Whitegrid Dark White Ticks Darkgrid
No ratings yet
Figure Style and Scale: Darkgrid Whitegrid Dark White Ticks Darkgrid
15 pages
Chapter 5.3-Mulitple Linear Regression
No ratings yet
Chapter 5.3-Mulitple Linear Regression
26 pages
Keras Cheat Sheet Python
No ratings yet
Keras Cheat Sheet Python
1 page
76 - Sample - Chapter Kunci M2K3 No 9
No ratings yet
76 - Sample - Chapter Kunci M2K3 No 9
94 pages
Database Management Systems by Raghu Ramakrishnan: Special Features of Book
No ratings yet
Database Management Systems by Raghu Ramakrishnan: Special Features of Book
3 pages
Data Visualization Ebook
No ratings yet
Data Visualization Ebook
15 pages
ML Handwritten Notes
No ratings yet
ML Handwritten Notes
35 pages
What Is Convolutional Neural Network
No ratings yet
What Is Convolutional Neural Network
16 pages
Data Science Learning Path For 50 Days
No ratings yet
Data Science Learning Path For 50 Days
15 pages
Complete Download An Introduction to Statistical Learning: with Applications in Python Gareth James PDF All Chapters
No ratings yet
Complete Download An Introduction to Statistical Learning: with Applications in Python Gareth James PDF All Chapters
55 pages
Cheet Sheet
No ratings yet
Cheet Sheet
47 pages
Lecture 01 (Introduction To Pattern Recognition)
No ratings yet
Lecture 01 (Introduction To Pattern Recognition)
26 pages
XG Boost
100% (1)
XG Boost
4 pages
Building and Evaluating ML Models
No ratings yet
Building and Evaluating ML Models
27 pages
A Guide To Teaching Data Science PDF
No ratings yet
A Guide To Teaching Data Science PDF
26 pages
Data Science: Concepts and Practice: Course Slides
No ratings yet
Data Science: Concepts and Practice: Course Slides
9 pages
Building Powerful Image Classification Models Using Very Little Data
No ratings yet
Building Powerful Image Classification Models Using Very Little Data
20 pages
DataMining Lecture 1
No ratings yet
DataMining Lecture 1
35 pages
ML0101EN Clus K Means Customer Seg Py v1
100% (1)
ML0101EN Clus K Means Customer Seg Py v1
8 pages
L3 - State Based Search - Revised
No ratings yet
L3 - State Based Search - Revised
83 pages
Text Mining: Fundamentals and Applications
From Everand
Text Mining: Fundamentals and Applications
Fouad Sabry
No ratings yet
Effective Amazon Machine Learning
From Everand
Effective Amazon Machine Learning
Alexis Perrier
No ratings yet
Machine Learning with Python: Design and Develop Machine Learning and Deep Learning Technique using real world code examples
From Everand
Machine Learning with Python: Design and Develop Machine Learning and Deep Learning Technique using real world code examples
Abhishek Vijayvargia
No ratings yet
The Datadog Handbook: A Guide to Monitoring, Metrics, and Tracing
From Everand
The Datadog Handbook: A Guide to Monitoring, Metrics, and Tracing
Robert Johnson
No ratings yet
Data Science Career Guide Interview Preparation
From Everand
Data Science Career Guide Interview Preparation
Gradient Publication
No ratings yet
Deductive Reasoning Activity
No ratings yet
Deductive Reasoning Activity
3 pages
Iso 21149 2006
100% (1)
Iso 21149 2006
12 pages
Kentmaster Catalog Pork 2009
No ratings yet
Kentmaster Catalog Pork 2009
40 pages
Articles Worksheets
No ratings yet
Articles Worksheets
3 pages
Idea Gprs Trick
No ratings yet
Idea Gprs Trick
3 pages
Concept Map: Dishita Solanki VI D
No ratings yet
Concept Map: Dishita Solanki VI D
5 pages
POM Chapter 3 With Attc Ans
100% (1)
POM Chapter 3 With Attc Ans
11 pages
OM3 CH 05 Technology and Operations Management
No ratings yet
OM3 CH 05 Technology and Operations Management
14 pages
Using Advertising and Promotion To Build Brands
No ratings yet
Using Advertising and Promotion To Build Brands
24 pages
Drama - Character in Drama
No ratings yet
Drama - Character in Drama
12 pages
Special Thermal Back Fill Material Surround For Better Performance of Ehv Cables
No ratings yet
Special Thermal Back Fill Material Surround For Better Performance of Ehv Cables
12 pages
Elearning Industry Mastering The Art of Employee Motivation
No ratings yet
Elearning Industry Mastering The Art of Employee Motivation
27 pages
TGA2237-SM Data Sheet-3
No ratings yet
TGA2237-SM Data Sheet-3
18 pages
Acid Resistant Mortar
No ratings yet
Acid Resistant Mortar
13 pages
Funk - Full Score
100% (1)
Funk - Full Score
6 pages
Sixteenth Edition: Strategy Analysis and Choice
No ratings yet
Sixteenth Edition: Strategy Analysis and Choice
56 pages
m10 BW
No ratings yet
m10 BW
2 pages
Microsoft Word - Travel Prashna KB Analysis XXX
No ratings yet
Microsoft Word - Travel Prashna KB Analysis XXX
5 pages
SR-013 Denah Struktur Lantai Atap (Superimposed)
No ratings yet
SR-013 Denah Struktur Lantai Atap (Superimposed)
1 page
Chegg Solutions
100% (1)
Chegg Solutions
9 pages
7th Part
No ratings yet
7th Part
4 pages
Implementation of The NDIS in The Early Childhood Intervention Sector in NSW Final Report
No ratings yet
Implementation of The NDIS in The Early Childhood Intervention Sector in NSW Final Report
135 pages
Infusion Pump Inspection Form
No ratings yet
Infusion Pump Inspection Form
1 page
Report 1 (Photogrammetry)
No ratings yet
Report 1 (Photogrammetry)
8 pages
How To Map On An Etom Level 2
100% (1)
How To Map On An Etom Level 2
75 pages
Socialism in Europe and The Russian Revolution Class 9 Notes Social Science History Chapter 2
No ratings yet
Socialism in Europe and The Russian Revolution Class 9 Notes Social Science History Chapter 2
4 pages
History of AI - Phase 1
No ratings yet
History of AI - Phase 1
4 pages
Zone C Cb8 To Outfall (Exit To Seawater) Section: Drainage Plan
No ratings yet
Zone C Cb8 To Outfall (Exit To Seawater) Section: Drainage Plan
1 page