Logistic Regression on Iris Dataset

In [1]: import numpy as np

import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import confusion_matrix, classification_report, accuracy_score

In [2]: from sklearn.datasets import load_iris

# Load the dataset
iris = load_iris()

# Convert to pandas DataFrame for easier handling
data = pd.DataFrame(data=np.c_[iris['data'], iris['target']],
                    columns=iris['feature_names'] + ['target'])

Exploring the Dataset


In [3]: print(data.head())

sepal length (cm) sepal width (cm) petal length (cm) petal width (cm) \
0 5.1 3.5 1.4 0.2
1 4.9 3.0 1.4 0.2
2 4.7 3.2 1.3 0.2
3 4.6 3.1 1.5 0.2
4 5.0 3.6 1.4 0.2

target
0 0.0
1 0.0
2 0.0
3 0.0
4 0.0

In [4]: # Data types and non-null counts
print(data.info())

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 150 entries, 0 to 149
Data columns (total 5 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 sepal length (cm) 150 non-null float64
1 sepal width (cm) 150 non-null float64
2 petal length (cm) 150 non-null float64
3 petal width (cm) 150 non-null float64
4 target 150 non-null float64
dtypes: float64(5)
memory usage: 6.0 KB
None
In [5]: # Summary statistics
print(data.describe())

sepal length (cm) sepal width (cm) petal length (cm) \
count 150.000000 150.000000 150.000000
mean 5.843333 3.057333 3.758000
std 0.828066 0.435866 1.765298
min 4.300000 2.000000 1.000000
25% 5.100000 2.800000 1.600000
50% 5.800000 3.000000 4.350000
75% 6.400000 3.300000 5.100000
max 7.900000 4.400000 6.900000

petal width (cm) target
count 150.000000 150.000000
mean 1.199333 1.000000
std 0.762238 0.819232
min 0.100000 0.000000
25% 0.300000 0.000000
50% 1.300000 1.000000
75% 1.800000 2.000000
max 2.500000 2.000000

In [7]: # Pairwise correlations between the columns
print(data.corr())

sepal length (cm) sepal width (cm) petal length (cm) \
sepal length (cm) 1.000000 -0.117570 0.871754
sepal width (cm) -0.117570 1.000000 -0.428440
petal length (cm) 0.871754 -0.428440 1.000000
petal width (cm) 0.817941 -0.366126 0.962865
target 0.782561 -0.426658 0.949035

petal width (cm) target
sepal length (cm) 0.817941 0.782561
sepal width (cm) -0.366126 -0.426658
petal length (cm) 0.962865 0.949035
petal width (cm) 1.000000 0.956547
target 0.956547 1.000000
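
Petal length and petal width track the target almost perfectly (0.949 and 0.957), so the petal measurements should carry most of the classification signal. As an optional visual check (a sketch, not part of the original run, reusing the seaborn and matplotlib imports from In [1]):

# Optional: draw the same correlation matrix as an annotated heatmap
plt.figure(figsize=(6, 5))
sns.heatmap(data.corr(), annot=True, fmt='.2f', cmap='coolwarm')
plt.title('Feature Correlations')
plt.show()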

Mapping Target Values to Species


In [8]: # Mapping target to species
species_map = {0.0: 'setosa', 1.0: 'versicolor', 2.0: 'virginica'}
data['species'] = data['target'].map(species_map)

print(data[['sepal length (cm)', 'sepal width (cm)', 'petal length (cm)', 'petal width (cm)', 'species']].head())
sepal length (cm) sepal width (cm) petal length (cm) petal width (cm) \
0 5.1 3.5 1.4 0.2
1 4.9 3.0 1.4 0.2
2 4.7 3.2 1.3 0.2
3 4.6 3.1 1.5 0.2
4 5.0 3.6 1.4 0.2

species
0 setosa
1 setosa
2 setosa
3 setosa
4 setosa

Checking for Missing Values


In [9]: # Checking for missing values
print(data.isnull().sum())

sepal length (cm) 0
sepal width (cm) 0
petal length (cm) 0
petal width (cm) 0
target 0
species 0
dtype: int64

No missing values are present in the dataset.

Data Preprocessing

Feature Selection
Splitting Features and Target

Separate the dataset into features (X) and target (y).

In [10]: # Features
X = data.iloc[:, 0:4].values  # or data[['sepal length (cm)', 'sepal width (cm)', 'petal length (cm)', 'petal width (cm)']].values

# Target
y = data['target'].values

Feature Scaling

In [11]: from [Link] import StandardScaler

# Initialize the scaler


scaler = StandardScaler()

# Fit the scaler on the features and transform


X_scaled = scaler.fit_transform(X)
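
As a quick sanity check (a sketch, not in the original notebook), the standardized features should come out with mean roughly 0 and standard deviation 1:

# Verify the standardization: per-feature mean ~0 and std ~1
print(X_scaled.mean(axis=0).round(6))  # approximately [0. 0. 0. 0.]
print(X_scaled.std(axis=0).round(6))   # [1. 1. 1. 1.]
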
Splitting the Dataset
In [12]: # Split the dataset into training and testing sets
# Test size = 20% of the dataset, random_state for reproducibility
X_train, X_test, y_train, y_test = train_test_split(X_scaled, y, test_size=0.2, random_state=42)

print(f'Training set size: {X_train.shape[0]} samples')
print(f'Testing set size: {X_test.shape[0]} samples')

Training set size: 120 samples
Testing set size: 30 samples
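
Note that this split is not stratified, which is why the test set ends up with slightly uneven class counts (10/9/11, visible in the confusion matrix below). If equal class proportions are preferred, train_test_split accepts a stratify argument; a minimal variant (a sketch, not part of the original run):

# Stratified alternative: preserves the equal class balance in both sets
X_train_s, X_test_s, y_train_s, y_test_s = train_test_split(
    X_scaled, y, test_size=0.2, random_state=42, stratify=y)
print(np.bincount(y_test_s.astype(int)))  # [10 10 10]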

Implementing Logistic Regression


Initializing the Model

In [13]: # Initialize the Logistic Regression model
# Using multinomial since the target has more than two classes
# solver='lbfgs' is suitable for multinomial loss
model = LogisticRegression(multi_class='multinomial', solver='lbfgs', max_iter=200)

Parameters Explained:

multi_class='multinomial': Specifies that the loss function is the multinomial loss fit across the whole probability distribution, which is what we want for multi-class classification.

solver='lbfgs': An optimization algorithm that works well on small datasets and supports the multinomial loss.

max_iter=200: The maximum number of iterations the solver may take to converge, raised from the default of 100 to ensure convergence.
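
To make the multinomial formulation concrete: the model computes one linear score per class and maps the scores to probabilities with the softmax function. A small standalone illustration with made-up scores (not values from the fitted model):

# Softmax: converts per-class scores into probabilities that sum to 1
scores = np.array([2.0, 0.5, -1.0])         # hypothetical scores for 3 classes
exp_scores = np.exp(scores - scores.max())  # subtract the max for numerical stability
probs = exp_scores / exp_scores.sum()
print(probs.round(3), probs.sum())          # ~[0.786 0.175 0.039] 1.0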

Training the Model


In [14]: # Train the model using the training data
model.fit(X_train, y_train)

Out[14]: LogisticRegression(max_iter=200, multi_class='multinomial')

Making Predictions
In [15]: # Predict the classes for the testing set
y_pred = model.predict(X_test)
In [16]: # Create a DataFrame to compare actual and predicted values
comparison = pd.DataFrame({'Actual': y_test, 'Predicted': y_pred})
print(comparison)

Actual Predicted
0 1.0 1.0
1 0.0 0.0
2 2.0 2.0
3 1.0 1.0
4 1.0 1.0
5 0.0 0.0
6 1.0 1.0
7 2.0 2.0
8 1.0 1.0
9 1.0 1.0
10 2.0 2.0
11 0.0 0.0
12 0.0 0.0
13 0.0 0.0
14 0.0 0.0
15 1.0 1.0
16 2.0 2.0
17 1.0 1.0
18 1.0 1.0
19 2.0 2.0
20 0.0 0.0
21 2.0 2.0
22 0.0 0.0
23 2.0 2.0
24 2.0 2.0
25 2.0 2.0
26 2.0 2.0
27 2.0 2.0
28 0.0 0.0
29 0.0 0.0
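
The fitted model can also report how confident each prediction is via predict_proba; a short sketch for the first three test samples (exact values depend on the split and are not from the original run):

# Per-class probabilities; each row sums to 1
proba = model.predict_proba(X_test[:3])
print(pd.DataFrame(proba, columns=iris.target_names).round(3))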

Evaluating the Model Using the Confusion Matrix

Creating the Confusion Matrix
In [17]: # Generate the confusion matrix
cm = confusion_matrix(y_test, y_pred)

print(cm)

[[10 0 0]
[ 0 9 0]
[ 0 0 11]]

In this output:

Row 0 (Actual class 0 - setosa): 10 correctly predicted as setosa.

Row 1 (Actual class 1 - versicolor): 9 correctly predicted as versicolor.

Row 2 (Actual class 2 - virginica): 11 correctly predicted as virginica.

Total samples correctly predicted: 30 out of 30 (100% accuracy in this sample run).
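
Since the diagonal holds the correct predictions, per-class recall can be read straight off the matrix; a quick sketch using the cm computed above:

# Per-class recall: diagonal (correct) divided by row sums (actual counts)
per_class_recall = cm.diagonal() / cm.sum(axis=1)
print(dict(zip(iris.target_names, per_class_recall)))  # all 1.0 in this run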

Visualizing the Confusion Matrix


In [18]: plt.figure(figsize=(8, 6))
sns.heatmap(cm, annot=True, fmt='d', cmap='Blues',
            xticklabels=iris.target_names,
            yticklabels=iris.target_names)
plt.ylabel('Actual')
plt.xlabel('Predicted')
plt.title('Confusion Matrix')
plt.show()

Interpreting Results
Accuracy Score
In [19]: # Computing the Accuracy
accuracy = accuracy_score(y_test, y_pred)
print(f'Accuracy: {accuracy * 100:.2f}%')

Accuracy: 100.00%
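
The same figure follows directly from the confusion matrix: accuracy is the sum of the diagonal over the total number of test samples. A one-line check:

# Accuracy from the confusion matrix: (10 + 9 + 11) / 30 = 1.0
print(cm.trace() / cm.sum())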

Classification Report

In [20]: from sklearn.metrics import classification_report

# Generate classification report
report = classification_report(y_test, y_pred, target_names=iris.target_names)
print(report)

precision recall f1-score support

setosa 1.00 1.00 1.00 10
versicolor 1.00 1.00 1.00 9
virginica 1.00 1.00 1.00 11

accuracy 1.00 30
macro avg 1.00 1.00 1.00 30
weighted avg 1.00 1.00 1.00 30
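
Each column of the report is a simple ratio over the confusion matrix: for one class, precision = TP / (TP + FP), recall = TP / (TP + FN), and f1-score is their harmonic mean. A sketch computing them by hand for setosa (class 0):

# Manual precision / recall / F1 for class 0 (setosa)
tp = cm[0, 0]                 # setosa correctly predicted as setosa
fp = cm[:, 0].sum() - tp      # other classes wrongly predicted as setosa
fn = cm[0, :].sum() - tp      # setosa wrongly predicted as another class
precision = tp / (tp + fp)
recall = tp / (tp + fn)
f1 = 2 * precision * recall / (precision + recall)
print(precision, recall, f1)  # 1.0 1.0 1.0 in this run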
