MODULE 10: PERFORMANCE MEASURES
The Confusion Matrix
A confusion matrix is a table showing counts of samples that have been correctly
classified and those that have been incorrectly classified by a model. As an
example, suppose we have photos of 50 people, of whom 20 are male and 30 are
female. We want our model to classify each photo as either male or female, with
female being the positive class. In other words, the question the model needs to
answer is: "is this a photo of a female person?" If it answers yes (1), it has
predicted female. If it answers no (0), it has predicted male. Table 10.1 shows
the counts of predictions made by the model.
Table 10.1 Confusion matrix of gender classification

                    Predicted Female    Predicted Male
    Actual Female       25 (TP)             5 (FN)
    Actual Male          4 (FP)            16 (TN)
The rows show the actual gender of the people and the columns show the
classification made by the model. The TP (true positive) cell shows the number of
females correctly classified as female. The FN (false negative) cell shows the
number of females incorrectly classified as male. The FP (false positive) cell
shows the number of males incorrectly classified as female. The TN (true negative)
cell shows the number of males correctly classified as male. Most of the other
measures of performance are calculated from the values in the confusion matrix.
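As an illustration, the sketch below (assuming scikit-learn and NumPy are installed) builds label vectors that reproduce the counts in Table 10.1 and computes the same confusion matrix with scikit-learn's confusion_matrix function; coding female as 1 and male as 0 is our own choice for this example.

import numpy as np
from sklearn.metrics import confusion_matrix

#30 actual females (coded 1, the positive class) followed by 20 actual males (coded 0)
y_true = np.array([1] * 30 + [0] * 20)
#the model gets 25 females right, misses 5, and wrongly labels 4 males as female
y_pred = np.array([1] * 25 + [0] * 5 + [1] * 4 + [0] * 16)

#with labels=[1, 0] the first row and column correspond to the positive (female) class
cm = confusion_matrix(y_true, y_pred, labels=[1, 0])
print(cm)  #[[25  5]
           # [ 4 16]]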
Accuracy
Accuracy is a very common measure for reporting performance. It is intuitive and
easy for most people to understand. It is calculated using the formula below.

accuracy = (TP + TN) / (TP + TN + FP + FN)
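For the example in Table 10.1, accuracy = (25 + 16) / 50 = 0.82.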
Error Rate
This is the opposite of accuracy: it represents the proportion of
misclassifications. It is calculated using the formula below.

error rate = (FP + FN) / (TP + TN + FP + FN)
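For the example in Table 10.1, error rate = (4 + 5) / 50 = 0.18.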
Sensitivity/Recall/True Positive Rate
This measures the ability of the model to classify the members of the positive
class correctly. It answers the question: what percent of the positive class were
correctly classified? It is calculated using the formula below.
sensitivity = TP / (TP + FN)
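For the example in Table 10.1, sensitivity = 25 / (25 + 5) ≈ 0.83, i.e. about 83% of the females were correctly classified.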
Specificity/True Negative Rate
This measures the ability of the model to classify the members of the negative
class correctly. It answers the question: what percent of the negative class were
correctly classified? It is calculated using the formula below.
specificity = TN / (TN + FP)
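For the example in Table 10.1, specificity = 16 / (16 + 4) = 0.80.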
Precision
This measures the ability of the model to classify only members of the positive
class as positive. It answers the question: what percent of the samples classified
as positive are actually positive? It is calculated using the formula below.

precision = TP / (TP + FP)
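For the example in Table 10.1, precision = 25 / (25 + 4) ≈ 0.86.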
F1 Score
This is an average of the precision and the recall using the harmonic mean of the
two. The harmonic mean is a type of average that gives greater weight to the
lesser of the two values. It is calculated using the formula below. It is a more
reliable measure of performance for unbalanced datasets. An unbalanced dataset is
one in which the number of samples of one class is much greater than the number
of samples of the other class.

F1 score = (2 × precision × recall) / (precision + recall)
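For the example in Table 10.1, precision ≈ 0.862 and recall ≈ 0.833, so F1 score = (2 × 0.862 × 0.833) / (0.862 + 0.833) ≈ 0.85.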
Balanced Accuracy
To calculate the balanced accuracy, we calculate the accuracy for each class
separately and then average the two results. It is also a more reliable measure
for unbalanced datasets. It is calculated using the formula below, which assumes
we have two classes only.
balanced accuracy = (TP / (TP + FN) + TN / (TN + FP)) / 2
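For the example in Table 10.1, balanced accuracy = (25/30 + 16/20) / 2 ≈ 0.82.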
Area under the Curve (AUC)
This measure tries to balance the benefits (true positives) against the costs
(false positives). For a given model, it shows how the benefits are affected by
the costs. A receiver operating characteristic (ROC) curve is used for comparing
classification models. Models with a greater area under their ROC curves are more
beneficial, i.e. they achieve a higher true positive rate for a given false
positive rate. In an ROC curve, the TPR (sensitivity) is plotted against the FPR
(1 - specificity). You will learn more about the AUC and how it is calculated in
the reading material provided later in the lesson.
Figure 10.1 ROC curve
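As a minimal sketch (assuming scikit-learn is installed; the labels and scores below are made up purely for illustration), the points of the ROC curve and the AUC can be computed with roc_curve and roc_auc_score:

import numpy as np
from sklearn.metrics import roc_curve, roc_auc_score

#hypothetical true labels and predicted scores for the positive class
y_true = np.array([1, 1, 0, 1, 0, 0, 1, 0])
y_score = np.array([0.9, 0.8, 0.7, 0.6, 0.45, 0.4, 0.3, 0.2])

fpr, tpr, thresholds = roc_curve(y_true, y_score)  #points along the ROC curve
auc = roc_auc_score(y_true, y_score)               #area under that curve
print(auc)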
Unbalanced Data Sets
Consider a dataset consisting of fraudulent and legitimate online transactions.
Under normal circumstances, only a very small proportion of the transactions are
fraudulent. As an example, in the Credit Card Fraud Detection dataset, only 0.18%
of the transactions are fraudulent, which is far less than one percent. Such a
dataset is called imbalanced. Since most of the training examples are legitimate,
a classification model will learn to classify the legitimate transactions
correctly while performing poorly on the fraudulent transactions. Some measures of
performance are not appropriate for imbalanced datasets. Accuracy, for instance,
may not give a true picture when applied to an unbalanced dataset since the
accuracy of the majority class will dominate. Balanced accuracy is, however, a
good measure for imbalanced datasets. Other measures such as sensitivity and
specificity can also be applied to imbalanced datasets.
There are other ways of dealing with imbalance in the dataset. Oversampling is a
technique where a new dataset is created by replicating samples belonging to the
minority class so that their number is approximately equal to that of the majority
class. The training and test sets are then obtained from this new dataset.
Undersampling is another technique where samples are randomly selected from the
majority class such that the number of majority class samples selected is
approximately equal to the number of samples in the minority class.
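As a minimal sketch of oversampling (assuming scikit-learn and NumPy are installed; the synthetic dataset created with make_classification and all variable names are placeholders chosen only for illustration):

import numpy as np
from sklearn.datasets import make_classification
from sklearn.utils import resample

#a small synthetic imbalanced dataset: roughly 90% class 0 and 10% class 1
X, y = make_classification(n_samples=1000, weights=[0.9, 0.1], random_state=0)

#separate the samples of each class
X_minority, y_minority = X[y == 1], y[y == 1]
X_majority, y_majority = X[y == 0], y[y == 0]

#oversampling: replicate minority samples (with replacement) until the classes are balanced
X_min_up, y_min_up = resample(X_minority, y_minority, replace=True,
                              n_samples=len(y_majority), random_state=0)

#stack the classes back together; training and test sets are then drawn from this new dataset
X_balanced = np.vstack([X_majority, X_min_up])
y_balanced = np.concatenate([y_majority, y_min_up])
print(np.bincount(y_balanced))

Undersampling would be the mirror image of this sketch: resample the majority class with replace=False and n_samples=len(y_minority).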
Cross-Validation
Cross-validation is a method that is used to come up with more reliable
performance estimates. If we use accuracy as an example, in cross-validation you
calculate the accuracy several times (say n times), then take the average of the
n accuracy values. In k-fold cross-validation, the dataset is divided into k
distinct subsets (d1, d2, ..., dk). The training and testing cycle is repeated k
times. In the ith repetition, the subset di is used as the test set while all the
other subsets are combined and used for training. k can be any number; however,
k = 10 is commonly used. Leave-one-out is a special case of k-fold
cross-validation where k is equal to the number of samples in the dataset.
Therefore, in each train-test cycle, only one sample is used for testing and the
rest of the samples are used for training.
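As a minimal sketch of 10-fold cross-validation (assuming scikit-learn is installed; the iris dataset and a k-nearest neighbours classifier are used only as placeholders):

from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

x, y = load_iris(return_X_y=True)
knn = KNeighborsClassifier(n_neighbors=5)

#one accuracy value per fold; the overall estimate is the average of the 10 values
scores = cross_val_score(knn, x, y, cv=10, scoring='accuracy')
print(scores.mean())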
Cross-validation is commonly used in the selection of hyperparameters for a
training algorithm. As an example, it can be used in the selection of k in the
k-nearest neighbours algorithm. It is also used in deciding which model to
implement (model selection): the performances of the models are compared using
cross-validation and the best performing model is selected.
IMPLEMENTATION OF CROSS-VALIDATION USING PYTHON
You are going to implement cross-validation using the GridSearchCV class. This
class determines the average score for each hyperparameter setting using
cross-validation for a given machine learning model. You will see how the best
value for the k hyperparameter in the k-nearest neighbours algorithm can be
determined.
#import the packages and classes we will use
from sklearn.neighbors import KNeighborsClassifier
import numpy as np
from sklearn.datasets import load_iris
from sklearn.preprocessing import StandardScaler
from sklearn.model_selection import GridSearchCV
from sklearn import metrics
import pandas as pd
#load the data
data=load_iris()
#get the target values
y=data.target
#get the labels
labels=data.target_names
#get the x values
x=data.data
#use the standard scaler to standardize the features
scaler=StandardScaler()
x=scaler.fit_transform(x)
#some metrics you can use to test performance
metrics.get_scorer_names()
#set the hyperparameter values to be examined: k=1 to 9
grid={'n_neighbors':np.arange(1,10)}
#create the classifier
knn=KNeighborsClassifier()
#since this is a multiclass dataset, GridSearchCV will use the StratifiedKFold splitting strategy
#The data will be split into 10 subsets.
#For each value of k, 9 of the subsets are used for training while one subset is reserved for testing
#This is repeated for each subset
GSKnnCv=GridSearchCV(knn,grid,cv=10, refit=False,
scoring=['accuracy','balanced_accuracy','precision_macro','f1_macro'])
#run the cross-validated grid search on the scaled data
GSKnnCv.fit(x,y)
#The results are returned in a dictionary
GSKnnCv.cv_results_
#The dictionary can be imported into a dataframe
df=pd.DataFrame(GSKnnCv.cv_results_)
df.head(10)
#select the average score for each number of neighbours together with the rank of
#that score across the different numbers of neighbours
df1=df.loc[:,
['param_n_neighbors','mean_test_accuracy','rank_test_accuracy',
'mean_test_balanced_accuracy','rank_test_balanced_accuracy',
'mean_test_precision_macro','rank_test_precision_macro',
'mean_test_f1_macro','rank_test_f1_macro']]
df1.head(10)
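One possible way (not part of the original listing) to read off the best value of k according to a chosen metric is to pick the row whose rank for that metric is 1:

#select the value of k whose mean test accuracy is ranked first
best=df1.loc[df1['rank_test_accuracy']==1]
print(best[['param_n_neighbors','mean_test_accuracy']])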