Classification
From Binary to Multioutput Systems
Gauranga Kumar Baishya
August 29, 2025
Outline
1 Introduction to Classification & MNIST
2 Training a Binary Classifier
3 Performance Measures
4 Multiclass Classification
5 Error Analysis
6 Advanced Classification Tasks
The MNIST Dataset: “Hello World” of ML
What is MNIST?
A dataset of 70,000 small, grayscale images of handwritten digits (0-9).
It’s a benchmark for testing new classification algorithms.
Dataset Structure
70,000 instances (images).
784 features per instance.
Each image is 28x28 pixels.
Each feature represents one pixel's intensity (0-255).
Figure: A few sample digits from the MNIST dataset.
Important First Step
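A common first step is to set the test set aside before exploring further. A minimal loading sketch (the slides do not show this code; fetch_openml and the conventional 60,000/10,000 split are assumptions):

from sklearn.datasets import fetch_openml

# Download MNIST from OpenML: 70,000 images, 784 pixel features each
mnist = fetch_openml('mnist_784', as_frame=False)
X = mnist.data
y = mnist.target.astype(int)  # labels arrive as strings; cast so (y == 5) works

# Conventional split: first 60,000 for training, last 10,000 for testing
X_train, X_test = X[:60000], X[60000:]
y_train, y_test = y[:60000], y[60000:]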
Creating a “5-Detector”
Simplifying the Problem
To start, we’ll build a binary classifier that can only distinguish between
two classes: “5” and “not-5”.
Target Vector Creation
We create new target labels that are boolean: True for all 5s, False for all
other digits:
y_train_5 = (y_train == 5)
y_test_5 = (y_test == 5)
Training an SGD Classifier
A good starting point is the Stochastic Gradient Descent (SGD)
classifier. It’s efficient and handles large datasets well.
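A minimal training sketch (assuming X_train and y_train_5 from the previous steps; random_state is set only for reproducibility):

from sklearn.linear_model import SGDClassifier

# Fit a linear model with stochastic gradient descent on the binary targets
sgd_clf = SGDClassifier(random_state=42)
sgd_clf.fit(X_train, y_train_5)

# Predict whether a single image is a 5 (a 1-instance batch)
sgd_clf.predict(X_train[:1])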
The Problem with Accuracy
Initial Accuracy Score
Using 3-fold cross-validation, the SGDClassifier achieves over 93%
accuracy.
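A sketch of how these scores are obtained (sgd_clf, X_train, and y_train_5 as defined on the previous slides):

from sklearn.model_selection import cross_val_score

# Accuracy on each of the 3 cross-validation folds
cross_val_score(sgd_clf, X_train, y_train_5, cv=3, scoring="accuracy")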
array([0.96355, 0.93795, 0.95615])
This seems great, but is it?
The Pitfall of Skewed Datasets
Let’s consider a classifier that always predicts “not-5”.
Only about 10% of the images are 5s.
So, this “dumb” classifier will be correct about 90% of the time!
This shows that accuracy is not a good performance measure for
classifiers, especially on skewed datasets.
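For illustration, such a baseline can be built with scikit-learn's DummyClassifier (a sketch; the slides do not specify the exact baseline):

from sklearn.dummy import DummyClassifier
from sklearn.model_selection import cross_val_score

# Always predicts the most frequent class, i.e. "not-5"
never_5_clf = DummyClassifier(strategy="most_frequent")
cross_val_score(never_5_clf, X_train, y_train_5, cv=3, scoring="accuracy")
# ~0.90 accuracy on every fold, despite never detecting a single 5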
The Confusion Matrix
A Better Way to Evaluate
The confusion matrix provides a much better view of a classifier’s
performance by showing the number of times instances of class A are
classified as class B.
Terminology:
True Negatives (TN): Correctly classified as not-5.
False Positives (FP): Incorrectly classified as 5.
False Negatives (FN): Incorrectly classified as not-5.
True Positives (TP): Correctly classified as 5.
Figure: Structure of a confusion matrix.
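A sketch of computing the matrix from out-of-fold predictions (cross_val_predict ensures each instance is predicted by a model that never saw it during training):

from sklearn.model_selection import cross_val_predict
from sklearn.metrics import confusion_matrix

# Clean, out-of-sample predictions for every training instance
y_train_pred = cross_val_predict(sgd_clf, X_train, y_train_5, cv=3)

# Rows = actual classes (not-5, 5); columns = predicted classes
confusion_matrix(y_train_5, y_train_pred)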
Our 5-Detector’s Matrix
array([[53057,  1522],
       [ 1325,  4096]])
1522 non-5s were wrongly classified as 5s (FP).
1325 5s were wrongly classified as not-5s (FN).
Precision, Recall, and F1 Score
Precision: Accuracy of Positive Predictions
What proportion of positive identifications was actually correct?
\[ \text{Precision} = \frac{TP}{TP + FP} \]
For our model, precision is 4096/(4096 + 1522) ≈ 72.9%.
Recall (Sensitivity): True Positive Rate
What proportion of actual positives was identified correctly?
\[ \text{Recall} = \frac{TP}{TP + FN} \]
For our model, recall is 4096/(4096 + 1325) ≈ 75.6%.
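A sketch of computing both metrics (y_train_pred as obtained with cross_val_predict earlier):

from sklearn.metrics import precision_score, recall_score

precision_score(y_train_5, y_train_pred)  # ≈ 0.729
recall_score(y_train_5, y_train_pred)     # ≈ 0.756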
Precision, Recall and F1 Score
F1 Score: The Harmonic Mean
A single metric that combines precision and recall. It gives more weight to
low values, so a high F1 score requires both high precision and high recall.
\[ F_1 = 2 \times \frac{\text{Precision} \times \text{Recall}}{\text{Precision} + \text{Recall}} \]
For our model, the F1 score is ≈ 0.742 (74.2%).
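A sketch of computing it directly (same predictions as before):

from sklearn.metrics import f1_score

f1_score(y_train_5, y_train_pred)  # ≈ 0.742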
The Precision-Recall Tradeoff
Figure: Plotting precision and recall against the decision threshold.
The Precision-Recall Trade-off
The Inherent Conflict
Unfortunately, increasing precision reduces recall, and vice versa; this is called the precision-recall trade-off.
How it Works: The Decision Threshold
Classifiers compute a score for each instance. If the score is above a
threshold, it’s classified as positive.
Raising the threshold: Increases precision (fewer false positives) but
decreases recall (more false negatives).
Lowering the threshold: Increases recall (fewer false negatives) but
decreases precision (more false positives).
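A sketch of working with the threshold explicitly (the threshold value below is purely illustrative):

from sklearn.model_selection import cross_val_predict
from sklearn.metrics import precision_recall_curve

# Decision scores instead of hard predictions
y_scores = cross_val_predict(sgd_clf, X_train, y_train_5, cv=3,
                             method="decision_function")

# Precision and recall for every possible threshold
precisions, recalls, thresholds = precision_recall_curve(y_train_5, y_scores)

# Raise the threshold to trade recall for precision (value is illustrative)
y_pred_high_precision = (y_scores >= 3000)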
The ROC Curve
Receiver Operating Characteristic (ROC)
Another common tool for binary classifiers. It plots the True Positive
Rate (Recall) against the False Positive Rate (FPR).
FPR: The ratio of negative instances that are incorrectly classified as
positive.
A good classifier stays as far away from the diagonal line as possible
(toward the top-left corner).
Facts!
A perfect classifier has a ROC AUC (area under the ROC curve) of 1.
A purely random classifier has an AUC of 0.5.
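A sketch of computing the curve and its AUC (y_scores as obtained from decision_function earlier):

from sklearn.metrics import roc_curve, roc_auc_score

# False positive rate and true positive rate (recall) at every threshold
fpr, tpr, thresholds = roc_curve(y_train_5, y_scores)

# 1.0 for a perfect classifier, 0.5 for random guessing
roc_auc_score(y_train_5, y_scores)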
Receiver Operating Characteristic (ROC) – Rule of Thumb
When to use ROC vs. Precision-Recall?
Since the ROC curve is so similar to the precision/recall (PR) curve, one
may wonder how to decide which one to use. As a rule of thumb, it is
preferable to use the PR curve whenever the positive class is rare or when
you care more about the false positives than the false negatives; otherwise,
the ROC curve is suitable. For example, when looking at the ROC curve
and the ROC AUC score for the digit classifier, one might think that the
classifier is very good. However, this is mostly because there are few
positives (5s) compared to the negatives (non-5s). In contrast, the PR
curve makes it clear that the classifier has room for improvement, as the
curve could be closer to the top-right corner.
Handling More Than Two Classes
Multiclass (or Multinomial) Classifiers
These classifiers can distinguish between more than two classes. Some
algorithms (like SGD, Random Forests) support this natively. Others (like
SVMs) are strictly binary.
One-vs-the-Rest (OvR): Train 1 binary classifier for each class (e.g., a 0-detector, a 1-detector, etc.). To classify a new image, get the decision score from each classifier and pick the class with the highest score.
One-vs-One (OvO): Train 1 binary classifier for every pair of classes (0 vs 1, 0 vs 2, 1 vs 2, etc.). For N classes, this requires N(N-1)/2 classifiers. The class that wins the most "duels" is chosen.
Scikit-Learn automatically applies OvR or OvO based on the algorithm.
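As a sketch, a strictly binary learner such as an SVM can also be wrapped in either strategy explicitly (the small training slice just keeps the example fast):

from sklearn.svm import SVC
from sklearn.multiclass import OneVsRestClassifier, OneVsOneClassifier

# One binary SVM per class (10 classifiers for 10 digits)
ovr_clf = OneVsRestClassifier(SVC())
ovr_clf.fit(X_train[:2000], y_train[:2000])

# One binary SVM per pair of classes (10*9/2 = 45 classifiers)
ovo_clf = OneVsOneClassifier(SVC())
ovo_clf.fit(X_train[:2000], y_train[:2000])

len(ovr_clf.estimators_), len(ovo_clf.estimators_)  # (10, 45)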
Improving Models by Analyzing Errors
The Multiclass Confusion Matrix
Just like with binary classification, we can create a confusion matrix to see
where the model is making mistakes.
Figure: Multiclass confusion matrix (rows are actual classes, columns are predicted classes).
The column for class 8 is bright, meaning many other digits are misclassified as 8s.
Pairs such as 3s and 5s, or 7s/4s and 9s, are often confused.
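A sketch of producing such a plot with scikit-learn's ConfusionMatrixDisplay (the multiclass predictions are assumed to come from cross_val_predict on the full 10-class targets):

import matplotlib.pyplot as plt
from sklearn.model_selection import cross_val_predict
from sklearn.metrics import ConfusionMatrixDisplay

# Out-of-fold predictions for the 10-class problem
y_train_pred = cross_val_predict(sgd_clf, X_train, y_train, cv=3)

# Rows = actual classes, columns = predicted classes
ConfusionMatrixDisplay.from_predictions(y_train, y_train_pred)
plt.show()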
Multilabel and Multioutput Classification
Multilabel Classification
A system that can output multiple binary classes for each instance.
Example: A face-recognition system that recognizes several people (say Alice, Bob, and Charlie) in one photo. If it sees Alice and Charlie, the output would be [1, 0, 1].
Evaluation can be done by calculating the F1 score for each label and
averaging the result.
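A minimal multilabel sketch (the two labels, "large digit" and "odd digit", are illustrative rather than from the slides; KNeighborsClassifier handles multilabel targets natively):

import numpy as np
from sklearn.neighbors import KNeighborsClassifier

# Two binary labels per digit: is it >= 7? is it odd?
y_train_large = (y_train >= 7)
y_train_odd = (y_train % 2 == 1)
y_multilabel = np.c_[y_train_large, y_train_odd]

knn_clf = KNeighborsClassifier()
knn_clf.fit(X_train, y_multilabel)
knn_clf.predict(X_train[:1])  # e.g. array([[False,  True]])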
Multioutput Classification
A generalization of multilabel classification where each label can be
multiclass (i.e., have more than two possible values).
Example: A system that removes noise from an image. The input is
a noisy image, and the output is a clean image.
Here, the output is multilabel (one label per pixel) and each label is
multiclass (pixel intensity from 0 to 255).
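A sketch of the noise-removal example (the noise level and the KNN choice are illustrative):

import numpy as np
from sklearn.neighbors import KNeighborsClassifier

rng = np.random.default_rng(42)

# Inputs: training images corrupted with random pixel noise
noise = rng.integers(0, 100, (len(X_train), 784))
X_train_noisy = X_train + noise
# Targets: the original clean images (one 0-255 label per pixel)
y_train_clean = X_train

knn_clf = KNeighborsClassifier()
knn_clf.fit(X_train_noisy, y_train_clean)
clean_digit = knn_clf.predict(X_train_noisy[:1])  # a denoised 784-pixel image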
Questions?
Thank You!