Classification
(Supervised Learning)
COE 544
Intelligent Engineering Algorithms
Prepared by:
Jason Sakr
Elias Saliba
Marc Mansour
Instructor: Dr. Joe Tekli
10/1/2016 1
Introduction
Machine Learning
“A field of study that gives computers the ability to learn without being
explicitly programmed”
- Arthur Lee Samuel
Introduction
Supervised vs Unsupervised
Supervised
• Prior knowledge is built in as input/output pairs
• These pairs are called labeled training data
• The training data is used to generate a desired output
• The output in each training pair is called the supervisory signal
[Diagram: labeled training data feeding Input → Machine → Output]
Introduction
Supervised vs Unsupervised
Unsupervised
• No prior knowledge on the data
• No training data set
• No idea of what the results should look like
• No input/output training pairs
Introduction
Classification vs Regression
Classification
• One of the most common problems in supervised learning.
• When the machine encounters new samples, it should be able to identify their class.
• Classification algorithms are called classifiers.
• Classifiers assign inputs to discrete categories.
Introduction
Classification vs Regression
Classification example
[Diagram: training set of input images (cow, cow, dog, ...) paired with output labels from the discrete set {COW, DOG, CAT, HORSE}]
Introduction
Classification vs Regression
Classification example
• With enough training data, the machine should be able to classify the animal correctly as a dog.
[Diagram: actual input image mapped to one class in {COW, DOG, CAT, HORSE}]
Introduction
Classification vs Regression
Regression
• The machine has some information about the data in hand.
• It tries to predict certain outcomes.
• Regression maps inputs to continuous outputs.
Introduction
Classification vs Regression
Regression example
[Diagram: training set of house images paired with prices ($10,000 ... $1,000,000), mapped into the continuous range [10,000 ; 1,000,000]]
Introduction
Classification vs Regression
Regression example
• With enough information on the size of the house, its price should be approximated accurately.
[Diagram: actual input house mapped to a price estimate, e.g. $150,000, $500,000, or $800,000]
Introduction
Binary vs Multiclass
• Binary:
• Samples must be classified into one of two categories
• Patient is Sick/Healthy, price is Higher/Lower than a certain value.
• Multiclass:
• Samples must be classified into one of many categories
• Animal is a Dog/Cat/Horse, fruit is an Apple/Orange/Banana/Strawberry.
Introduction
Classifiers
There are many algorithms for classification:
• Linear Classifiers:
  • Naïve Bayes Classifier
  • Perceptron
• Support Vector Machines
• Quadratic Classifiers
• Decision Trees:
  • ID3
• Random Forests
• Kernel Estimation
• K-Nearest Neighbor
Outline
• Introduction
• Support Vector Machine (SVM)
• Algorithm
• Applications
• K-Nearest Neighbors (KNN)
• Algorithm
• Applications
• Demo
• Decision Trees
• ID3 Algorithm
• Demo
• Conclusion
SVM
Algorithm
• This classifier is given labeled training data (supervised learning); the algorithm outputs an optimal hyperplane that categorizes new examples into classes.
SVM
Algorithm
• The goal is to find a hyperplane that separates all vectors into 2 classes.
SVM
Algorithm
• There exists more than one hyperplane that can split the two classes.
SVM
Algorithm
• The best choice is the hyperplane that has the maximum margin from both classes.
SVM
Algorithm
• The aim is to maximize the total margin by minimizing the norm of the weight vector.
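The margin-maximization idea above can be sketched with scikit-learn (an assumption on our part, since the slides do not name a library; the data is a made-up toy example). An `SVC` with a linear kernel finds the separating hyperplane w·x + b = 0 that maximizes the margin by minimizing ||w||:

```python
# Minimal linear SVM sketch: fit a maximum-margin hyperplane on toy data.
from sklearn.svm import SVC

# Toy 2-D training data: two linearly separable classes.
X = [[0, 0], [1, 1], [1, 0], [4, 4], [5, 5], [4, 5]]
y = [0, 0, 0, 1, 1, 1]

clf = SVC(kernel="linear", C=1.0)
clf.fit(X, y)

# New points near each cluster are assigned to that cluster's class.
print(clf.predict([[0.5, 0.5], [4.5, 4.5]]))
# The support vectors are the training points that lie on the margin.
print(clf.support_vectors_)
```

The margin is determined only by the support vectors; removing any other training point leaves the hyperplane unchanged.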
SVM
Applications
• Spam filtering [spam, not spam]
• Sentiment classification [positive, negative]
• For example, classifying reviews
• Customer service message classification [urgent, not urgent]
• Information retrieval [relevant, not relevant]
KNN
Algorithm
• Learning: memorize all training data.
• Labeling a new example:
  • Compute the distance from the input to each training example.
  • Select the K closest instances.
  • Return the class with the most instances among those selected.
• If K = 1, the case is simply assigned to the class of its nearest neighbor.
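The steps above can be sketched from scratch (function and variable names are illustrative, not from the slides), using Euclidean distance:

```python
# From-scratch K-Nearest Neighbors: distance, select K closest, majority vote.
from collections import Counter
import math

def knn_classify(train, new_point, k=3):
    """train: list of (features, label) pairs; returns the predicted label."""
    # 1. Compute the distance from the input to each training example.
    dists = sorted((math.dist(x, new_point), label) for x, label in train)
    # 2. Select the K closest instances.
    nearest = [label for _, label in dists[:k]]
    # 3. Return the class with the most instances among those selected.
    return Counter(nearest).most_common(1)[0][0]

train = [((1, 1), "red"), ((1, 2), "red"), ((8, 8), "blue"), ((9, 8), "blue")]
print(knn_classify(train, (2, 1), k=3))  # prints "red"
```

Note that there is no training phase beyond storing the data; all the work happens at prediction time, which is why KNN is called a lazy learner.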
KNN
Algorithm
Distance functions
Small Demo
• [Link]
KNN
Algorithm
• If k = 3 (solid line circle) it is assigned to the red
class because there are 2 triangles and only 1
square inside the inner circle.
• If k = 5 (dashed line circle) it is assigned to the
blue class (3 squares vs. 2 triangles inside the
outer circle).
KNN
Applications
• Radar target classification
• GPS
KNN
Demo
[Link]
Decision Trees
• Decision trees are among the oldest and most widely used techniques in machine learning. They go back decades, and they are extremely robust.
• Decision trees use a trick that lets you do non-linear decision making with simple linear decision surfaces.
Decision Trees
• We can use a tree of questions as a representation language; each node of the tree is a test on an attribute.
• To learn, we perform a search in the space of trees of questions.
Decision Trees
Example
[Diagram: example decision tree splitting on Sunny and Windy?, with O/X class labels at the leaves]
Decision Trees
Example
[Diagram: decision tree anatomy, labeling a decision node and a leaf node]
Decision Trees
ID3 Algorithm
• One of the first algorithms for building decision trees.
• It employs a top down, greedy search through the space of possible
branches.
• ID3 uses Entropy and Information Gain to construct a decision tree.
• Entropy to calculate the homogeneity of a sample.
• Information gain is based on the decrease in entropy after a dataset is split on an attribute.
Decision Trees
ID3 Algorithm
• Entropy:
  • If the sample is completely homogeneous, the entropy is zero; if the sample is equally divided, the entropy is one.
• Formulas:
  • 1. Entropy of the target: E(S) = Σᵢ −pᵢ log₂(pᵢ)
  • 2. Entropy of a split on attribute X: E(T, X) = Σ_c∈X P(c) · E(c)
Decision Trees
ID3 Algorithm
• Entropy:
• Formula 1 (entropy of the target): E(S) = Σᵢ −pᵢ log₂(pᵢ), where pᵢ is the proportion of samples in class i.
Decision Trees
ID3 Algorithm
• Entropy:
• Formula 2 (entropy of a split on attribute X): E(T, X) = Σ_c∈X P(c) · E(c), the weighted average entropy of the subsets produced by the split.
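The two entropy formulas can be sketched in a few lines (function names are illustrative): formula 1 gives the entropy of a label sequence, and formula 2 weights the entropy of each subset produced by a split by that subset's proportion of the data.

```python
# Entropy (formula 1) and weighted split entropy (formula 2) for ID3.
from collections import Counter
import math

def entropy(labels):
    """Formula 1: E(S) = sum over classes i of -p_i * log2(p_i)."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def split_entropy(subsets):
    """Formula 2: E(T, X) = sum over subsets c of P(c) * E(c)."""
    total = sum(len(s) for s in subsets)
    return sum(len(s) / total * entropy(s) for s in subsets)

print(entropy(["yes", "yes", "no", "no"]))  # equally divided: prints 1.0
print(entropy(["yes"] * 3) == 0)            # homogeneous: prints True
```

These match the boundary cases on the slide: a homogeneous sample has entropy zero, and a two-class sample split 50/50 has entropy one.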
Decision Trees
ID3 Algorithm
• Information gain:
  • The information gain is based on the decrease in entropy after a dataset is split on an attribute.
  • Constructing a decision tree is all about finding the attribute that returns the highest information gain.
Decision Trees
ID3 Algorithm
• Step 1: Calculate the entropy of the target.
• Step 2: Calculate the information gain for each attribute.
• Step 3: Choose the attribute with the largest information gain as the decision node.
• Step 4: Check the entropy of each branch:
  • 4a: A branch with entropy of 0 is a leaf node.
  • 4b: A branch with entropy greater than 0 is split further.
• Step 5: Go back to step 1.
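Step 2 can be sketched as an information-gain helper (names and the toy weather data are illustrative assumptions, not from the slides): Gain(S, A) = E(S) − Σ_v |S_v|/|S| · E(S_v), where v ranges over the values of attribute A.

```python
# Information gain: entropy of the target minus the weighted split entropy.
from collections import Counter, defaultdict
import math

def entropy(labels):
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(rows, labels, attr_index):
    """Decrease in entropy after splitting the rows on one attribute."""
    groups = defaultdict(list)
    for row, label in zip(rows, labels):
        groups[row[attr_index]].append(label)
    split_e = sum(len(g) / len(labels) * entropy(g) for g in groups.values())
    return entropy(labels) - split_e

# Toy weather data: attributes are (outlook, windy), label is play yes/no.
rows = [("sunny", "no"), ("sunny", "yes"), ("rain", "no"), ("rain", "yes")]
labels = ["yes", "yes", "no", "no"]
print(information_gain(rows, labels, 0))  # outlook predicts the label: 1.0
print(information_gain(rows, labels, 1))  # windy is uninformative: 0.0
```

Step 3 then simply picks the attribute index with the largest gain as the decision node.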
Decision Trees
ID3 Algorithm
• Step 1: Calculate entropy of the target.
Decision Trees
ID3 Algorithm
• Step 2: Calculate the information gain for each attribute:
  • The dataset is split on different attributes.
  • Calculate the entropy of each split (using formula 2).
  • Calculate the information gain for each split.
Decision Trees
ID3 Algorithm
• Step 3: Choose the attribute with the largest information gain as the decision node.
Decision Trees
ID3 Algorithm
• Step 4: Check the entropy of each branch
• 4a: A branch with entropy of 0 is a leaf node
Decision Trees
ID3 Algorithm
• Step 4: Check the entropy of each branch
• 4b: A branch with entropy greater than 0 is split further.
Decision Trees
ID3 Algorithm
• Step 5:
  • The ID3 algorithm is run recursively on the non-leaf branches until all data is classified.
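The five steps, including the recursion, fit in a compact from-scratch sketch (function and variable names are illustrative assumptions; the toy data is made up):

```python
# Recursive ID3: pick the best attribute, split, recurse until leaves.
from collections import Counter, defaultdict
import math

def entropy(labels):
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def id3(rows, labels, attrs):
    # Leaf: branch entropy is 0 (all labels agree) or no attributes remain.
    if entropy(labels) == 0 or not attrs:
        return Counter(labels).most_common(1)[0][0]

    # Choose the attribute with the largest information gain, i.e. the
    # smallest weighted entropy after the split (E(S) is constant here).
    def split_entropy(a):
        groups = defaultdict(list)
        for row, lab in zip(rows, labels):
            groups[row[a]].append(lab)
        return sum(len(g) / len(labels) * entropy(g) for g in groups.values())

    best = min(attrs, key=split_entropy)
    tree = {}
    for value in {row[best] for row in rows}:
        sub = [(r, l) for r, l in zip(rows, labels) if r[best] == value]
        sub_rows, sub_labels = zip(*sub)
        tree[(best, value)] = id3(
            list(sub_rows), list(sub_labels), [a for a in attrs if a != best]
        )
    return tree

rows = [("sunny", "no"), ("sunny", "yes"), ("rain", "no"), ("rain", "yes")]
labels = ["O", "O", "X", "X"]
print(id3(rows, labels, [0, 1]))  # splits on attribute 0 only
```

On this toy data attribute 0 alone separates the classes, so both branches reach entropy 0 after one split and become leaves.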
Decision Trees
Demo
Decision Tree Demonstration
[Diagram: example decision tree splitting on Sunny and Windy?, with O/X class labels at the leaves]
Conclusion
• We introduced ML and reviewed its core concepts and their differences:
  • SL vs UL - Classification vs Regression - Binary vs Multiclass
• We then dug deeper into supervised classification:
  • Its main algorithms
  • The reasoning behind each algorithm
  • Their applications
  • Some demos
THANK YOU!