Machine Learning

LECTURE – 04
CLASSIFICATION—A TWO-STEP PROCESS
 Model construction: describing a set of predetermined classes
  Each tuple/sample is assumed to belong to a predefined class, as determined by the class label attribute
  The set of tuples used for model construction is the training set
  The model is represented as classification rules, decision trees, or mathematical formulae
CLASSIFICATION—A TWO-STEP PROCESS
 Model usage: for classifying future or unknown objects
  Estimate the accuracy of the model (see the sketch below)
   The known label of each test sample is compared with the classified result from the model
   The accuracy rate is the percentage of test set samples that are correctly classified by the model
   The test set is independent of the training set (otherwise over-fitting occurs)
  If the accuracy is acceptable, use the model to classify new data
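A minimal sketch of this accuracy estimate in Python (not from the lecture; the names accuracy, model, and test_set are illustrative), assuming the model is any callable that maps a feature tuple to a class label:

def accuracy(model, test_set):
    """Percentage of test samples whose known label matches the model's prediction."""
    correct = sum(1 for features, label in test_set if model(features) == label)
    return 100.0 * correct / len(test_set)

Here test_set is a list of (features, known_label) pairs kept separate from the training data.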
PROCESS (1): MODEL CONSTRUCTION

Training data are fed to a classification algorithm, which produces the classifier (model).

Training Data:

NAME   RANK             YEARS   TENURED
Mike   Assistant Prof   3       no
Mary   Assistant Prof   7       yes
Bill   Professor        2       yes
Jim    Associate Prof   7       yes
Dave   Assistant Prof   6       no
Anne   Associate Prof   3       no

Learned classification rule:
IF rank = ‘professor’ OR years > 6
THEN tenured = ‘yes’
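A small sketch of this learned rule as Python code (the function name tenured_rule is illustrative):

def tenured_rule(rank, years):
    """Classification rule learned from the training data above."""
    return "yes" if rank == "Professor" or years > 6 else "no"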
PROCESS (2): MODEL USAGE FOR PREDICTION

The classifier is applied to the testing data and then to unseen data.

Testing Data:

NAME      RANK             YEARS   TENURED
Tom       Assistant Prof   2       no
Merlisa   Associate Prof   7       no
George    Professor        5       yes
Joseph    Assistant Prof   7       yes

Unseen Data: (Jeff, Professor, 4) → Tenured?
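Continuing the sketch above, the unseen tuple could be classified like this:

print(tenured_rule("Professor", 4))   # -> "yes": the rank = ‘professor’ condition fires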


Naive Bayes Classifier
 Calculates the probability of a hypothesis (class label) given the evidence (input features)
 Naive Bayes is a probabilistic algorithm commonly used for classification tasks, especially in natural language processing and text classification
 Simple but effective algorithm
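The underlying formula is Bayes' theorem: for a class C and feature vector X,

P(C|X) = P(X|C) · P(C) / P(X)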


Introduction
 It's called "naive" because it makes a strong assumption of feature independence, which means it assumes that the features used to describe the input are conditionally independent given the class label.
 Consider a bag of fruits containing apples, bananas, and oranges, having features like color, shape, and size.
 Teach a computer to recognize these fruits based on their features.
Naïve Bayes Overview
1. Collect Data:
   ◦ Gather data
   ◦ Note down its features – like whether it's red or yellow, large or small, and so on.
2. Calculate Probabilities: Naive Bayes calculates the probability of a fruit being, say, an apple, given its features. It does this by looking at how often certain features (like being red and round) occur in your data set for apples.
Naïve Bayes Overview
3. Assumption of Independence (Naive Assumption): The "naive" part of Naive Bayes comes from assuming that the features you're considering (like color, shape, size) are independent of each other. It means that knowing the color of the fruit doesn't give you any information about its shape, and vice versa.
4. Bayes' Theorem: Calculates the probability of an event (like a fruit being an apple) based on the probabilities of certain related events (like the fruit being red and round).
Naïve Bayes Overview
5. Classification: When you give the classifier the features of a new
fruit, it calculates the probabilities for each category (apple, banana, or
orange). The category with the highest probability is the classifier's
best guess for what the fruit is.
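A minimal sketch of these five steps in Python, using a tiny made-up fruit data set (the data and names are illustrative, not from the lecture). It counts feature frequencies per class and picks the class with the highest P(C) · P(x1|C) · ... · P(xn|C):

from collections import Counter, defaultdict

# Step 1: collect data as (features, label) pairs (made-up examples).
data = [
    ({"color": "red",    "shape": "round", "size": "small"}, "apple"),
    ({"color": "red",    "shape": "round", "size": "small"}, "apple"),
    ({"color": "yellow", "shape": "long",  "size": "small"}, "banana"),
    ({"color": "yellow", "shape": "long",  "size": "large"}, "banana"),
    ({"color": "orange", "shape": "round", "size": "small"}, "orange"),
]

# Step 2: count how often each class and each (feature, value) pair occurs per class.
class_counts = Counter(label for _, label in data)
feature_counts = defaultdict(Counter)   # feature_counts[label][(feature, value)] -> count
for features, label in data:
    for f, v in features.items():
        feature_counts[label][(f, v)] += 1

def classify(features):
    """Steps 3-5: assume feature independence, apply Bayes' theorem, pick the best class."""
    best_class, best_score = None, 0.0
    total = sum(class_counts.values())
    for label, count in class_counts.items():
        score = count / total                                 # P(C)
        for f, v in features.items():
            score *= feature_counts[label][(f, v)] / count    # P(x_k | C), no smoothing here
        if score > best_score:
            best_class, best_score = label, score
    return best_class

print(classify({"color": "red", "shape": "round", "size": "small"}))   # -> "apple"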
Naïve Bayes Classifier: An Example
 P(Ci):
   P(buys_computer = “yes”) = 9/14 = 0.643
   P(buys_computer = “no”) = 5/14 = 0.357
 Compute P(X|Ci) for each class:
   P(age = “<=30” | buys_computer = “yes”) = 2/9 = 0.222
   P(age = “<=30” | buys_computer = “no”) = 3/5 = 0.6
   P(income = “medium” | buys_computer = “yes”) = 4/9 = 0.444
   P(income = “medium” | buys_computer = “no”) = 2/5 = 0.4
Naïve Bayes Classifier: An Example
   P(student = “yes” | buys_computer = “yes”) = 6/9 = 0.667
   P(student = “yes” | buys_computer = “no”) = 1/5 = 0.2
   P(credit_rating = “fair” | buys_computer = “yes”) = 6/9 = 0.667
   P(credit_rating = “fair” | buys_computer = “no”) = 2/5 = 0.4
Naïve Bayes Classifier: An Example
 X = (age <= 30, income = medium, student = yes, credit_rating = fair)
 P(X|Ci):
   P(X|buys_computer = “yes”) = 0.222 × 0.444 × 0.667 × 0.667 = 0.044
   P(X|buys_computer = “no”) = 0.6 × 0.4 × 0.2 × 0.4 = 0.019
 P(X|Ci) × P(Ci):
   P(X|buys_computer = “yes”) × P(buys_computer = “yes”) = 0.044 × 0.643 = 0.028
   P(X|buys_computer = “no”) × P(buys_computer = “no”) = 0.019 × 0.357 = 0.007
 Therefore, X is classified as buys_computer = “yes”
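A short Python sketch reproducing this calculation (the fractions are taken directly from the example above):

p_yes, p_no = 9/14, 5/14
px_yes = (2/9) * (4/9) * (6/9) * (6/9)   # age<=30, income=medium, student=yes, credit=fair | yes
px_no  = (3/5) * (2/5) * (1/5) * (2/5)   # the same features | no

print(px_yes * p_yes)   # ~0.028
print(px_no * p_no)     # ~0.007  -> predict buys_computer = "yes"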
Avoiding the Zero-Probability Problem
 Naïve Bayesian prediction requires each conditional probability to be non-zero; otherwise, the predicted probability will be zero:

   P(X|Ci) = ∏_{k=1}^{n} P(x_k|Ci)

 Ex. Suppose a dataset with 1000 tuples: income = low (0 tuples), income = medium (990), and income = high (10)
Avoiding the Zero-Probability Problem
 Use the Laplacian correction (or Laplacian estimator)
   ◦ Add 1 to each case:
     Prob(income = low) = 1/1003
     Prob(income = medium) = 991/1003
     Prob(income = high) = 11/1003
   ◦ The “corrected” probability estimates are close to their “uncorrected” counterparts
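A small sketch of the Laplacian correction in Python, assuming the category counts are stored in a dict (names are illustrative):

def laplace_smoothed_probs(counts):
    """Add 1 to every category count so that no estimated probability is zero."""
    total = sum(counts.values()) + len(counts)
    return {value: (count + 1) / total for value, count in counts.items()}

# The example from the slide: 1000 tuples split as low=0, medium=990, high=10.
print(laplace_smoothed_probs({"low": 0, "medium": 990, "high": 10}))
# -> low: 1/1003, medium: 991/1003, high: 11/1003 (printed as floats)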
Solve this:
