0% found this document useful (0 votes)

39 views66 pages

IIM Rohtak: Machine Learning Insights

The document discusses supervised learning in machine learning, focusing on predictive analysis and classification techniques such as decision trees, logistic regression, and k-means clustering. It emphasizes the importance of model accuracy, confusion matrices, and metrics like precision, recall, and F1-score for evaluating model performance. Additionally, it introduces the concept of Cohen's kappa coefficient for assessing agreement between classifiers.

Uploaded by

Mirgank Tirkha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

39 views66 pages

IIM Rohtak: Machine Learning Insights

Uploaded by

Mirgank Tirkha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

IIM Rohtak

Business Analytics
Chapters
6-12 Today objective

Supervised Learning
Machine Learning Approach
Predictive Analysis
Classification analysis
Accuracy Theory & Decision Tree based Approach
Indian Institute of Management (IIM),Rohtak
IIM Rohtak

ORANGE

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

ORANGE
Clustering

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

ORANGE

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

ORANGE

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

ORANGE

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

ORANGE

Now select any cluster ,the

same may be you will get in
data table and scatter plot

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

ORANGE
K Means

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

ORANGE
Understanding K means

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

ORANGE

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

ORANGE

Randomly allocation of data points

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

ORANGE

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

ORANGE

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

ORANGE
k-Means
[Link]

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

ORANGE

Let you want k=4

,then fixed k=4

But now you asked

from system
(silhouette) and find
optimal value of k,
then please select
from tab under k
means window

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

ORANGE

Now system is suggesting

k=2 (score is high among
others)

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

ORANGE

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

ORANGE

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Predictive Learning??
MARCH month : In most of Indian
school ,Annual yearly examination was
held?
Now I am sure most of the parents(re
call your conversation with your parent )
saying you do hard work in the subjects
like math's ,science etc.
On what basis your parents suggest to you
??and why ???
Indian Institute of Management (IIM),Rohtak
IIM Rohtak

Machine Learning

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Machine Learning
Supervised learning

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Machine Learning

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Machine Learning

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Classification

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Classification
•Logistic Regression
•Support Vector Machine
•k-Nearest Neighbours
•Decision Tree
•Ensemble Method(bagging
and Boosting )
•Random Forest
•Gradient Boost
•Ada Boost
•Neural Network
•Navie Bayes

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Classification

[Link]
a-types-in-statistics-347e152e8bee Indian Institute of Management (IIM),Rohtak
IIM Rohtak

Classification of machine learning

algorithms

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Classification

Decision Trees are used to predict a Label (usually

Binary ) dependent variables such as:
• Will a person suffer a heart attack in the next year?
• Will a voter vote BJP in the next country
election?
• Will student X clear this time IAS exam?
For such type of problem ,decision tree gives basic
solution based upon past data.

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Decision Tree
Tree based learning algorithms are considered to be one of the best
and mostly used supervised learning methods. Tree based methods
empower predictive models with high accuracy, stability and ease of
interpretation. Unlike linear models, they map non-linear
relationships quite well. They are adaptable at solving any kind of
problem at hand.

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Classification
(a decision tree structure)
• Model construction: describing a set of predetermined classes
– Each tuple/sample is assumed to belong to a predefined
class, as determined by the class label attribute
– The set of tuples used for model construction is training set
– The model is represented as classification rules, decision
trees, or mathematical formulae
• Model usage: for classifying future or unknown objects
– Estimate accuracy of the model
• The known label of test sample is compared with the
classified result from the model
• Accuracy rate is the percentage of test set samples that
are correctly classified by the model
– If the accuracy is acceptable, use the model to classify data
tuples whose class labels are not known
Indian Institute of Management (IIM),Rohtak
IIM Rohtak

Confusion Matrix

150

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Confusion Matrix

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Confusion Matrix

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Confusion Matrix

False Positive (FP) – Type I Error False Negative (FN) – Type II Error
•The predicted value was falsely •The predicted value was falsely
predicted. predicted.
•The actual value was negative, but •The actual value was positive, but
the model predicted a positive the model predicted a negative
value. value.
•Also known as the type I error. •Also known as the type II error.
Indian Institute of Management (IIM),Rohtak
IIM Rohtak

Confusion Matrix

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Confusion Matrix
It is easy to “game” the
accuracy metric when making
predictions for a dataset like
this. To do that, you must
predict that nothing will
happen and label every email
as non-spam. The model
predicting the majority (non-
spam) class all the time will
mostly be right, leading to very
high accuracy.

In this specific example, the accuracy is 95%: yes,

the model missed every spam email, but it was
still right in 57 cases out of 60.
However, this accuracy is now meaningless. The
model does not serve the primary goal or help
identify the target event.

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Confusion Matrix
Pros:
•Accuracy is a helpful metric when you deal with balanced classes and
care about the overall model “correctness” and not the ability to
predict a specific class.
•Accuracy is easy to explain and communicate.
Cons:
•If you have imbalanced classes, accuracy is less useful since it gives
equal weight to the model’s ability to predict all categories.
•Communicating accuracy in such cases can be misleading and disguise
low performance on the target class.
When you see an imbalanced example like the spam example above, it
is very intuitive to suggest a different approach to model evaluation
that overcomes the limitation of accuracy: we do not need the
“overall” correctness. We want to find spam emails, after all! Can we
focus on how well we see and detect them specifically?
Precision and recall are the two metrics that help with that.
Indian Institute of Management (IIM),Rohtak
IIM Rohtak

Confusion Matrix
Calculate Recall | Sensitivity | True Positive Rate — TPR

=TP/Actual Yes

95
95 + 5
Indian Institute of Management (IIM),Rohtak
IIM Rohtak

Confusion Matrix

The false positive rate is calculated as

FP/FP+TN, where FP is the number of
false positives and TN is the number of
true negatives (FP+TN being the total
number of negatives). It’s the
probability that a false alarm will be
raised: that a positive result will be
given when the true value is negative.
5 5
= = = 10%
5 + 45 50

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Confusion Matrix

𝑇𝑁
=
𝑇𝑁 + 𝐹𝑃
𝑇𝑁 = 45 = 45 =
90%
𝑇𝑁 + 𝐹𝑃 45 + 5 50

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Confusion Matrix
False Negative – The predicted value is negative, but the actual value is
positive, i.e., the model falsely predicted the positive class labels to be
negative.
False Negative Rate – The ratio of false-negative and totally positive,
i.e.,
FNR = FN / P
FNR = FN / (FN+TP)
NOTE: False negative (FN) is also called ‘type-2 error’.

FNR = FN / (FN+TP)=5/5+95=5/100=5%

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Confusion Matrix
The formula for calculating precision= =TP/Predicted Yes

When it is predicted Yes, how often is it correct?

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Confusion Matrix

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Confusion Matrix
What is F1-Score?
F1-score is the harmonic mean of precision and recall. It gives us an
overall measure of classifier performance by balancing both the
precision and recall values. It is given by the formula

It ranges from 0 to 1, with 1 being the best possible score.

Why is the f1-score the harmonic mean of precision and recall rather
than the arithmetic mean?

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Confusion Matrix

Accuracy: Overall, how often is the classifier correct?

•(TP+TN)/total = (100+50)/165 = 0.91
Misclassification Rate: Overall, how often is it wrong?

•(FP+FN)/total = (10+5)/165 = 0.09

•equivalent to 1 minus Accuracy
•also known as "Error Rate"
Indian Institute of Management (IIM),Rohtak
IIM Rohtak

Confusion Matrix
True Positive Rate: When it's actually yes, how often does it
predict yes?
•TP/actual yes = 100/105 = 0.95
•also known as "Sensitivity" or "Recall"
False Positive Rate: When it's actually no, how often does it
predict yes?
•FP/actual no = 10/60 = 0.17
True Negative Rate: When it's actually no, how often does it
predict no?
•TN/actual no = 50/60 = 0.83
•equivalent to 1 minus False Positive Rate
Precision: When it predicts yes, how often is it correct?
•TP/predicted yes = 100/110 = 0.91
•Prevalence: How often does the yes condition actually occur
in our sample?
actual yes/total = 105/165 = 0.64
Indian Institute of Management (IIM),Rohtak
IIM Rohtak

Confusion Matrix
Cohen's kappa coefficient?
What is the Cohen's kappa coefficient?
The Cohen's kappa coefficient is a statistic that
quantifies the agreement between two raters or
classifiers, taking into account the possibility of
random agreement. It is often used to assess the
reliability of human annotations or the accuracy of
machine learning models for classification tasks. The
kappa coefficient ranges from -1 to 1, where -1
means complete disagreement, 0 means random
agreement, and 1 means perfect agreement. A higher
kappa value indicates a better performance of the
model or the rater. Indian Institute of Management (IIM),Rohtak
IIM Rohtak

Confusion Matrix
Cohen's kappa coefficient?

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Confusion Matrix
Cohen's kappa coefficient?

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Confusion Matrix
Cohen's kappa coefficient?

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Confusion Matrix
Cohen's kappa coefficient?

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Confusion Matrix

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Confusion Matrix

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Confusion Matrix

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Confusion Matrix

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Model Construction

Indian Institute of Management (IIM),Rohtak

IIM Rohtak

Process (1): Model Construction

Classification
Algorithms
Training
Data

NAME RANK YEARS TENURED Classifier

Mike Assistant Prof 3 no (Model)
Mary Assistant Prof 7 yes
Bill Professor 2 yes
Jim Associate Prof 7 yes
IF rank = ‘professor’
Dave Assistant Prof 6 no
OR years > 6
Anne Associate Prof 3 no
THEN tenured = ‘yes’
Indian Institute of Management (IIM),Rohtak
IIM Rohtak

Process (2): Using the Model in

Prediction

Classifier

Testing
Data Unseen Data

(Jeff, Professor, 4)
NAME RANK YEARS TENURED
T om A ssistant P rof 2 no Tenured?
M erlisa A ssociate P rof 7 no
G eorge P rofessor 5 yes
Joseph A ssistant P rof 7 yes
Indian Institute of Management (IIM),Rohtak
IIM Rohtak

Decision Tree Induction: Training Dataset

age income student credit_rating buys_computer
<=30 high no fair no
<=30 high no excellent no
31…40 high no fair yes
>40 medium no fair yes
>40 low yes fair yes
>40 low yes excellent no
31…40 low yes excellent yes
<=30 medium no fair no
<=30 low yes fair yes
>40 medium yes fair yes
<=30 medium yes excellent yes
31…40 medium no excellent yes
31…40 high yes fair yes
>40 medium no excellent no
YES

>40 low no fair ??

NO
Indian Institute of Management (IIM),Rohtak
IIM Rohtak

>40 low no fair ?? YES

Decision Tree Induction: An Example
age income student credit_rating buys_computer
❑ Training data set: Buys_computer <=30 high no fair no
❑ Resulting tree: <=30 high no excellent no
31…40 high no fair yes
>40 medium no fair yes
>40 low yes fair yes
>40 low yes excellent no
31…40 low yes excellent yes
age? <=30 medium no fair no
<=30 low yes fair yes
>40 medium yes fair yes
<=30 medium yes excellent yes
<=30 overcast
31..40 >40 31…40 medium no excellent yes
31…40 high yes fair yes
>40 medium no excellent no

student? yes credit rating?

no yes excellent fair

no yes yes
Indian Institute of Management (IIM),Rohtak
IIM Rohtak

Thank you !!! 66

Indian Institute of Management (IIM),Rohtak

Decision Tree Learning in Machine Learning
No ratings yet
Decision Tree Learning in Machine Learning
91 pages
Supervised Learning Techniques in IIM Rohtak
No ratings yet
Supervised Learning Techniques in IIM Rohtak
77 pages
Market Basket Analysis Techniques
No ratings yet
Market Basket Analysis Techniques
67 pages
LightGBM in Supervised Learning
No ratings yet
LightGBM in Supervised Learning
121 pages
TOPSIS Method for Multi-Criteria Decision Making
No ratings yet
TOPSIS Method for Multi-Criteria Decision Making
35 pages
IIM Rohtak: Market Basket Analysis Guide
No ratings yet
IIM Rohtak: Market Basket Analysis Guide
39 pages
Akooda Case Study: Enhancing Decision Support
No ratings yet
Akooda Case Study: Enhancing Decision Support
54 pages
Quantitative Techniques Question Paper
No ratings yet
Quantitative Techniques Question Paper
3 pages
TGMC GitHub Data Model Overview
No ratings yet
TGMC GitHub Data Model Overview
77 pages
IIM Rohtak Multi-Criteria Decision Making
No ratings yet
IIM Rohtak Multi-Criteria Decision Making
27 pages
Answer N6
No ratings yet
Answer N6
66 pages
Operations Management Decision Models
No ratings yet
Operations Management Decision Models
7 pages
Role of Quantitative Techniques in Business
No ratings yet
Role of Quantitative Techniques in Business
12 pages
Data Analysis for Business Decisions
No ratings yet
Data Analysis for Business Decisions
66 pages
Answer N6
No ratings yet
Answer N6
75 pages
Slide CH 2 - Decision Making
No ratings yet
Slide CH 2 - Decision Making
32 pages
Quantitative Techniques in Management
No ratings yet
Quantitative Techniques in Management
1 page
Quantitative Techniques and Method
No ratings yet
Quantitative Techniques and Method
165 pages
Use Case Point Estimation Guide
No ratings yet
Use Case Point Estimation Guide
50 pages
Analytics Course with Placement Support
No ratings yet
Analytics Course with Placement Support
14 pages
Introduction to Operations Research Techniques
No ratings yet
Introduction to Operations Research Techniques
26 pages
2b DecisionTheory ModA
No ratings yet
2b DecisionTheory ModA
29 pages
IT Practical File: Data Analysis Techniques
No ratings yet
IT Practical File: Data Analysis Techniques
53 pages
ARMA & ARIMA Model Overview and Analysis
No ratings yet
ARMA & ARIMA Model Overview and Analysis
5 pages
Sample
No ratings yet
Sample
96 pages
Operations Strategy and Decision Making
No ratings yet
Operations Strategy and Decision Making
42 pages
IIT Roorkee Data Analytics Certification
No ratings yet
IIT Roorkee Data Analytics Certification
15 pages
Feature Selection in Machine Learning
No ratings yet
Feature Selection in Machine Learning
53 pages
Quantitative Methods for Business Success
No ratings yet
Quantitative Methods for Business Success
34 pages
Decision Tree Analysis in Operations Research
No ratings yet
Decision Tree Analysis in Operations Research
6 pages
Quantitative Management Techniques Overview
No ratings yet
Quantitative Management Techniques Overview
15 pages
CH 01 PDF
No ratings yet
CH 01 PDF
28 pages
CH 01 PDF
No ratings yet
CH 01 PDF
28 pages
Overview of Quantitative Decision Making
No ratings yet
Overview of Quantitative Decision Making
13 pages
Overview of Quantitative Decision Making
0% (1)
Overview of Quantitative Decision Making
305 pages
MBA Quantitative Techniques For Management Notes 1 PDF
57% (7)
MBA Quantitative Techniques For Management Notes 1 PDF
125 pages
Quantitative Techniques For Management - Removed
No ratings yet
Quantitative Techniques For Management - Removed
149 pages
Quantitative Techniques For Management PDF
100% (6)
Quantitative Techniques For Management PDF
507 pages
Quantitative Techniques for Management
No ratings yet
Quantitative Techniques for Management
249 pages
Quantitative Techniques For Management PDF
88% (8)
Quantitative Techniques For Management PDF
507 pages
4Ps Marketing Analysis of Patanjali
No ratings yet
4Ps Marketing Analysis of Patanjali
22 pages
Random Forest Model for Car Classification
No ratings yet
Random Forest Model for Car Classification
15 pages
Investment Banking Overview at TWP
No ratings yet
Investment Banking Overview at TWP
39 pages
Moving Average Forecasting Template
No ratings yet
Moving Average Forecasting Template
3 pages
4Ps Marketing Analysis of Patanjali
No ratings yet
4Ps Marketing Analysis of Patanjali
21 pages
Quantity Discount Inventory Model Guide
No ratings yet
Quantity Discount Inventory Model Guide
2 pages
Crafting Effective Argumentative Essays
No ratings yet
Crafting Effective Argumentative Essays
7 pages
Financial Riddles for Investors
No ratings yet
Financial Riddles for Investors
3 pages
Investment Banking FAQs Explained
100% (1)
Investment Banking FAQs Explained
11 pages
Total Supply Chain Cost Analysis
No ratings yet
Total Supply Chain Cost Analysis
1 page
Student Performance Evaluation Report
No ratings yet
Student Performance Evaluation Report
15 pages
Center of Gravity Calculation Template
No ratings yet
Center of Gravity Calculation Template
2 pages
Eepnagar National Income Accounts Analysis
No ratings yet
Eepnagar National Income Accounts Analysis
2 pages
Macroeconomic Concepts for Business
No ratings yet
Macroeconomic Concepts for Business
35 pages
Probability Analysis: Key Concepts and Metrics
No ratings yet
Probability Analysis: Key Concepts and Metrics
36 pages
GDP Accounting Methods and Challenges
No ratings yet
GDP Accounting Methods and Challenges
21 pages
Principles of Psychological First Aid
No ratings yet
Principles of Psychological First Aid
15 pages
Probability Theory in Management Analysis
No ratings yet
Probability Theory in Management Analysis
54 pages
Community Mental Health Course Overview
No ratings yet
Community Mental Health Course Overview
6 pages
Data Mining and Warehousing Overview
No ratings yet
Data Mining and Warehousing Overview
21 pages
Finding People in Crowd Photos
No ratings yet
Finding People in Crowd Photos
8 pages
CSE B.Tech V Semester Course Syllabus
No ratings yet
CSE B.Tech V Semester Course Syllabus
239 pages
Caltech-UCSD Birds-200 Dataset Overview
No ratings yet
Caltech-UCSD Birds-200 Dataset Overview
8 pages
AI Classifiers Performance Analysis
No ratings yet
AI Classifiers Performance Analysis
3 pages
Rule-Based Classification Overview
No ratings yet
Rule-Based Classification Overview
3 pages
E-Commerce Customer Churn Analysis Report
100% (1)
E-Commerce Customer Churn Analysis Report
43 pages
Text Classification Techniques in NLP
No ratings yet
Text Classification Techniques in NLP
16 pages
Spatial Modelling For Natural and Environmental Vulnerability Through Remote Sensing and GIS in Astrakhan, Russia
No ratings yet
Spatial Modelling For Natural and Environmental Vulnerability Through Remote Sensing and GIS in Astrakhan, Russia
9 pages
Datamites Certified Data Scientist Syllabus PDF
50% (2)
Datamites Certified Data Scientist Syllabus PDF
12 pages
BCS602: Intro to Machine Learning
No ratings yet
BCS602: Intro to Machine Learning
58 pages
Neuro Fuzzy Systems for Heart Disease Diagnosis
No ratings yet
Neuro Fuzzy Systems for Heart Disease Diagnosis
15 pages
NLP and Linguistics in Tech Applications
No ratings yet
NLP and Linguistics in Tech Applications
5 pages
Clustering Methods and Similarity Metrics
No ratings yet
Clustering Methods and Similarity Metrics
53 pages
New Insights on Industrial Policy
No ratings yet
New Insights on Industrial Policy
48 pages
Big Data Research in Indonesia: Challenges & Opportunities
No ratings yet
Big Data Research in Indonesia: Challenges & Opportunities
52 pages
Understanding Fully Connected Layers in CNN
No ratings yet
Understanding Fully Connected Layers in CNN
2 pages
QNDE2009 PerioProbe PDF
No ratings yet
QNDE2009 PerioProbe PDF
9 pages
Aspect-Based Opinion Mining in Reviews
No ratings yet
Aspect-Based Opinion Mining in Reviews
5 pages
Deep Learning For Multigrade Brain Tumor Classification in Smart Healthcare Systems: A Prospective Survey
No ratings yet
Deep Learning For Multigrade Brain Tumor Classification in Smart Healthcare Systems: A Prospective Survey
16 pages
Data Mining Systems Classification
No ratings yet
Data Mining Systems Classification
35 pages
KTU Data Mining Course Syllabus
No ratings yet
KTU Data Mining Course Syllabus
3 pages
Devlox Project Ideas Overview
No ratings yet
Devlox Project Ideas Overview
6 pages
Predictive Maintenance for Track Switches
No ratings yet
Predictive Maintenance for Track Switches
14 pages
Text Classification Techniques Overview
No ratings yet
Text Classification Techniques Overview
73 pages
Supervised Learning: Linear Regression Models
No ratings yet
Supervised Learning: Linear Regression Models
129 pages
Illustrated BERT and ELMo Explained
No ratings yet
Illustrated BERT and ELMo Explained
4 pages
CART: Theory and Applications
100% (1)
CART: Theory and Applications
40 pages
Overview of Computer Aided Process Planning
No ratings yet
Overview of Computer Aided Process Planning
21 pages
Perceptron vs. Feedforward Neural Networks
No ratings yet
Perceptron vs. Feedforward Neural Networks
2 pages

IIM Rohtak: Machine Learning Insights

Uploaded by

IIM Rohtak: Machine Learning Insights

Uploaded by

IIM Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Now select any cluster ,the

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Randomly allocation of data points

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Let you want k=4

But now you asked

Indian Institute of Management (IIM),Rohtak

Now system is suggesting

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Classification of machine learning

Indian Institute of Management (IIM),Rohtak

Decision Trees are used to predict a Label (usually

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

In this specific example, the accuracy is 95%: yes,

Indian Institute of Management (IIM),Rohtak

The false positive rate is calculated as

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

When it is predicted Yes, how often is it correct?

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

It ranges from 0 to 1, with 1 being the best possible score.

Indian Institute of Management (IIM),Rohtak

Accuracy: Overall, how often is the classifier correct?

•(FP+FN)/total = (10+5)/165 = 0.09

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Indian Institute of Management (IIM),Rohtak

Process (1): Model Construction

NAME RANK YEARS TENURED Classifier

Process (2): Using the Model in

Decision Tree Induction: Training Dataset

>40 low no fair ??

>40 low no fair ?? YES

student? yes credit rating?

no yes excellent fair

Thank you !!! 66

You might also like