Classification is the task of assigning a class label to an input pattern.
The class label indicates
one of a given set of classes. The classification is carried out with the help of a model obtained
using a learning procedure. According to the type of learning used, there are two categories of
classification, one using supervised learning and the other using unsupervised learning.
Supervised learning makes use of a set of examples which already have the class labels
assigned to them. Unsupervised learning attempts to find inherent structures in the
data. Semi-supervised learning makes use of a small number of labeled examples and a large
number of unlabeled examples to learn the classifier.
Sensing
The sensors in a system are what receive the input data, and they may vary depending
on the purpose of the system. They are usually some form of transducer, such as a camera or
a microphone.
Segmentation
After receiving the input data, the different patterns need to be separated. Segmentation is
one of the toughest problems of pattern recognition because many patterns tend to
overlap and intermingle. For example, trying to recognize the pattern of the individual sound
"s" in the two words "see" and "son" would prove difficult, because the sound is pronounced
differently in the two words, and using the same model to segment the "s" would not be
accurate.
Feature Extraction and Classification
Here, the goal is to characterize the data to be recognized by measurements that will give
the same results for data in the same category and different results for data in different
categories. This leads to finding distinguishing features that are invariant to any
transformations of the data. How well the input can be classified into different categories depends
on the features of the data. While perfect classification is often impossible, an easier task is to
find the probability of the data fitting one of the categories.
Post Processing
The post-processor uses the output of the classifier to decide on the recommended action on
the data.
The image to the right shows the various components of a pattern recognition
system.
The design of a pattern recognition system also involves repeating the design cycle, which
contains different activities. The different activities involved are:
Data Collection
Feature and Model Choices: The choice of the distinguishing features we will be looking
for is a critical step. Prior knowledge about the incoming data also helps in selecting the
right features.
Training: The process of using the data to determine the classifier is known as training the
classifier.
Evaluation: Evaluation is important to measure the performance of the system and also
indicate any room for improvement.
The image to the right shows an example of the design cycle for a pattern recognition
system.
Bayesian Decision Theory
Introduction
Bayesian decision theory is a fundamental statistical approach to the problem of pattern
classification. It is considered the ideal case in which the probability structure underlying the
categories is known perfectly. While this sort of situation rarely occurs in practice, it permits
us to determine the optimal (Bayes) classifier against which we can compare all other
classifiers. Moreover, in some problems it enables us to predict the error we will get when we
generalize to novel patterns.
This approach is based on quantifying the tradeoffs between various classification decisions
using probability and the costs that accompany such decisions. It makes the assumption that
the decision problem is posed in probabilistic terms, and that all of the relevant probability
values are known.
One of the most well-known equations in the world of statistics and probability is Bayes’
Theorem (see formula below). The basic intuition is that the probability of some class or event
occurring, given some feature (i.e. attribute), is calculated based on the likelihood of the
feature’s value and any prior information about the class or event of interest. This seems like a
lot to digest, so I will break it down for you. First off, the case of cancer detection is a two-
class problem. The first class, ω1, represents the event that a tumor is present, and ω2
represents the event that a tumor is not present.
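For this two-class problem, Bayes’ Theorem can be written as

P(ωi|x) = p(x|ωi) P(ωi) / p(x), for i = 1, 2

where P(ωi) is the prior, p(x|ωi) is the likelihood, p(x) is the evidence, and P(ωi|x) is the posterior.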
Prior
There are four parts to Bayes’ Theorem: Prior, Evidence, Likelihood, and Posterior. The
priors, P(ω1) and P(ω2), define how likely it is for event ω1 or ω2 to occur in nature. It is
important to realize that the priors vary depending on the situation. Since the objective is to detect
cancer, it is safe to say that the probability of a tumor being present is pretty low: P(ω1) < P(ω2).
Likelihood
From a high level, a CT scan is produced by applying x-rays in a circular motion. One of the key
metrics that is produced is attenuation — a measurement of x-ray absorption. Objects with a
higher density have a higher attenuation and vice-versa. Therefore, a tumor is more likely to
have a high attenuation compared to lung tissue.
Suppose you only look at attenuation values to help make your decision between ω1 and ω2.
Each class has a class-conditional probability density, p(x|ω1) and p(x|ω2), called likelihoods.
The figure below shows a hypothetical class-conditional probability density for p(x|ω). These
distributions are extracted by analyzing your training data; however, it is always good to have
domain expertise to check the validity of the data.
Evidence
The best way to describe the evidence, p(x), is through the law of total probability. This law
states that if you have mutually exclusive events (e.g. ω1 and ω2) whose probabilities of
occurrence sum to 1, then the probability of some feature (e.g. attenuation) is the likelihood
times the prior, summed across all mutually exclusive events.
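In symbols, for the two-class problem the evidence is

p(x) = p(x|ω1) P(ω1) + p(x|ω2) P(ω2).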
Posterior
The result of using Bayes’ Theorem is called the posterior, P(ω1|x) and P(ω2|x). The posterior
represents the probability that an observation falls into class ω1 or ω2 (i.e. tumor present or
not) given the measurement x (e.g. attenuation). Each observation receives a posterior
probability for every class, and all the posteriors must add up to 1. With regard to the cancer
detection problem we are trying to solve, there are two posterior probabilities. The image
below is a hypothetical scenario of how the posterior values could change with respect to a
measurement x. In addition to being driven by the likelihoods, the posteriors can be heavily
affected by the priors P(ω1) and P(ω2).
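As a purely illustrative calculation with made-up numbers, suppose P(ω1) = 0.05 and P(ω2) = 0.95,
and that for some observed attenuation x the likelihoods are p(x|ω1) = 0.9 and p(x|ω2) = 0.1. Then

p(x) = 0.9 × 0.05 + 0.1 × 0.95 = 0.14
P(ω1|x) = (0.9 × 0.05) / 0.14 ≈ 0.32
P(ω2|x) = (0.1 × 0.95) / 0.14 ≈ 0.68

so even a strongly tumor-like measurement only raises the posterior probability of ω1 to about 0.32,
because the prior probability of a tumor is so small.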
Decision Rules
Now that we have a good understanding of Bayes’ theorem, it’s time to see how we can use it
to make a decision boundary between our two classes. There are two methods for determining
whether a patient has a tumor present or not. The first is a basic approach that only uses the
prior probability values to make a decision. The second way utilizes the posteriors, which
takes advantage of the priors and class-conditional probability distributions.
Using the Priors
Suppose we only make a decision based on the natural prior probabilities. This means we
forget about all the other factors in Bayes’ Theorem. Since the probability of having a tumor,
P(ω1), is far less than not having one P(ω2), our model/system will always decide that every
patient does not have a tumor. Even though the model/system will be correct most of the time,
it will not identify the patients who actually have a tumor and need proper medical attention.
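Formally, this prior-only rule is: decide ω1 if P(ω1) > P(ω2), and decide ω2 otherwise. Since
P(ω1) < P(ω2) for cancer detection, the rule decides ω2 (no tumor) for every patient.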
Using the Posteriors
Now let’s take a more comprehensive approach by using the posteriors, P(ω1|x) and P(ω2|x).
Since the posteriors are a result of Bayes’ Theorem, the impact of the priors is mitigated by the
class-conditional probability densities, p(x|ω1) and p(x|ω2). If our model/system is looking at
a region with a higher attenuation than ordinary tissue, then the probability of a tumor being
present increases despite the natural prior probabilities. Let’s assume there is a 75% chance
that a specific region contains a tumor; that would mean there is a 25% chance there is no
tumor at all. That 25% chance is our probability of error, also known as risk.
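As a rough illustration of this posterior-based rule, the Python sketch below assumes, purely for
the sake of example, that the two class-conditional densities of attenuation are Gaussians with
made-up means and variances, and that the priors are the hypothetical values used above; none
of these numbers come from real data.

import numpy as np
from scipy.stats import norm

# Hypothetical priors: tumors are rare
prior = {"tumor": 0.05, "no_tumor": 0.95}

# Hypothetical class-conditional (likelihood) models of attenuation,
# assumed Gaussian purely for illustration
likelihood = {
    "tumor": norm(loc=60.0, scale=10.0),      # tumors: higher attenuation
    "no_tumor": norm(loc=30.0, scale=10.0),   # ordinary tissue: lower attenuation
}

def posteriors(x):
    """Apply Bayes' theorem to a measured attenuation value x."""
    joint = {c: likelihood[c].pdf(x) * prior[c] for c in prior}
    evidence = sum(joint.values())            # law of total probability
    return {c: joint[c] / evidence for c in joint}

def decide(x):
    """Bayes decision rule: pick the class with the largest posterior."""
    post = posteriors(x)
    label = max(post, key=post.get)
    error = 1.0 - post[label]                 # probability of error for this x
    return label, error

print(decide(35.0))   # low attenuation: decides "no_tumor"
print(decide(65.0))   # high attenuation: posterior shifts toward "tumor"

The reported error is exactly the kind of risk described above: if the posterior for the chosen
class is 0.75, the probability of error is 0.25.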
K-means clustering is one of the simplest and most popular unsupervised machine learning
algorithms.
Typically, unsupervised algorithms make inferences from datasets using only input vectors
without referring to known, or labelled, outcomes.
The objective of K-means is simple: group similar data points together and discover
underlying patterns. To achieve this objective, K-means looks for a fixed number (k) of
clusters in a dataset.
A cluster refers to a collection of data points aggregated together because of certain
similarities.
Define a target number k, which refers to the number of centroids in the dataset. A centroid is
the imaginary or real location representing the center of the cluster.
Every data point is allocated to one of the clusters by reducing the in-cluster sum of
squares.
In other words, the K-means algorithm identifies k centroids, and then allocates
every data point to the nearest centroid, while keeping the clusters as compact as possible.
The ‘means’ in the K-means refers to averaging of the data; that is, finding the centroid.
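Written out, the quantity being reduced is the within-cluster sum of squares

J = Σk Σ(xi in cluster k) ||xi − μk||²

where μk is the centroid (mean) of cluster k and the outer sum runs over all k clusters.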
How the K-means algorithm works
To process the learning data, the K-means algorithm in data mining starts with a first group of
randomly selected centroids, which are used as the beginning points for every cluster, and then
performs iterative (repetitive) calculations to optimize the positions of the centroids.
It halts creating and optimizing clusters when either:
• The centroids have stabilized — there is no change in their values because the
clustering has been successful.
• The defined number of iterations has been achieved.
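A minimal NumPy sketch of this iterative loop (not a production implementation, and assuming
every cluster keeps at least one point) might look like:

import numpy as np

def kmeans(X, k, max_iters=100, seed=0):
    """Tiny K-means sketch: X is an (n_samples, n_features) array."""
    rng = np.random.default_rng(seed)
    # Start with k randomly selected data points as the initial centroids
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(max_iters):
        # Assign every point to its nearest centroid (Euclidean distance)
        distances = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = distances.argmin(axis=1)
        # Recompute each centroid as the mean of the points assigned to it
        new_centroids = np.array([X[labels == j].mean(axis=0) for j in range(k)])
        # Halt when the centroids have stabilized
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return centroids, labels

Here the loop stops early once the centroids stop changing, which matches the first halting
condition above; otherwise it simply runs for the defined number of iterations.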
K-means algorithm
Step 1: Import libraries
• Pandas for reading and writing spreadsheets
• Numpy for carrying out efficient computations
• Matplotlib for visualization of data
Step 2: Generate random data
Step 3: Use Scikit-Learn
Step 4: Finding the centroid
Step 5: Testing the algorithm
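Putting the five steps above together, one possible end-to-end sketch using scikit-learn (with
randomly generated two-dimensional blobs standing in for a real dataset, and k fixed at 2 purely
for illustration) is:

# Step 1: import libraries (Pandas would be used to read a real spreadsheet;
# here the data are generated instead)
import numpy as np
import matplotlib.pyplot as plt
from sklearn.cluster import KMeans

# Step 2: generate random data as two rough blobs
rng = np.random.default_rng(0)
X = np.vstack([
    rng.normal(loc=(-5, -5), scale=1.0, size=(100, 2)),
    rng.normal(loc=(5, 5), scale=1.0, size=(100, 2)),
])

# Step 3: use Scikit-Learn to fit K-means with k = 2
kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)

# Step 4: find the centroids
centroids = kmeans.cluster_centers_
print(centroids)

# Step 5: test the algorithm on new points
print(kmeans.predict([[-4.5, -5.2], [4.8, 5.1]]))

# Visualize the clusters and their centroids
plt.scatter(X[:, 0], X[:, 1], c=kmeans.labels_, s=20)
plt.scatter(centroids[:, 0], centroids[:, 1], c="red", marker="x", s=100)
plt.show()

In practice the value of k would be chosen based on the data rather than fixed at 2 in advance.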
K-means clustering is an extensively used technique for data cluster analysis.
It is easy to understand, especially if you accelerate your learning using a K-means clustering
tutorial. Furthermore, it delivers training results quickly.
However, its performance is usually not as competitive as that of other, more sophisticated
clustering techniques, because slight variations in the data can lead to high variance.
Furthermore, clusters are assumed to be spherical and evenly sized, which may
reduce the accuracy of the K-means clustering results.