Introduction To ML
Introduction To ML
2
What is machine learning?
A branch of artificial intelligence, concerned with the
design and development of algorithms that allow computers
to evolve behaviors based on empirical data.
As intelligence requires knowledge, it is necessary for the
computers to acquire knowledge.
Flood of data…..Highly complex systems,.. Speed of
programming (Supermarkets, Banks, telephone switches,
research, medical ..etc Google??) Any alternative ???
A program is said to learn from experience E with respect to
task T and performance measure P, if it’s performance at tasks
in T, as measured by P, improves with experience E.
Machine learning is programming computers to optimize a
performance criterion using example data or past experience
What is ML?
An algorithm is a sequence of instructions that when
carried out transforms input to output.
There are tasks with no algorithms.
The problem of sorting algorithm?
??? we gave a program a number of examples of unsorted
lists and corresponding sorted lists, and wanted the
program to learn (or, come up with an algorithm) to sort?
Learn pattern in data???
To be intelligent, a system that is in a changing environment
should have the ability to learn.
If a system can learn and adapt to such changes, the system
designer need not foresee and provide solutions for all
possible situations.
LEARNING
There are two ways that a system can improve:
1. By acquiring new knowledge
acquiring new facts
acquiring new skills
2. By adapting its behavior
solving problems more accurately
solving problems more efficiently
Why do we need Machine Learning?
• Some tasks cannot be defined well, except by examples (e.g. recognition of
faces or people).
• The amount of knowledge about a certain problem / task may be too large
for explicit encoding by humans (e.g. in medical diagnostics)
Recognizing patterns:
Facial identities or facial expressions
Medical images
Generating patterns:
Generating images or motion sequences
Recognizing anomalies:
Unusual sequences of credit card transactions
The web contains a lot of data. Tasks with very big datasets
often use machine learning
especially if the data is noisy or non-stationary.
Recommendation systems:
Lots of noisy data. Million dollar prize!
Information retrieval:
Find documents or images with similar content.
Data Visualization:
Display a huge database in a revealing way
Learning task
• Classification:
Prediction of an item class.
• Forecasting:
Prediction of a parameter value.
• Characterization:
Find hypotheses that describe groups of items.
• Clustering:
Partitioning of the (unassigned) data set into clusters
with common properties. (Unsupervised learning)
dataset and pre-processing
Complexity of datasets:
• Many instances (examples)
• Instances with multiple features (properties / characteristics)
• Dependencies between the features (correlations)
Instance selection:
Remove identical / inconsistent / incomplete instances (e.g.
reduction of homologous genes, removal of wrongly annotated
genes)
Learner
Environment/
Experience Knowledge
Performance
Element
What is ML?
18
witten&eibe
Uses of machine Learning
Machine Learning creates an optimized model of the
concept being learned based on data or past
experience. The model is parameterized.
Learning is the execution of a computer program to
optimize the parameter values so that the model fits
data or past experience well.
Uses of learning: Predictive and/or Descriptive.
Predictive: Use the model to predict things about an
unseen example.
Descriptive: Use the model to describe the examples
seen or experiences had. This model can be used in
some problem-solving situation.
The basic principle
10^5 machine learning algorithms
Hundreds new every year
Every algorithm has three components: –
1. Hypothesis space—possible outputs ( ANN,
SVM, Decision tree, Bayes network etc )
2. Search strategy---strategy for exploring space
(optimizing an objective function)
3. Evaluation like accuracy, precision and recall,
squared error ,Likelihood • Posterior probability •
Cost / Utility , Margin
Learning system model
Testing
Input Learning
Samples Method
System
Training
Training and testing
Universal set
(unobserved)
27 Semi-supervised learning
Machine learning structure
Supervised learning
Machine learning structure
Unsupervised learning
Semi-supervised learning (SSL)
Techniques:
Perceptron
Logistic regression
Support vector machine (SVM)
Ada-line
Multi-layer perceptron (MLP)
Learning techniques
• Non-linear case
Spectral clustering
Density Estimation
Gaussian mixture model (GMM)
Graphical models
Dimensionality reduction
Principal component analysis (PCA)
Factor analysis
Classification
There are three methodologies:
a) Model a classification rule directly
Examples: k-NN, linear classifier, SVM, neural nets, …
b) Model the probability of class memberships given input data
Examples: logistic regression, probabilistic neural nets (softmax),…
c) Make a probabilistic model of data within each class
Examples: naive Bayes, model-based ….
Important ML taxonomy for learning models
probabilistic models vs non-probabilistic models
discriminative models vs generative models
Resulting model is also called the hypothesis
Classification
Given a model space and an optimality criterion, a model satisfying this criterion is sought
Some optimizing criteria:
Many approaches:
Regression,
Decision Trees,
Bayesian,
Neural Networks,
...