100% found this document useful (1 vote)

146 views

Introduction To ML

Machine learning is a branch of artificial intelligence that uses algorithms to identify patterns in data and learn from that data in order to make predictions or decisions without being explicitly programmed. The goal of machine learning is for computers to be able to learn from examples or past experiences to improve their performance on some task. Some key aspects covered in the document include the different types of machine learning tasks like classification, clustering, and prediction, as well as the basic components of a machine learning system like the hypothesis space, search strategy, and evaluation method.

Uploaded by

Pooja Patwari

Available Formats

Download as PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

146 views

Introduction To ML

Uploaded by

Pooja Patwari

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 39

INTRODUCTION

What is machine learning?

 Goal: programs that detect patterns and regularities in the
data
 Strong patterns  good predictions
 Problem 1: most patterns are not interesting
 Problem 2: patterns may be inexact (or
spurious)
 Problem 3: data may be garbled or missing
Related Disciplines
 Artificial Intelligence
 Data Mining
 Probability and Statistics
 Information theory
 Numerical optimization
 Computational complexity theory
 Control theory (adaptive)
 Psychology (developmental, cognitive)
 Neurobiology
 Linguistics
 Philosophy

2
What is machine learning?
 A branch of artificial intelligence, concerned with the
design and development of algorithms that allow computers
to evolve behaviors based on empirical data.
 As intelligence requires knowledge, it is necessary for the
computers to acquire knowledge.
 Flood of data…..Highly complex systems,.. Speed of
programming (Supermarkets, Banks, telephone switches,
research, medical ..etc Google??) Any alternative ???
 A program is said to learn from experience E with respect to
task T and performance measure P, if it’s performance at tasks
in T, as measured by P, improves with experience E.
 Machine learning is programming computers to optimize a
performance criterion using example data or past experience
What is ML?
 An algorithm is a sequence of instructions that when
carried out transforms input to output.
 There are tasks with no algorithms.
 The problem of sorting algorithm?
 ??? we gave a program a number of examples of unsorted
lists and corresponding sorted lists, and wanted the
program to learn (or, come up with an algorithm) to sort?
 Learn pattern in data???
 To be intelligent, a system that is in a changing environment
should have the ability to learn.
 If a system can learn and adapt to such changes, the system
designer need not foresee and provide solutions for all
possible situations.
LEARNING
There are two ways that a system can improve:
1. By acquiring new knowledge
 acquiring new facts
 acquiring new skills
2. By adapting its behavior
 solving problems more accurately
 solving problems more efficiently
Why do we need Machine Learning?
• Some tasks cannot be defined well, except by examples (e.g. recognition of
faces or people).

• Large amounts of data may have hidden relationships and correlations.

Only automated approaches may be able to detect these.

• The amount of knowledge about a certain problem / task may be too large
for explicit encoding by humans (e.g. in medical diagnostics)

• Environments change over time, and new knowledge is constantly being

discovered. A continuous redesign of the systems “by hand” may be
difficult.
Some examples of tasks that are best solved by
using a learning algorithm

 Recognizing patterns:
 Facial identities or facial expressions

 Handwritten or spoken words

 Medical images

 Generating patterns:
 Generating images or motion sequences

 Recognizing anomalies:
 Unusual sequences of credit card transactions

 Unusual patterns of sensor readings in a nuclear power

plant or unusual sound in your car engine.
 Prediction:
 Future stock prices or currency exchange rates
Some web-based examples of machine learning

 The web contains a lot of data. Tasks with very big datasets
often use machine learning
 especially if the data is noisy or non-stationary.

 Spam filtering, fraud detection:

 The enemy adapts so we must adapt too.

 Recommendation systems:
 Lots of noisy data. Million dollar prize!

 Information retrieval:
 Find documents or images with similar content.

 Data Visualization:
 Display a huge database in a revealing way
Learning task
• Classification:
 Prediction of an item class.
• Forecasting:
 Prediction of a parameter value.
• Characterization:
 Find hypotheses that describe groups of items.
• Clustering:
 Partitioning of the (unassigned) data set into clusters
with common properties. (Unsupervised learning)
dataset and pre-processing
 Complexity of datasets:
• Many instances (examples)
• Instances with multiple features (properties / characteristics)
• Dependencies between the features (correlations)
 Instance selection:
 Remove identical / inconsistent / incomplete instances (e.g.
reduction of homologous genes, removal of wrongly annotated
genes)

 Feature transformation / selection:

 Projection techniques (e.g. principal components analysis)
 Compression techniques (e.g. minimum description length)
 Feature selection techniques
Defining the Learning Task
Improve on task, T, with respect to
performance metric, P, based on experience, E.
T: Playing checkers
P: Percentage of games won against an arbitrary opponent
E: Playing practice games against itself

T: Recognizing hand-written words

P: Percentage of words correctly classified
E: Database of human-labeled images of handwritten words

T: Driving on four-lane highways using vision sensors

P: Average distance traveled before a human-judged error
E: A sequence of images and steering commands recorded while
observing a human driver.

T: Categorize email messages as spam or legitimate.

P: Percentage of email messages correctly classified.
E: Database of emails, some with human-given labels
Designing a Learning System
 Choose the training experience
 Choose exactly what is to be learned, i.e. the
target function.
 Choose how to represent the target function.
 Choose a learning algorithm to infer the target
function from the experience.

Learner
Environment/
Experience Knowledge

Performance
Element
What is ML?

Can we improve investment gain with help of stock data?

The learning Model
Understanding Hypothesis space
How many possible Boolean functions

4 features = 216 = 65536

After 7 examples, we still have

29 possibilities

The space of all hypothesis that

can be output by a learning algorithm

Version space : space not ruled

out by a training examples
Learning as search
 Inductive learning: find a concept description that fits the data
 Example: rule sets as description language
 Enormous, but finite, search space
 Simple solution:
 enumerate the concept space
 eliminate descriptions that do not fit examples
 surviving descriptions contain target concept

18
witten&eibe
Uses of machine Learning
 Machine Learning creates an optimized model of the
concept being learned based on data or past
experience. The model is parameterized.
 Learning is the execution of a computer program to
optimize the parameter values so that the model fits
data or past experience well.
 Uses of learning: Predictive and/or Descriptive.
 Predictive: Use the model to predict things about an
unseen example.
 Descriptive: Use the model to describe the examples
seen or experiences had. This model can be used in
some problem-solving situation.
The basic principle
 10^5 machine learning algorithms
 Hundreds new every year
 Every algorithm has three components: –
1. Hypothesis space—possible outputs ( ANN,
SVM, Decision tree, Bayes network etc )
2. Search strategy---strategy for exploring space
(optimizing an objective function)
3. Evaluation like accuracy, precision and recall,
squared error ,Likelihood • Posterior probability •
Cost / Utility , Margin
Learning system model

Testing

Input Learning
Samples Method

System

Training
Training and testing

Data acquisition Practical usage

Universal set
(unobserved)

Training set Testing set

(observed) Labels are known (unobserved)
Labels are known but not given
Performance
 There are several factors affecting the performance:
 Types of training provided
 The form and extent of any initial background knowledge
 The type of feedback provided
 The learning algorithms used

 Two important factors:

 Modeling
 Optimization
Algorithms
 The success of machine learning system also depends on the
algorithms.

 The algorithms control the search to find and build the

knowledge structures.

 The learning algorithms should extract useful information

from training examples.
Algorithms
 Supervised learning ( )
 Prediction
 Classification (discrete labels), Regression (real values)
 Unsupervised learning ( )
 Clustering
 Probability distribution estimation
 Finding association (in features)
 Dimension reduction [NO FEEDBACK]
 Semi-supervised learning
 Reinforcement learning [INDIRECT FEEDBACK]
 Decision making (robot, chess machine)
Types of learning task
 Supervised learning
 Learn to predict output when given an input vector
 Who provides the correct answer?
 Reinforcement learning
 Learn action to maximize payoff
 Not much information in a payoff signal
 Payoff is often delayed
 Reinforcement learning is an important area that will not be
covered in this course.
 Unsupervised learning
 Create an internal representation of the input e.g. form
clusters; extract features
 How do we know if a representation is good?
 This is the new frontier of machine learning because most big
datasets do not come with labels.
Algorithms

Supervised learning Unsupervised learning

27 Semi-supervised learning
Machine learning structure

 Supervised learning
Machine learning structure
 Unsupervised learning
Semi-supervised learning (SSL)

 Traditional supervised learning is limited to using labeled data.

 SSL also uses unlabeled data to learn.

Let (x,y) be a labeled instance and (x,ø) be an unlabeled instance.

L: a set of n labaled instances.
U: a set of m unlabeled instances.
n << m
SSL tries to use L U U to learn a predictive model.
Learning techniques
• Linear classifier

, where w is an d-dim vector (learned)

 Techniques:
 Perceptron
 Logistic regression
 Support vector machine (SVM)
 Ada-line
 Multi-layer perceptron (MLP)
Learning techniques
• Non-linear case

 Support vector machine (SVM):

 Linear to nonlinear: Feature transform and kernel function
Learning techniques
 Unsupervised learning categories and techniques
 Clustering
 K-means clustering

 Spectral clustering

 Density Estimation
 Gaussian mixture model (GMM)

 Graphical models

 Dimensionality reduction
 Principal component analysis (PCA)

 Factor analysis
Classification
 There are three methodologies:
a) Model a classification rule directly
Examples: k-NN, linear classifier, SVM, neural nets, …
b) Model the probability of class memberships given input data
Examples: logistic regression, probabilistic neural nets (softmax),…
c) Make a probabilistic model of data within each class
Examples: naive Bayes, model-based ….
 Important ML taxonomy for learning models
probabilistic models vs non-probabilistic models
discriminative models vs generative models
 Resulting model is also called the hypothesis

Classification

zebra tiger rhino panda

Algorith Model lion
hippo
m
elephant
giraffe
lion penguin snake

Given a model space and an optimality criterion, a model satisfying this criterion is sought
Some optimizing criteria:

 Maximizing the prediction accuracy

 Minimizing the hypothesis’ size
 Maximizing the hypothesis fitness to the input data
 Maximizing the hypothesis interpretability
 Minimizing the time complexity of prediction
Classification
Learn a method for predicting the instance class from
pre-labeled (classified) instances

Many approaches:
Regression,
Decision Trees,
Bayesian,
Neural Networks,
...

Given a set of points from classes

what is the class of new point ?
37
Linear and Non-Linear Decision
boundary
Regression
• Regression analysis is used to predict the value of one variable (the
dependent variable) on the basis of other variables (the
independent variables).
• Learn a continuous function.

• Given, the following data, can we find

the value of the output when x = 0.44?
• Goal is to predict for input x an output
f(x) that is close to the true y.

• It is generally a problem of function approximation, or

interpolation, working out the value between values that we
know.
39

Submitted To:: Prof. Vinay Singh Chawan
No ratings yet
Submitted To:: Prof. Vinay Singh Chawan
12 pages
Machine Learning with Clustering: A Visual Guide for Beginners with Examples in Python
From Everand
Machine Learning with Clustering: A Visual Guide for Beginners with Examples in Python
Artem Kovera
No ratings yet
Unit 1 - Machine Learning - WWW - Rgpvnotes.in
No ratings yet
Unit 1 - Machine Learning - WWW - Rgpvnotes.in
23 pages
Machine Learning Algorithms PDF
100% (1)
Machine Learning Algorithms PDF
148 pages
Machine Learning: Presentation By: C. Vinoth Kumar SSN College of Engineering
100% (1)
Machine Learning: Presentation By: C. Vinoth Kumar SSN College of Engineering
15 pages
Artificial Intelligence: Slide 6
100% (1)
Artificial Intelligence: Slide 6
42 pages
Machine Learning Material
100% (3)
Machine Learning Material
115 pages
Machine Learning
100% (1)
Machine Learning
21 pages
Introduction To Machine Learning
100% (1)
Introduction To Machine Learning
46 pages
Fundamentals of Statistics For Data Science
No ratings yet
Fundamentals of Statistics For Data Science
23 pages
Machine Learning Techniques
100% (2)
Machine Learning Techniques
45 pages
Machine Learning Notes
No ratings yet
Machine Learning Notes
27 pages
Great Collection of Data Science Resources
100% (1)
Great Collection of Data Science Resources
2 pages
1 - Machine Learning (Start)
No ratings yet
1 - Machine Learning (Start)
32 pages
Machine Learning
100% (1)
Machine Learning
81 pages
L2 - Machine Learning Process
No ratings yet
L2 - Machine Learning Process
17 pages
02 - Lecture Note - TensorFlow Ops
No ratings yet
02 - Lecture Note - TensorFlow Ops
21 pages
Unit - 5.1 - Introduction To Machine Learning
No ratings yet
Unit - 5.1 - Introduction To Machine Learning
38 pages
Ai Agents
No ratings yet
Ai Agents
31 pages
Data Science Theory: Analysis and Analytics
No ratings yet
Data Science Theory: Analysis and Analytics
14 pages
Symbolic Machine Learning: M.S.Kaysar, M.Engg Cse, Iub
100% (2)
Symbolic Machine Learning: M.S.Kaysar, M.Engg Cse, Iub
112 pages
Machine Learning Tutorial
100% (1)
Machine Learning Tutorial
44 pages
Deep Learning Interview Questions
No ratings yet
Deep Learning Interview Questions
17 pages
1 - Intro To Machine Learning
100% (1)
1 - Intro To Machine Learning
20 pages
Machine Learning and Real-World Applications
100% (1)
Machine Learning and Real-World Applications
19 pages
Machine Learning Basics: An Illustrated Guide For Non-Technical Readers
No ratings yet
Machine Learning Basics: An Illustrated Guide For Non-Technical Readers
16 pages
Top 100 ML Interview Q&A
100% (1)
Top 100 ML Interview Q&A
39 pages
A Course in Machine Learning
No ratings yet
A Course in Machine Learning
189 pages
Data Science Use Cases
100% (1)
Data Science Use Cases
10 pages
8 Machine Learning Algorithms in Python
100% (3)
8 Machine Learning Algorithms in Python
16 pages
Data Science A Beginner S Guide 1668243666
100% (1)
Data Science A Beginner S Guide 1668243666
26 pages
Natural Language Processing: Dr. Ahmed El-Bialy
100% (1)
Natural Language Processing: Dr. Ahmed El-Bialy
49 pages
Basics of Prompt Engineering
No ratings yet
Basics of Prompt Engineering
16 pages
What Are The Types of Machine Learning?
100% (1)
What Are The Types of Machine Learning?
24 pages
Machine Learning Cheat Sheet
No ratings yet
Machine Learning Cheat Sheet
1 page
K Means
100% (2)
K Means
329 pages
Overview of Machine Learning PDF
100% (1)
Overview of Machine Learning PDF
57 pages
Combined ML
100% (1)
Combined ML
705 pages
Exploratory Data Analysis - Satyajit
No ratings yet
Exploratory Data Analysis - Satyajit
35 pages
Data Science Interview Questions (#Day11) PDF
100% (1)
Data Science Interview Questions (#Day11) PDF
11 pages
ML
No ratings yet
ML
79 pages
Top 9 Feature Engineering Techniques With Python: Dataset & Prerequisites
No ratings yet
Top 9 Feature Engineering Techniques With Python: Dataset & Prerequisites
27 pages
Machine Learning1
100% (1)
Machine Learning1
11 pages
Bias and Variance
No ratings yet
Bias and Variance
6 pages
Machine Learning
No ratings yet
Machine Learning
15 pages
Data Science PPT Module 1
100% (1)
Data Science PPT Module 1
24 pages
ML Interview Questions and Answers
100% (1)
ML Interview Questions and Answers
25 pages
EDA - The Right Way
No ratings yet
EDA - The Right Way
111 pages
500 Machine Learning Projects
100% (1)
500 Machine Learning Projects
14 pages
Feature Engineering Handout
No ratings yet
Feature Engineering Handout
33 pages
DataScience Interview Questions
100% (1)
DataScience Interview Questions
66 pages
Machine Learning Handouts
No ratings yet
Machine Learning Handouts
110 pages
Fundamentals of Data Science
100% (3)
Fundamentals of Data Science
62 pages
Python Data Science
100% (1)
Python Data Science
173 pages
ML First Unit
No ratings yet
ML First Unit
70 pages
Machine Learning
100% (5)
Machine Learning
56 pages
Data Mining Project Shivani Pandey
100% (1)
Data Mining Project Shivani Pandey
40 pages
Machine Learning with Python: Design and Develop Machine Learning and Deep Learning Technique using real world code examples
From Everand
Machine Learning with Python: Design and Develop Machine Learning and Deep Learning Technique using real world code examples
Abhishek Vijayvargia
No ratings yet
Machine Learning and Deep Learning With Python
From Everand
Machine Learning and Deep Learning With Python
James Chen
No ratings yet
Unit-I
No ratings yet
Unit-I
23 pages
Machine Learning Unit1
No ratings yet
Machine Learning Unit1
151 pages
SVM Optimization: Derivation of The Lagrangian Dual
No ratings yet
SVM Optimization: Derivation of The Lagrangian Dual
13 pages
Soft Max
No ratings yet
Soft Max
6 pages
Support Vector Machines (SVM) : N I y X D
No ratings yet
Support Vector Machines (SVM) : N I y X D
5 pages
Non-Linear Classifiers
No ratings yet
Non-Linear Classifiers
19 pages
GD in LR
No ratings yet
GD in LR
23 pages
Kernel Methods: Feature Mapping at No Cost
No ratings yet
Kernel Methods: Feature Mapping at No Cost
25 pages
Introduction To SVM
No ratings yet
Introduction To SVM
24 pages
Gradient Descent Learning: Minimize Objective Function: Error Landscape
No ratings yet
Gradient Descent Learning: Minimize Objective Function: Error Landscape
14 pages
Backpropagation Learning in Neural Networks
No ratings yet
Backpropagation Learning in Neural Networks
27 pages
Notes EIC17103 11 8 20 PDF
No ratings yet
Notes EIC17103 11 8 20 PDF
8 pages
SCI 1020 - wk2
No ratings yet
SCI 1020 - wk2
4 pages
Chapter Four Research Design
No ratings yet
Chapter Four Research Design
9 pages
2023.workbook - Maths Literacy Grade 10
No ratings yet
2023.workbook - Maths Literacy Grade 10
16 pages
2588 Software Development and Modelling For Churn Prediction Using Logistic Regression in Telecommunication Industry
No ratings yet
2588 Software Development and Modelling For Churn Prediction Using Logistic Regression in Telecommunication Industry
5 pages
Mumbai Educational Trust: MET Institute of Computer Science
No ratings yet
Mumbai Educational Trust: MET Institute of Computer Science
11 pages
Basic School Teachers ' Perspective To Digital Teaching and Learning in Ghana
No ratings yet
Basic School Teachers ' Perspective To Digital Teaching and Learning in Ghana
16 pages
Grade 7 Science Variable Practice-Answers
No ratings yet
Grade 7 Science Variable Practice-Answers
1 page
Assessing The Factors of Customers Satisfaction On Credit Card Users in Bangladesh
No ratings yet
Assessing The Factors of Customers Satisfaction On Credit Card Users in Bangladesh
13 pages
Lab Report Template From 2019 DP
No ratings yet
Lab Report Template From 2019 DP
9 pages
Credit Scoring in The Age of Big Data - A State-of-the-Art
No ratings yet
Credit Scoring in The Age of Big Data - A State-of-the-Art
13 pages
Eps 101a Unit 1 Slides
No ratings yet
Eps 101a Unit 1 Slides
49 pages
Data Interpretation With MS Excel
No ratings yet
Data Interpretation With MS Excel
5 pages
NIM Qunatitative Methods Workshop1
No ratings yet
NIM Qunatitative Methods Workshop1
30 pages
Ananta Raj Dahal
No ratings yet
Ananta Raj Dahal
14 pages
Tobit Postestimation - Postestimation Tools For Tobit
No ratings yet
Tobit Postestimation - Postestimation Tools For Tobit
5 pages
CHPT 12 Homework
100% (1)
CHPT 12 Homework
22 pages
Demand Forecasting Methods
No ratings yet
Demand Forecasting Methods
9 pages
Agustin, 2020. The Effect of Brand Image
No ratings yet
Agustin, 2020. The Effect of Brand Image
20 pages
Uma Ánalise Fatorial Completa Da Variação de Microdureza em Cordôes de Solda Depositados Pelo Processo de Soldagem Por Arco Metálico A Gás de
No ratings yet
Uma Ánalise Fatorial Completa Da Variação de Microdureza em Cordôes de Solda Depositados Pelo Processo de Soldagem Por Arco Metálico A Gás de
4 pages
Determinants of Induced Abortion Among Women of Reproductive Age
No ratings yet
Determinants of Induced Abortion Among Women of Reproductive Age
10 pages
Data Science Minimum - 10 Essential Skills You Need To Know To Start Doing Data Science - KDnuggets
No ratings yet
Data Science Minimum - 10 Essential Skills You Need To Know To Start Doing Data Science - KDnuggets
8 pages
Unfolding Model of Employees Turnover
100% (3)
Unfolding Model of Employees Turnover
23 pages
Customers Satisfaction On Online Shopping in Malay
No ratings yet
Customers Satisfaction On Online Shopping in Malay
9 pages
Bayesian Symbolic Regression: Ying Jin, Weilin Fu, Jian Kang, Jiadong Guo, Jian Guo
No ratings yet
Bayesian Symbolic Regression: Ying Jin, Weilin Fu, Jian Kang, Jiadong Guo, Jian Guo
10 pages
Chapter 5 Final
No ratings yet
Chapter 5 Final
29 pages
Quiz 2
No ratings yet
Quiz 2
1 page
Multiple Linear Regression: BIOST 515 January 15, 2004
No ratings yet
Multiple Linear Regression: BIOST 515 January 15, 2004
32 pages
The Probit Model: Alexander Spermann University of Freiburg University of Freiburg Sose 2009
No ratings yet
The Probit Model: Alexander Spermann University of Freiburg University of Freiburg Sose 2009
38 pages
The Effect of Stock Ownership, Independent Board of Commisioners and Characteristics of The Audit Committee On Creative Accounting Practices
No ratings yet
The Effect of Stock Ownership, Independent Board of Commisioners and Characteristics of The Audit Committee On Creative Accounting Practices
8 pages