0% found this document useful (0 votes)
80 views15 pages

Introduction to Machine Learning Concepts

The document provides an introduction to machine learning, including its definition, how it works, types of machine learning, and limitations. It discusses supervised, unsupervised and reinforcement learning. Applications mentioned include automotive, robotics, healthcare, fraud prevention, and mass personalization.

Uploaded by

nupurbopche633
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
80 views15 pages

Introduction to Machine Learning Concepts

The document provides an introduction to machine learning, including its definition, how it works, types of machine learning, and limitations. It discusses supervised, unsupervised and reinforcement learning. Applications mentioned include automotive, robotics, healthcare, fraud prevention, and mass personalization.

Uploaded by

nupurbopche633
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd

UNIT

1. INTRODUCTION TO MACHINE LEARNING:-


Arthur Samuel, an early American leader in the field of computer gaming and
artificial intelligence, coined the term “Machine Learning ” in 1959 while at IBM.
He defined machine learning as “the field of study that gives computers the ability
to learn without being explicitly programmed “.

Definition of learning: A computer program is said to learn from experience E


with respect to some class of tasks T and performance measure P , if its
performance at tasks T, as measured by P , improves with experience E.
EXAMPLE:-

 A robot driving learning problem

 Task T : Driving on highways using vision sensors

 Performance P : Average distance traveled before an error

 Training experience E : A sequence of images and steering commands


recorded while observing a human driver

Machine Learning is said as a subset of artificial intelligence that is mainly


concerned with the development of algorithms which allow a computer to learn
from the data and past experiences on their own.

Machine learning enables a machine to automatically learn from data, improve


performance from experiences, and predict things without being explicitly
programmed.

How does Machine Learning work


A Machine Learning system learns from historical data, builds the prediction
models, and whenever it receives new data, predicts the output for it. The
accuracy of predicted output depends upon the amount of data, as the huge
amount of data helps to build a better model which predicts the output more
accurately.

Suppose we have a complex problem, where we need to perform some


predictions, so instead of writing a code for it, we just need to feed the data to
generic algorithms, and with the help of these algorithms, machine builds the
logic as per the data and predict the output. Machine learning has changed our
way of thinking about the problem. The below block diagram explains the working
of Machine Learning algorithm:

Features of Machine Learning:


o Machine learning uses data to detect various patterns in a
given dataset.
o It can learn from past data and improve automatically.
o It is a data-driven technology.
o Machine learning is much similar to data mining as it also
deals with the huge amount of the data.

Need for Machine Learning


The need for machine learning is increasing day by day. The
reason behind the need for machine learning is that it is capable
of doing tasks that are too complex for a person to implement
directly. As a human, we have some limitations as we cannot
access the huge amount of data manually, so for this, we need
some computer systems and here comes the machine learning to
make things easy for us.
The importance of machine learning can be easily understood by
its uses cases, Currently, machine learning is used in self-driving
cars, cyber fraud detection, face recognition, and friend
suggestion by Facebook, etc. Various top companies such as
Netflix and Amazon have build machine learning models that are
using a vast amount of data to analyze the user interest and
recommend product accordingly.

Classification of Machine Learning


At a broad level, machine learning can be classified into three
types:
1. Supervised learning
2. Unsupervised learning
3. Reinforcement learning
1) Supervised Learning
Supervised learning is a type of machine learning method in
which we provide sample labeled data to the machine learning
system in order to train it, and on that basis, it predicts the output.
The system creates a model using labeled data to understand the
datasets and learn about each data, once the training and
processing are done then we test the model by providing a
sample data to check whether it is predicting the exact output or
not.
The goal of supervised learning is to map input data with the
output data. The supervised learning is based on supervision, and
it is the same as when a student learns things in the supervision of
the teacher. The example of supervised learning is spam filtering.
Supervised learning can be grouped further in two categories of
algorithms:
o Classification

o Regression
2) Unsupervised Learning
Unsupervised learning is a learning method in which a machine
learns without any supervision.
The training is provided to the machine with the set of data that
has not been labeled, classified, or categorized, and the algorithm
needs to act on that data without any supervision. The goal of
unsupervised learning is to restructure the input data into new
features or a group of objects with similar patterns.
In unsupervised learning, we don't have a predetermined result.
The machine tries to find useful insights from the huge amount of
data. It can be further classifieds into two categories of
algorithms:
o Clustering

o Association

3) Reinforcement Learning
Reinforcement learning is a feedback-based learning method, in
which a learning agent gets a reward for each right action and
gets a penalty for each wrong action. The agent learns
automatically with these feedbacks and improves its performance.
In reinforcement learning, the agent interacts with the
environment and explores it. The goal of an agent is to get the
most reward points, and hence, it improves its performance.
The robotic dog, which automatically learns the movement of his
arms, is an example of Reinforcement learning.
4. Semi-supervised learning:
Where an incomplete training signal is given: a training set with
some (often many) of the target outputs missing. There is a
special case of this principle known as Transduction where the
entire set of problem instances is known at learning time, except
that part of the targets are missing. Semi-supervised learning is
an approach to machine learning that combines small labeled
data with a large amount of unlabeled data during training. Semi-
supervised learning falls between unsupervised learning and
supervised learning.

2.Scope of machine learning:-


Automotive Industry

The automotive industry is one of the areas where Machine


Learning is excelling by changing the definition of ‘safe’ driving.
There are a few major companies such as Google, Tesla,
Mercedes Benz, Nissan, etc. that have invested hugely in
Machine Learning to come up with novel innovations. However,
Tesla’s self-driving car is the best in the industry. These self-
driving cars are built using Machine Learning, IoT sensors, high-
definition cameras, voice recognition systems, etc.

Robotics

Robotics is one of the fields that always gain the interest of


researchers as well as the common. In 1954, George Devol
invented the first robot that was programmable and it was
named Unimate. After that, in the 21st century, Hanson Robotics
created the first AI-robot, Sophia. These inventions were possible
with the help of Machine Learning and Artificial Intelligence.
Quantum Computing

We are still at an infant state in the field of Machine Learning.


There are a lot of advancements to achieve in this field. One of
them that will take Machine Learning to the next level is Quantum
Computing. It is a type of computing that uses the mechanical
phenomena of quantum such as entanglement and superposition.
By using the quantum phenomenon of superposition, we can
create systems (quantum systems) that can exhibit multiple states
at the same time. On the other hand, entanglement is the
phenomenon where two different states can be referenced to
each other

Safer Healthcare
We’ve been seeing significant growth in machine
learning being used to predict and support COVID-19
strategies. The healthcare industry itself has been
long using ML for a wide range of purposes, we
believe that the future scope of machine learning
will undertake more complex use cases.
 Robots performing complicated surgeries

precisely.
 ML programs reading patient history, records,

reports etc. to devise personalised treatment


plans. IBM Watson Oncology is an important
project in this space.
 Wearable technology for disease prevention and

elder healthcare monitoring is also making great


strides.
Fraud Prevention
Banks and other financial institutions use machine-
learning-based fraud detection technology to stop
malpractices (although the irony of proving ‘I am not
a robot’ to a machine is not lost!).
 Banks are building machine learning algorithms

based on historical data to predict fraudulent


transactions.
 Classification and regression methods are being

used to identify and filter out phishing emails.


 Machine learning and computer vision algorithms

are checking for identity matching across key


databases in real-time to prevent identity theft.
 These pattern matching techniques are also

used to identify fake documents to prevent


forgery.

Mass Personalisation
Retail, social media and entertainment platforms use
ML to give customers personalised services and
experiences.
 The face swap filter uses algorithms based on

image recognition and computer vision to detect


and (well, almost) accurately exchange facial
features.
 E-commerce and media platforms are using ML

to offer hyper-personalised experiences, as well


as offer freemium models of payment.
3.LIMITATION OF ML:-
1. Inadequate Training Data
The major issue that comes while using machine learning
algorithms is the lack of quality as well as quantity of data.
Although data plays a vital role in the processing of machine
learning algorithms, many data scientists claim that inadequate
data, noisy data, and unclean data are extremely exhausting the
machine learning algorithms. For example, a simple task requires
thousands of sample data, and an advanced task such as speech
or image recognition needs millions of sample data examples.
Further, data quality is also important for the algorithms to work
ideally, but the absence of data quality is also found in Machine
Learning applications.

2. Poor quality of data


As we have discussed above, data plays a significant role in
machine learning, and it must be of good quality as well. Noisy
data, incomplete data, inaccurate data, and unclean data lead to
less accuracy in classification and low-quality results. Hence, data
quality can also be considered as a major common problem while
processing machine learning algorithms.

3. Non-representative training data


To make sure our training model is generalized well or not, we
have to ensure that sample training data must be representative
of new cases that we need to generalize. The training data must
cover all cases that are already occurred as well as occurring.
Further, if we are using non-representative training data in the
model, it results in less accurate predictions. A machine learning
model is said to be ideal if it predicts well for generalized cases
and provides accurate decisions. If there is less training data, then
there will be a sampling noise in the model, called the non-
representative training set. It won't be accurate in predictions. To
overcome this, it will be biased against one class or a group.

4. Overfitting and Underfitting


Overfitting:
Overfitting is one of the most common issues faced by Machine
Learning engineers and data scientists. Whenever a machine
learning model is trained with a huge amount of data, it starts
capturing noise and inaccurate data into the training data set. It
negatively affects the performance of the model. Let's understand
with a simple example where we have a few training data sets
such as 1000 mangoes, 1000 apples, 1000 bananas, and 5000
papayas. Then there is a considerable probability of identification
of an apple as papaya because we have a massive amount of
biased data in the training data set; hence prediction got
negatively affected. The main reason behind overfitting is using
non-linear methods used in machine learning algorithms as they
build non-realistic data models. We can overcome overfitting by
using linear and parametric algorithms in the machine learning
models.
Underfitting:
Underfitting is just the opposite of overfitting. Whenever a
machine learning model is trained with fewer amounts of data,
and as a result, it provides incomplete and inaccurate data and
destroys the accuracy of the machine learning model.
Underfitting occurs when our model is too simple to understand
the base structure of the data, just like an undersized pant. This
generally happens when we have limited data into the data set,
and we try to build a linear model with non-linear data. In such
scenarios, the complexity of the model destroys, and rules of the
machine learning model become too easy to be applied on this
data set, and the model starts doing wrong predictions as well.

5. Monitoring and maintenance


As we know that generalized output data is mandatory for any
machine learning model; hence, regular monitoring and
maintenance become compulsory for the same. Different results
for different actions require data change; hence editing of codes
as well as resources for monitoring them also become necessary.

6. Getting bad recommendations


A machine learning model operates under a specific context
which results in bad recommendations and concept drift in the
model. Let's understand with an example where at a specific time
customer is looking for some gadgets, but now customer
requirement changed over time but still machine learning model
showing same recommendations to the customer while customer
expectation has been changed. This incident is called a Data Drift.
It generally occurs when new data is introduced or interpretation
of data changes. However, we can overcome this by regularly
updating and monitoring data according to the expectations.
7. Lack of skilled resources
Although Machine Learning and Artificial Intelligence are
continuously growing in the market, still these industries are
fresher in comparison to others. The absence of skilled resources
in the form of manpower is also an issue. Hence, we need
manpower having in-depth knowledge of mathematics, science,
and technologies for developing and managing scientific
substances for machine learning.

8. Customer Segmentation
Customer segmentation is also an important issue while
developing a machine learning algorithm. To identify the
customers who paid for the recommendations shown by the
model and who don't even check them. Hence, an algorithm is
necessary to recognize the customer behavior and trigger a
relevant recommendation for the user based on past experience.

9. Process Complexity of Machine Learning


The machine learning process is very complex, which is also
another major issue faced by machine learning engineers and
data scientists. However, Machine Learning and Artificial
Intelligence are very new technologies but are still in an
experimental phase and continuously being changing over time.
There is the majority of hits and trial experiments; hence the
probability of error is higher than expected. Further, it also
includes analyzing the data, removing data bias, training data,
applying complex mathematical calculations, etc., making the
procedure more complicated and quite tedious.

10. Data Bias


Data Biasing is also found a big challenge in Machine Learning.
These errors exist when certain elements of the dataset are heavily
weighted or need more importance than others. Biased data leads
to inaccurate results, skewed outcomes, and other analytical
errors. However, we can resolve this error by determining where
data is actually biased in the dataset. Further, take necessary steps
to reduce it.

You might also like