0% found this document useful (0 votes)
3 views40 pages

Module -1 Lecture-1

The document provides an introduction to deep learning, machine learning, and artificial intelligence, highlighting their definitions, types, and key differences. It emphasizes the importance of datasets for machine learning projects, detailing the types of data, sources for datasets, and the necessity of data preprocessing. Additionally, it covers supervised and unsupervised learning techniques, along with their applications and algorithms.

Uploaded by

Ashwin K.L
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
Download as pdf or txt
0% found this document useful (0 votes)
3 views40 pages

Module -1 Lecture-1

The document provides an introduction to deep learning, machine learning, and artificial intelligence, highlighting their definitions, types, and key differences. It emphasizes the importance of datasets for machine learning projects, detailing the types of data, sources for datasets, and the necessity of data preprocessing. Additionally, it covers supervised and unsupervised learning techniques, along with their applications and algorithms.

Uploaded by

Ashwin K.L
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
Download as pdf or txt
Download as pdf or txt
You are on page 1/ 40

Module -1

Introduction to Deep Learning


Basics:
•Convolutional networks,
•Supervised and Unsupervised Training,
•Classifiers,
•Optimization and loss functions in classifiers,
• Introduction to neural networks,
•Learning XOR problem,
• Backpropagation and its importance.
Key Takeaways
1. Deep learning is a type of machine learning, which is a subset of
artificial intelligence.
2. Machine learning is about computers being able to think and act
with less human intervention; deep learning is about computers
learning to think using structures modeled on the human brain.
3. Machine learning requires less computing power; deep learning
typically needs less ongoing human intervention.
4. Deep learning can analyze images, videos, and unstructured data
in ways machine learning can’t easily do.
5. Every industry will have career paths that involve machine and
deep learning.
Artificial intelligence is a field of computer science
which makes a computer system that can mimic human
intelligence. It is comprised of two words "Artificial"
and "intelligence", which means "a human-made
thinking power." Hence we can define it as,

“Artificial intelligence is a technology using which we can


create intelligent systems that can simulate human
intelligence.”
•The Artificial intelligence system does not require to be pre-
programmed, instead of that, they use such algorithms which can
work with their own intelligence.
•It involves machine learning algorithms such as Reinforcement
learning algorithm and deep learning neural networks.
•AI is being used in multiple places such as Siri, Google?s
AlphaGo, AI in Chess playing, etc.

Based on capabilities, AI can be classified into three types:


•Weak AI
•General AI
•Strong AI
Machine learning
Machine learning is about extracting knowledge from the data.
It can be defined as,

“Machine learning is a subfield of artificial intelligence,


which enables machines to learn from past data or
experiences without being explicitly programmed”.
It can be divided into three types:

•Supervised learning
•Reinforcement learning
•Unsupervised learning
•Machine learning enables a computer system to make predictions
or take some decisions using historical data without being explicitly
programmed.
•Machine learning uses a massive amount of structured and semi-
structured data so that a machine learning model can generate
accurate result or give predictions based on that data.
•Machine learning works on algorithm which learn by it’s own using
historical data.
•Machine learning is being used in various places such as for online
recommender system, for Google search algorithms, Email spam
filter, Facebook Auto friend tagging suggestion, etc
Key differences between Artificial Intelligence (AI) and Machine learning (ML):
How to get datasets for Machine Learning
•The key to success in the field of machine learning or to
become a great data scientist is to practice with different types
of datasets. But discovering a suitable dataset for each kind of
machine learning project is a difficult task.
What is a dataset?
•A dataset is a collection of data in which data is arranged in
some order. A dataset can contain any data from a series of an
array to a database table.
•A tabular dataset can be understood as a database table or
matrix, where each column corresponds to a particular
variable, and each row corresponds to the fields of the
dataset. The most supported file type for a tabular dataset
is "Comma Separated File," or CSV. But to store a "tree-like
data," we can use the JSON file more efficiently.
Types of data in datasets
•Numerical data:Such as house price, temperature, etc.
•Categorical data:Such as Yes/No, True/False, Blue/green, etc.
•Ordinal data:These data are similar to categorical data but can be measured on the
basis of comparison.

Need of Dataset
•To work with machine learning projects, we need a huge amount of data, because,
without the data, one cannot train ML/AI models. Collecting and preparing the
dataset is one of the most crucial parts while creating an ML/AI project.
•The technology applied behind any ML projects cannot work properly if the dataset
is not well prepared and pre-processed.
•During the development of the ML project, the developers completely rely on the
datasets. In building ML applications, datasets are divided into two parts:
•Training dataset:
•Test Dataset
Popular sources for Machine Learning datasets

1. Kaggle Datasets
2. UCI Machine Learning Repository
3. Datasets via AWS
4. Google's Dataset Search Engine
5. Microsoft Datasets
6. Awesome Public Dataset Collection
7. Government Datasets
8. Computer Vision Datasets
9. Scikit-learn dataset
Data Preprocessing in Machine learning
Data preprocessing is a process of preparing the raw data and making it suitable for a
machine learning model. It is the first and crucial step while creating a machine learning
model.
Why do we need Data Preprocessing?
A real-world data generally contains noises, missing values, and maybe in an unusable
format which cannot be directly used for machine learning models. Data preprocessing is
required tasks for cleaning the data and making it suitable for a machine learning model
which also increases the accuracy and efficiency of a machine learning model.
It involves below steps:
What is Deep Learning????
•Artificial Intelligence is the concept of creating smart
intelligent machines.
•Machine Learning is a subset of artificial intelligence
that helps you build AI-driven applications.
•Deep Learning is a subset of machine learning that uses
vast volumes of data and complex algorithms to train a
model.
Difference between Supervised
and Unsupervised Learning

Supervised and Unsupervised


learning are the two techniques of
machine learning.

But both the techniques are used


in different scenarios and with
different datasets.
Supervised Machine Learning
•Supervised learning is the types of machine learning in which machines are trained
using well "labelled" training data, and on basis of that data, machines predict the
output. The labelled data means some input data is already tagged with the correct
output.
•In supervised learning, the training data provided to the machines work as the
supervisor that teaches the machines to predict the output correctly. It applies the
same concept as a student learns in the supervision of the teacher.
•Supervised learning is a process of providing input data as well as correct output
data to the machine learning model. The aim of a supervised learning algorithm is
to find a mapping function to map the input variable(x) with the output
variable(y).
•In the real-world, supervised learning can be used for Risk Assessment, Image
classification, Fraud Detection, spam filtering, etc.
1. Regression
Regression algorithms are used if there is a relationship between the input
variable and the output variable. It is used for the prediction of continuous
variables, such as Weather forecasting, Market Trends, etc. Below are some
popular Regression algorithms which come under supervised learning:
•Linear Regression
•Regression Trees
•Non-Linear Regression
•Bayesian Linear Regression
•Polynomial Regression
2. Classification
Classification algorithms are used when the output variable is categorical,
which means there are two classes such as Yes-No, Male-Female, True-false,
etc.
•Spam Filtering,
•Random Forest
•Decision Trees
•Logistic Regression
•Support vector Machines
Clustering: Clustering is a method of grouping the objects into clusters
such that objects with most similarities remains into a group and has less
or no similarities with the objects of another group. Cluster analysis finds
the commonalities between the data objects and categorizes them as per
the presence and absence of those commonalities.
Association: An association rule is an unsupervised learning method
which is used for finding the relationships between variables in the large
database. It determines the set of items that occurs together in the dataset.
Association rule makes marketing strategy more effective. Such as people
who buy X item (suppose a bread) are also tend to purchase Y
(Butter/Jam) item. A typical example of Association rule is Market Basket
Analysis.

You might also like