
Machine Learning


FACULTY OF COMPUTER APPLICATIONS

Bachelor of Computer Applications (Hons)

Sem.: 7
Subject Code: 05BC0702
Subject: Machine Learning
Course Objectives:
1. To introduce the concepts of machine learning and its applications
2. To learn the concepts of feature engineering
3. To understand various classification algorithms for supervised machine learning
4. To understand various regression algorithms for supervised machine learning
5. To understand various clustering and association rule mining algorithms for unsupervised machine learning

 Prerequisites :-

Unit No.   Topics Covered                                            No. of lectures required

1          Introduction to Machine Learning and preparing a model    10
           Introduction, What is human learning?, Types of machine learning, Applications of machine learning, Languages and tools in machine learning
           Machine learning activities, Basic types of data, Exploring data structure, Data quality and remediation, Data pre-processing

2          Model selection and feature engineering                   10
           Introduction, Selecting a model, Training a model for supervised learning
           Features – understand your data better, Feature extraction and engineering, Feature engineering on numerical data, categorical data and text data, Feature scaling, Feature selection, Dimensionality reduction

3          Supervised learning – Classification                      10
           Example of supervised learning, Classification model, Classification learning steps
           Importance of Bayesian method, Bayes theorem and concept learning
           Common algorithms – K nearest neighbour (kNN), Decision tree, Random forest model, Support Vector Machine (SVM)
           Evaluating performance of classification models

4          Supervised learning – Regression                          10
           Introduction, Example of regression, Common regression algorithms – Simple linear, Multiple linear, Main problems in regression analysis, Improving accuracy of linear regression model, Polynomial regression model, Logistic regression, Maximum likelihood estimation
           Evaluating performance of regression models

5          Unsupervised learning                                     10
           Introduction, Unsupervised vs supervised learning, Applications, Clustering, Evaluating performance of clustering
           Finding patterns using association rule mining
           Introduction to neural networks, Architectures of neural networks, Deep learning

Course Outcomes:

1. Able to understand applications of ML in real life
2. Able to implement preprocessing steps on data to make it ready for analysis
3. Able to apply classification algorithms for supervised learning
4. Able to apply regression algorithms for predictive analysis
5. Able to apply clustering algorithms and association rule mining algorithms for real life problems

Course Outcomes – Program Outcomes Mapping Table :

PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8


CO1 H M M H H L
CO2 L H H M H H
CO3 L H H M H H
CO4 L H H M H H
CO5 L H H M H H

Text Books:

1. Machine Learning, Saikat Dutt et al., Pearson Education, Third edition, 2019
2. Practical Machine Learning with Python, Dipanjan Sarkar et al., Apress, First edition, 2018

Reference Books:

1. Machine Learning, Tom M Mitchell, McGraw Hill, First edition, 1997
2. Machine Learning, Anuradha Srinivasa Raghavan, Vincy Joseph, Wiley India, First edition, 2019
3. Machine Learning in Action, Peter Harrington, DreamTech, First edition, 2012

Web References :

1. www.edx.org
2. www.coursera.org
3. www.kaggle.com
4. www.onlinecourses.nptel.ac.in

App References :

1. Learn Machine Learning - ML Tutorials & Programs
2. Data Science 101 - Machine Learning Tutorials

Syllabus coverage from text/reference books and web/app references:

Unit No.   Book No.   Chapter Numbers
1          1          1, 2
2          1, 2       Book 1: 3.1, 3.2, 3.3; Book 2: 4
3          1          6.1, 6.2, 6.3, 6.4, 7, 3.5.1
4          1          8, 3.5.2
5          1          9, 3.5.3, 10.1, 10.2, 10.3, 10.6, 10.9
PRACTICALS

Unit No.   List of Practicals

1 & 2      Data Identification and Feature Engineering:

Practical – 1: Download the Auto-MPG data set (https://www.kaggle.com/uciml/autompg-dataset) and perform the following operations:
i. Write a program to read the dataset (Text, CSV, JSON, XML)
ii. Identify which attributes are numeric and which are categorical
iii. Perform data cleaning
     Handling missing data
     Removing null data
iv. Rescale the data
v. Encode the data
vi. Perform feature selection and dimensionality reduction
     Implement Principal Component Analysis

Practical – 2: Download the Airline data set (https://www.kaggle.com/open-flights/airline-database) and perform all the above operations.
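The cleaning, rescaling and encoding steps listed above can be sketched in plain Python. This is only an illustration under stated assumptions: the four sample rows and their values are made up for the example (they are not read from the Auto-MPG file), and the helper names `drop_missing`, `min_max_scale` and `one_hot` are hypothetical.

```python
# Minimal sketch of steps iii-v on made-up rows (NOT real Auto-MPG values).
rows = [
    {"horsepower": 130.0, "weight": 3504.0, "origin": "usa"},
    {"horsepower": None, "weight": 2130.0, "origin": "japan"},  # missing value
    {"horsepower": 95.0, "weight": 2372.0, "origin": "europe"},
    {"horsepower": 165.0, "weight": 3693.0, "origin": "usa"},
]


def drop_missing(records):
    """Data cleaning: keep only records in which no field is None."""
    return [r for r in records if all(v is not None for v in r.values())]


def min_max_scale(values):
    """Rescaling: map numeric values linearly onto the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:  # constant column: avoid division by zero
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def one_hot(values):
    """Encoding: one 0/1 column per distinct category, in sorted order."""
    categories = sorted(set(values))
    encoded = [[1 if v == c else 0 for c in categories] for v in values]
    return categories, encoded


clean = drop_missing(rows)
scaled_hp = min_max_scale([r["horsepower"] for r in clean])
categories, encoded_origin = one_hot([r["origin"] for r in clean])

print(len(clean), "rows kept")
print("scaled horsepower:", scaled_hp)
print("origin columns:", categories, encoded_origin)
```

Dropping incomplete rows is only one way to handle missing data; replacing a missing value with the column mean or median is a common alternative. Principal Component Analysis is not shown here because it is usually done with a numerical library rather than by hand.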

3          Supervised Learning – Classification:

Practical – 1: Write Python code that applies the Naive Bayes algorithm to classify whether a person buys a computer or not, based on the given data:

Age      Income   Student   Credit_rating   Buys_computer
Youth    High     No        Fair            No
Youth    High     No        Excellent       No
Middle   High     No        Fair            Yes
Senior   Medium   No        Fair            Yes
Senior   Low      Yes       Fair            Yes
Middle   Low      Yes       Excellent       No
Senior   Low      Yes       Excellent       Yes
Youth    Medium   No        Fair            No
Youth    Low      Yes       Fair            Yes
Senior   Medium   Yes       Fair            Yes
Youth    Medium   Yes       Excellent       Yes
Middle   Medium   No        Excellent       Yes
Middle   High     Yes       Fair            Yes
Senior   Medium   No        Excellent       No
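One possible starting point for this practical is a from-scratch categorical Naive Bayes classifier over the table above. The sketch below is not a prescribed solution: the Laplace smoothing constant `alpha=1.0`, the function names, and the query instance are assumptions made for illustration, since the practical does not specify a test instance.

```python
from collections import Counter, defaultdict

# Rows from the table above: (Age, Income, Student, Credit_rating, Buys_computer)
DATA = [
    ("Youth", "High", "No", "Fair", "No"),
    ("Youth", "High", "No", "Excellent", "No"),
    ("Middle", "High", "No", "Fair", "Yes"),
    ("Senior", "Medium", "No", "Fair", "Yes"),
    ("Senior", "Low", "Yes", "Fair", "Yes"),
    ("Middle", "Low", "Yes", "Excellent", "No"),
    ("Senior", "Low", "Yes", "Excellent", "Yes"),
    ("Youth", "Medium", "No", "Fair", "No"),
    ("Youth", "Low", "Yes", "Fair", "Yes"),
    ("Senior", "Medium", "Yes", "Fair", "Yes"),
    ("Youth", "Medium", "Yes", "Excellent", "Yes"),
    ("Middle", "Medium", "No", "Excellent", "Yes"),
    ("Middle", "High", "Yes", "Fair", "Yes"),
    ("Senior", "Medium", "No", "Excellent", "No"),
]


def train(rows, alpha=1.0):
    """Count class frequencies and per-attribute value frequencies."""
    class_counts = Counter(row[-1] for row in rows)
    value_counts = defaultdict(Counter)  # (attribute index, class) -> value counts
    domains = [set() for _ in range(len(rows[0]) - 1)]
    for *features, label in rows:
        for i, value in enumerate(features):
            value_counts[(i, label)][value] += 1
            domains[i].add(value)
    return class_counts, value_counts, domains, alpha


def posteriors(model, features):
    """Return normalised P(class | features) under the naive independence assumption."""
    class_counts, value_counts, domains, alpha = model
    total = sum(class_counts.values())
    scores = {}
    for label, count in class_counts.items():
        score = count / total  # prior P(class)
        for i, value in enumerate(features):
            # Laplace-smoothed likelihood P(value | class)
            numerator = value_counts[(i, label)][value] + alpha
            denominator = count + alpha * len(domains[i])
            score *= numerator / denominator
        scores[label] = score
    norm = sum(scores.values())
    return {label: s / norm for label, s in scores.items()}


def predict(model, features):
    probs = posteriors(model, features)
    return max(probs, key=probs.get)


model = train(DATA)
query = ("Youth", "Medium", "Yes", "Fair")  # hypothetical test instance
print(predict(model, query), posteriors(model, query))
```

Smoothing keeps a single unseen attribute value from forcing a class probability to zero; setting `alpha=0` gives the unsmoothed frequency estimates.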

Practical – 2: Write Python code to implement a decision tree for the dataset given below, and use it to predict whether a job is offered or not.

CGPA     Communication   Aptitude   Programming_skill   Job_offered
High     Good            High       Good                Yes
Medium   Good            High       Good                Yes
Low      Bad             Low        Good                No
Low      Good            Low        Bad                 No
High     Good            High       Bad                 Yes
High     Good            High       Good                Yes
Medium   Bad             Low        Bad                 No
Medium   Bad             Low        Good                No
High     Bad             High       Good                Yes
Medium   Good            High       Good                Yes
Low      Bad             High       Bad                 No
Low      Bad             High       Bad                 No
Medium   Good            High       Bad                 Yes
Low      Good            Low        Good                No
High     Bad             Low        Bad                 No
Medium   Bad             High       Good                No
High     Bad             Low        Bad                 No
Medium   Good            High       Bad                 Yes
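A decision tree for the table above can be grown with the ID3 procedure, which at each node splits on the attribute with the highest information gain. The sketch below assumes ID3 with entropy as the impurity measure (the practical does not name a specific tree algorithm), and the function names and the nested-dictionary tree representation are illustrative choices.

```python
import math
from collections import Counter

ATTRS = ["CGPA", "Communication", "Aptitude", "Programming_skill"]

# Rows from the table above; the last field is the class label Job_offered.
DATA = [
    ("High", "Good", "High", "Good", "Yes"),
    ("Medium", "Good", "High", "Good", "Yes"),
    ("Low", "Bad", "Low", "Good", "No"),
    ("Low", "Good", "Low", "Bad", "No"),
    ("High", "Good", "High", "Bad", "Yes"),
    ("High", "Good", "High", "Good", "Yes"),
    ("Medium", "Bad", "Low", "Bad", "No"),
    ("Medium", "Bad", "Low", "Good", "No"),
    ("High", "Bad", "High", "Good", "Yes"),
    ("Medium", "Good", "High", "Good", "Yes"),
    ("Low", "Bad", "High", "Bad", "No"),
    ("Low", "Bad", "High", "Bad", "No"),
    ("Medium", "Good", "High", "Bad", "Yes"),
    ("Low", "Good", "Low", "Good", "No"),
    ("High", "Bad", "Low", "Bad", "No"),
    ("Medium", "Bad", "High", "Good", "No"),
    ("High", "Bad", "Low", "Bad", "No"),
    ("Medium", "Good", "High", "Bad", "Yes"),
]


def entropy(rows):
    """Shannon entropy (in bits) of the class labels in rows."""
    counts = Counter(r[-1] for r in rows)
    n = len(rows)
    return -sum(c / n * math.log2(c / n) for c in counts.values())


def info_gain(rows, i):
    """Reduction in entropy obtained by splitting rows on attribute index i."""
    remainder = 0.0
    for value in set(r[i] for r in rows):
        subset = [r for r in rows if r[i] == value]
        remainder += len(subset) / len(rows) * entropy(subset)
    return entropy(rows) - remainder


def build(rows, attrs):
    """Return a class label (leaf) or a dict describing an internal node."""
    labels = [r[-1] for r in rows]
    majority = Counter(labels).most_common(1)[0][0]
    if len(set(labels)) == 1 or not attrs:
        return majority
    best = max(attrs, key=lambda i: info_gain(rows, i))
    rest = [i for i in attrs if i != best]
    branches = {
        value: build([r for r in rows if r[best] == value], rest)
        for value in set(r[best] for r in rows)
    }
    return {"attr": best, "branches": branches, "default": majority}


def classify(tree, features):
    """Follow branches until a leaf; unseen values fall back to the node majority."""
    while isinstance(tree, dict):
        tree = tree["branches"].get(features[tree["attr"]], tree["default"])
    return tree


tree = build(DATA, list(range(len(ATTRS))))
print("Root attribute:", ATTRS[tree["attr"]])
print(classify(tree, ("High", "Good", "High", "Good")))
```

On this table the first split chosen is Aptitude, because every row with Low aptitude has Job_offered = No, which makes that branch pure immediately.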

Practical – 3: Write Python code to implement the K-nearest neighbour (kNN) algorithm for both of the above data sets.
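Because both data sets above are purely categorical, one simple option for kNN is to measure distance as the number of attributes on which two rows differ (Hamming distance). The sketch below uses the Buys_computer table from Practical 1; the distance measure, `k=3`, and the query instance are assumptions chosen for illustration.

```python
from collections import Counter

# Rows from the Practical 1 table: (Age, Income, Student, Credit_rating, Buys_computer)
DATA = [
    ("Youth", "High", "No", "Fair", "No"),
    ("Youth", "High", "No", "Excellent", "No"),
    ("Middle", "High", "No", "Fair", "Yes"),
    ("Senior", "Medium", "No", "Fair", "Yes"),
    ("Senior", "Low", "Yes", "Fair", "Yes"),
    ("Middle", "Low", "Yes", "Excellent", "No"),
    ("Senior", "Low", "Yes", "Excellent", "Yes"),
    ("Youth", "Medium", "No", "Fair", "No"),
    ("Youth", "Low", "Yes", "Fair", "Yes"),
    ("Senior", "Medium", "Yes", "Fair", "Yes"),
    ("Youth", "Medium", "Yes", "Excellent", "Yes"),
    ("Middle", "Medium", "No", "Excellent", "Yes"),
    ("Middle", "High", "Yes", "Fair", "Yes"),
    ("Senior", "Medium", "No", "Excellent", "No"),
]


def hamming(a, b):
    """Number of positions at which two attribute tuples differ."""
    return sum(x != y for x, y in zip(a, b))


def knn_predict(rows, features, k=3):
    """Majority vote among the k training rows closest to features."""
    neighbours = sorted(rows, key=lambda r: hamming(r[:-1], features))[:k]
    votes = Counter(r[-1] for r in neighbours)
    return votes.most_common(1)[0][0]


query = ("Youth", "Medium", "Yes", "Fair")  # hypothetical test instance
print(knn_predict(DATA, query, k=3))
```

The same two functions work unchanged on the Job_offered table, since they only assume that the class label is the last field of each row. An odd `k` avoids tied votes when there are two classes.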

Practical – 4: Implement a supervised machine learning algorithm (Classification – K Nearest Neighbour) in Python to classify breast tumour data as malignant or benign (use the breast tumour dataset) and obtain its accuracy.

Practical – 5: Implement a supervised machine learning algorithm (Classification – K Nearest Neighbour) in Python to classify iris data into setosa, virginica and versicolor using the iris dataset and obtain its accuracy.

Practical – 6: Build a classification model in Python that predicts whether a student gets admission to a course, given the student's last two examination scores, for the dataset available at
https://docs.google.com/spreadsheets/d/1g0mjTUZ9Ado5prXA1UnAvNjmdzTrV0TzkFkIoU-Lpbk/edit?usp=sharing

Practical – 7: Implement a supervised machine learning algorithm (Classification – Support Vector Machine) in Python to classify breast tumour data as malignant or benign (use the breast tumour dataset) and obtain its accuracy.

Practical – 8: Write a Python program to build an email spam classifier using support vector machines for the Spambase dataset from the UCI Machine Learning Repository.

Practical – 9: Implement a supervised machine learning algorithm (Classification – Naïve Bayes) in Python on the Pima Indians Diabetes dataset and obtain its accuracy.
4          Supervised Learning – Regression:

Practical – 1: Predict the CPU time when DiskIO = 40 on the basis of the following data:

DiskIO   CPUTime
14       2
16       5
27       7
42       9
39       10
50       13
83       20
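One way to approach this practical is simple linear regression fitted by ordinary least squares, where the slope is the ratio of the x-y covariance sum to the x variance sum. The sketch below assumes a straight-line model CPUTime = intercept + slope × DiskIO, which the practical implies but does not state; the function name `fit_line` is illustrative.

```python
# Data from the table above.
disk_io = [14, 16, 27, 42, 39, 50, 83]
cpu_time = [2, 5, 7, 9, 10, 13, 20]


def fit_line(xs, ys):
    """Ordinary least squares fit of y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


slope, intercept = fit_line(disk_io, cpu_time)
prediction = intercept + slope * 40

print(f"CPUTime = {intercept:.4f} + {slope:.4f} * DiskIO")
print(f"Predicted CPU time at DiskIO = 40: {prediction:.2f}")
```

With these seven points the fitted slope is about 0.244 and the intercept is close to zero, giving a predicted CPU time of roughly 9.74 at DiskIO = 40.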

Practical – 2: Write Python code to predict the profit of a hotel chain given the population of the area (city), using the data at
https://docs.google.com/spreadsheets/d/1Ks20skBgEefHFU36sFqVzozoFtz2EZE2rxBIgXOrUg/edit?usp=sharing

Practical – 3: Write Python code to predict salary on the basis of experience in years, using the data at
https://github.com/tarunlnmiit/machine_learning/blob/master/SimpleLinearRegression.csv

Practical – 4: Write Python code to predict the price of a house given its area in square feet and its number of bedrooms, for the dataset available at
https://docs.google.com/spreadsheets/d/1DHVK7gKo4TSyj7mFLwofHamj1Sl4SOZma2q51w1ZvyE/edit?usp=sharing

Practical – 5: Build a logistic regression model to classify flower type based on the iris flower dataset.

Practical – 6: Build a multivariate logistic regression model to classify the type of glass given different glass mixture features, using the Glass Identification Dataset from the UCI Machine Learning Repository.
5          Unsupervised Learning – Clustering and Association Rule Mining:

Practical – 1: Implement an unsupervised machine learning algorithm (Clustering – K Means) in Python to cluster the Titanic dataset after removing the class label.

Practical – 2: Implement an unsupervised machine learning algorithm (Clustering – K Means) in Python to cluster the Breast Tumour dataset after removing the class label.

Practical – 3: Implement an unsupervised machine learning algorithm (Clustering – Hierarchical) in Python to cluster the Titanic dataset.

Practical – 4: Implement an unsupervised machine learning algorithm (Clustering – Hierarchical) in Python to cluster the Breast Tumour dataset after removing the class label.

Practical – 5: Implement the Apriori algorithm in Python to find rules which explain the association between different products for the given transactions at a retail store. (The data is available at
https://drive.google.com/file/d/1NUXoptUlHY8z4KcFKpFA6sQN5KnWzk3p/view?usp=sharing )

Practical – 6: Generate association rules for the following data set.

Transaction   Items
1             Red, White, Green
2             White, Orange
3             White, Blue
4             Red, White, Orange
5             Red, Blue
6             White, Blue
7             White, Orange
8             Red, White, Blue, Green
9             Red, White, Blue
10            Yellow
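The ten transactions above are small enough to mine with a from-scratch Apriori implementation: frequent itemsets are found level by level, and rules are then read off each frequent itemset. The sketch below is illustrative only; the thresholds `MIN_SUPPORT = 0.2` and `MIN_CONFIDENCE = 0.7` are assumptions, since the practical does not specify them, and the function names are hypothetical.

```python
from itertools import combinations

# Transactions from the table above.
TRANSACTIONS = [
    {"Red", "White", "Green"},
    {"White", "Orange"},
    {"White", "Blue"},
    {"Red", "White", "Orange"},
    {"Red", "Blue"},
    {"White", "Blue"},
    {"White", "Orange"},
    {"Red", "White", "Blue", "Green"},
    {"Red", "White", "Blue"},
    {"Yellow"},
]
MIN_SUPPORT = 0.2     # assumed threshold
MIN_CONFIDENCE = 0.7  # assumed threshold


def support(itemset, transactions):
    """Fraction of transactions that contain every item of itemset."""
    return sum(itemset <= t for t in transactions) / len(transactions)


def apriori(transactions, min_support):
    """Return {frozenset: support} for all frequent itemsets."""
    items = {item for t in transactions for item in t}
    current = [frozenset([i]) for i in items]
    frequent = {}
    k = 1
    while current:
        level = {c: support(c, transactions) for c in current}
        level = {c: s for c, s in level.items() if s >= min_support}
        frequent.update(level)
        k += 1
        # Join step: unions of frequent (k-1)-itemsets that have exactly k items.
        candidates = {a | b for a in level for b in level if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        current = [
            c for c in candidates
            if all(frozenset(s) in level for s in combinations(c, k - 1))
        ]
    return frequent


def rules(frequent, min_confidence):
    """Return (antecedent, consequent, support, confidence) tuples."""
    found = []
    for itemset, sup in frequent.items():
        for size in range(1, len(itemset)):
            for lhs in map(frozenset, combinations(itemset, size)):
                confidence = sup / frequent[lhs]
                if confidence >= min_confidence:
                    found.append((set(lhs), set(itemset - lhs), sup, confidence))
    return found


frequent_itemsets = apriori(TRANSACTIONS, MIN_SUPPORT)
found_rules = rules(frequent_itemsets, MIN_CONFIDENCE)
for lhs, rhs, sup, confidence in found_rules:
    print(f"{sorted(lhs)} -> {sorted(rhs)}  support={sup:.2f} confidence={confidence:.2f}")
```

Lowering either threshold produces more rules. With the values assumed here, Yellow is dropped at the first level because it appears in only one of the ten transactions.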

Practical – 7: Generate association rules for the following data set.

Transaction   Items
1             Apple, Beer, Sugar, Chicken
2             Apple, Beer, Sugar
3             Apple, Beer
4             Apple, Peru
5             Milk, Beer, Sugar, Chicken
6             Milk, Beer, Sugar
7             Milk, Beer
8             Milk, Peru
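When generating rules for the eight transactions above, it helps to evaluate each candidate rule by its support, confidence and lift. The sketch below computes these three measures directly from their definitions; the three rules it evaluates are chosen only as examples, and the function names are illustrative.

```python
# Transactions from the table above.
TRANSACTIONS = [
    {"Apple", "Beer", "Sugar", "Chicken"},
    {"Apple", "Beer", "Sugar"},
    {"Apple", "Beer"},
    {"Apple", "Peru"},
    {"Milk", "Beer", "Sugar", "Chicken"},
    {"Milk", "Beer", "Sugar"},
    {"Milk", "Beer"},
    {"Milk", "Peru"},
]


def support(items):
    """Fraction of transactions that contain all the given items."""
    items = set(items)
    return sum(items <= t for t in TRANSACTIONS) / len(TRANSACTIONS)


def rule_metrics(lhs, rhs):
    """Support, confidence and lift of the rule lhs -> rhs."""
    sup = support(set(lhs) | set(rhs))
    confidence = sup / support(lhs)
    lift = confidence / support(rhs)
    return sup, confidence, lift


# Example rules, chosen for illustration only.
for lhs, rhs in [({"Apple"}, {"Beer"}), ({"Sugar"}, {"Beer"}), ({"Beer"}, {"Sugar"})]:
    sup, confidence, lift = rule_metrics(lhs, rhs)
    print(f"{sorted(lhs)} -> {sorted(rhs)}: "
          f"support={sup:.3f} confidence={confidence:.3f} lift={lift:.3f}")
```

A lift of 1 means the two sides occur together exactly as often as independence would predict, which is the case for Apple -> Beer here. A lift above 1, as for Sugar -> Beer, indicates a positive association.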
