Machine Learning Foundation
L: 2  T: 0  P: 2  Credits: 3
Unit I
Giving computers the ability to learn from data: making predictions about the future with supervised
learning, solving interactive problems with reinforcement learning, discovering hidden structures with
unsupervised learning, roadmap for building machine learning systems, using Python for machine
learning
Building good training sets: data preprocessing, dealing with missing data, handling categorical data,
partitioning a dataset into training and test sets, normalization, selecting meaningful features
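Illustration (partitioning and normalization): a minimal sketch of two Unit I topics, written in plain Python rather than with any particular library so that it runs without dependencies. The function names, the 25% test fraction, and the sample heights are assumptions made for this example only. The point it tries to show is that the scaling parameters are learned from the training set and then reused on the test set.

```python
import random


def train_test_split(rows, test_fraction=0.25, seed=0):
    """Shuffle a copy of rows and split it into (train, test) lists."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_test = int(round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def fit_min_max(values):
    """Learn the scaling range (minimum, maximum) from the given values."""
    return min(values), max(values)


def apply_min_max(values, lo, hi):
    """Rescale values with a previously learned range.

    Training values land in [0, 1]; test values may fall slightly outside.
    """
    span = hi - lo
    if span == 0:
        return [0.0 for _ in values]
    return [(v - lo) / span for v in values]


# Hypothetical feature column (heights in cm), for illustration only.
heights_cm = [150.0, 160.0, 170.0, 180.0, 190.0, 200.0, 155.0, 165.0]

train, test = train_test_split(heights_cm, test_fraction=0.25, seed=0)
lo, hi = fit_min_max(train)                   # fit on training data only
train_scaled = apply_min_max(train, lo, hi)
test_scaled = apply_min_max(test, lo, hi)     # reuse the training range
```

Fitting the range on the training portion alone keeps information from the test set out of the preprocessing step.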
Unit II
Machine learning classifiers using scikit-learn: choosing a classification algorithm, first steps with
scikit-learn, modeling class probabilities via logistic regression, maximum margin classification with
support vector machines, decision tree learning, k-nearest neighbor algorithm, Bayesian learning, majority
voting classifier, bagging and boosting classifier, random forest classifier
Unit III
Predicting continuous target variables with regression analysis: introducing linear regression,
examining relationships using a correlation matrix, exploratory data analysis, regularized methods for regression,
polynomial regression, modeling nonlinear relationships in the housing dataset, decision tree and random
forest regressor
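Illustration (linear regression): a minimal sketch of ordinary least squares for a single feature, using the closed-form slope and intercept. The function name fit_line and the four data points are assumptions chosen for this example; the course itself may fit regression models with a library instead.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums (the 1/n factors cancel).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data lying exactly on y = 2x + 1.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]

slope, intercept = fit_line(xs, ys)
prediction = slope * 5.0 + intercept  # predicted target for x = 5
```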
Unit IV
Working with unlabeled data: K-means clustering, hard versus soft clustering, using the elbow method
to find the optimal number of clusters, silhouette plots, organizing clusters as a hierarchical tree,
agglomerative clustering, DBSCAN clustering
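Illustration (K-means and the elbow method): a minimal sketch of K-means on one-dimensional points, with the within-cluster sum of squared errors (SSE) that the elbow method compares across values of k. The function names, the starting centroids, and the six data points are assumptions for this example; the starting centroids are fixed by hand so the run is deterministic.

```python
def k_means_1d(points, centroids, max_iter=100):
    """Alternate assignment and update steps until the centroids stop moving."""
    centroids = list(centroids)
    clusters = [[] for _ in centroids]
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda i: abs(p - centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster.
        # An empty cluster keeps its previous centroid.
        new_centroids = [sum(c) / len(c) if c else centroids[i]
                         for i, c in enumerate(clusters)]
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return centroids, clusters


def within_cluster_sse(centroids, clusters):
    """Sum of squared distances from each point to its cluster centroid."""
    return sum((p - c) ** 2
               for c, cluster in zip(centroids, clusters)
               for p in cluster)


# Hypothetical data with two obvious groups.
points = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0]

centroids_k2, clusters_k2 = k_means_1d(points, centroids=[1.0, 12.0])
sse_k2 = within_cluster_sse(centroids_k2, clusters_k2)

centroids_k1, clusters_k1 = k_means_1d(points, centroids=[1.0])
sse_k1 = within_cluster_sse(centroids_k1, clusters_k1)
```

On this data the SSE falls sharply from k = 1 to k = 2, which is the kind of bend the elbow method looks for.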
Unit V
Dimension reduction: unsupervised dimensionality reduction via principal component analysis,
supervised data compression via linear discriminant analysis, using kernel principal component analysis
for nonlinear mappings, projecting new data points
Unit VI
Model evaluation and hyperparameter tuning: streamlining workflows with pipelines, using k-fold
cross-validation to assess model performance, debugging algorithms with learning and validation curves,
fine-tuning machine learning models via grid search
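Illustration (k-fold cross-validation): a minimal sketch of how k-fold cross-validation partitions sample indices, so that every sample is used for validation exactly once. The function name k_fold_indices and the sizes used are assumptions for this example. The sketch does not shuffle the data; in practice the samples are often shuffled first.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_indices, validation_indices) for each of k folds."""
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    # The first (n_samples % k) folds receive one extra sample.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0)
                  for i in range(k)]
    start = 0
    for size in fold_sizes:
        stop = start + size
        validation = list(range(start, stop))
        train = list(range(0, start)) + list(range(stop, n_samples))
        yield train, validation
        start = stop


# Hypothetical setup: 10 samples split into 3 folds.
folds = list(k_fold_indices(n_samples=10, k=3))
validation_sizes = [len(validation) for _, validation in folds]
```

A model would be trained once per fold on the training indices and scored on the validation indices, and the k scores averaged to estimate performance.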