Deep Learning Assignment: CNN & Clustering
Plotting validation loss and accuracy after each training epoch provides real-time feedback on the model's performance during training. This monitoring is important for detecting overfitting or underfitting: for example, if validation loss begins to rise while training loss keeps falling, the model is memorizing the training set rather than generalizing to unseen data. Trends in these curves can guide adjustments to hyperparameters or model structure, ensuring that model improvements align with the data's characteristics and the desired performance metrics.
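A minimal sketch of per-epoch validation tracking, using scikit-learn's `MLPClassifier` with `warm_start=True` so each `fit` call performs one optimization pass (the dataset, layer size, and epoch count are illustrative assumptions, not part of the assignment):

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier
from sklearn.metrics import log_loss, accuracy_score

# Synthetic classification data, split into train and validation sets
X, y = make_classification(n_samples=400, random_state=0)
X_tr, X_val, y_tr, y_val = train_test_split(X, y, random_state=0)

# warm_start=True keeps the learned weights between fit() calls,
# so max_iter=1 makes each fit() behave like one training epoch
clf = MLPClassifier(hidden_layer_sizes=(16,), max_iter=1,
                    warm_start=True, random_state=0)

val_loss, val_acc = [], []
for epoch in range(20):
    clf.fit(X_tr, y_tr)                       # one pass over the training data
    val_loss.append(log_loss(y_val, clf.predict_proba(X_val)))
    val_acc.append(accuracy_score(y_val, clf.predict(X_val)))

# The two lists can then be plotted against epoch number, e.g. with
# matplotlib: plt.plot(val_loss); plt.plot(val_acc)
```

A widening gap between the training and validation curves in such a plot is the usual visual signature of overfitting.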
Weight decay, also known as L2 regularization, adds a penalty proportional to the square of the magnitude of the model's weights to the loss function. This discourages overly complex models by penalizing large weight values, thus preventing overfitting. By controlling model complexity, weight decay improves the model’s ability to generalize to unseen data, enhancing its robustness and performance on held-out samples. Properly tuning the weight decay factor is crucial to balance model simplicity and predictive power.
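The effect of weight decay on a single SGD step can be made concrete with PyTorch. The sketch below (toy values, chosen only for illustration) applies the same gradient to two identical parameters, with and without decay, and shows that decay shrinks the weights by an extra `lr * weight_decay * w` term:

```python
import torch

# Two identical parameters, updated with the same gradient
w_plain = torch.nn.Parameter(torch.ones(3))
w_decay = torch.nn.Parameter(torch.ones(3))

opt_plain = torch.optim.SGD([w_plain], lr=0.1, weight_decay=0.0)
opt_decay = torch.optim.SGD([w_decay], lr=0.1, weight_decay=0.5)

grad = torch.tensor([0.2, 0.2, 0.2])
w_plain.grad = grad.clone()
w_decay.grad = grad.clone()
opt_plain.step()   # w = 1 - 0.1 * 0.2               = 0.98
opt_decay.step()   # w = 1 - 0.1 * (0.2 + 0.5 * 1.0) = 0.93
```

With plain SGD, setting `weight_decay=λ` is equivalent to adding the L2 penalty `(λ/2)·‖w‖²` to the loss, since its gradient is exactly `λ·w`.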
t-SNE is advantageous for visualizing cluster assignments as it captures non-linear structures in data and embeds high-dimensional data into low-dimensional spaces while preserving local similarities, making it suitable for identifying clusters. Unlike linear methods such as PCA, which focus on preserving global variance, t-SNE emphasizes revealing complex local relationships and patterns that are not easily visible through linear techniques, providing more intuitive visual insights into clustering outcomes.
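A short sketch of embedding cluster assignments with scikit-learn's `TSNE` (the synthetic blob data and the perplexity value are illustrative assumptions):

```python
from sklearn.datasets import make_blobs
from sklearn.manifold import TSNE

# High-dimensional data with known cluster structure
X, labels = make_blobs(n_samples=120, n_features=20,
                       centers=4, random_state=0)

# Embed into 2-D while preserving local neighborhoods;
# perplexity roughly controls the effective neighborhood size
emb = TSNE(n_components=2, perplexity=20,
           init="pca", random_state=0).fit_transform(X)

# emb can now be scatter-plotted, colored by cluster label, e.g.
# plt.scatter(emb[:, 0], emb[:, 1], c=labels)
```

Because t-SNE preserves local rather than global structure, distances between well-separated clusters in the embedding should not be over-interpreted.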
Non-linear dimension reduction techniques like Kernel PCA can model complex, non-linear relationships among data points that linear methods like PCA cannot capture. Kernel PCA implicitly maps data into a high-dimensional feature space via the kernel trick and computes principal components there without ever forming the mapping explicitly, uncovering structure and patterns in data with non-linear characteristics, which is pivotal in enhancing model performance in tasks where linear assumptions are inadequate. The choice between Kernel PCA and linear methods depends on the data characteristics and task requirements, with non-linear techniques generally providing richer insights at the cost of increased computational complexity.
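The classic concentric-circles example makes the difference tangible; the sketch below (with an assumed RBF kernel width `gamma=10`) contrasts linear PCA with Kernel PCA on data no linear projection can untangle:

```python
from sklearn.datasets import make_circles
from sklearn.decomposition import PCA, KernelPCA

# Two concentric circles: not linearly separable in the original 2-D space
X, y = make_circles(n_samples=200, factor=0.3, noise=0.05, random_state=0)

# Linear PCA can only rotate and rescale axes, so the rings stay entangled
X_pca = PCA(n_components=2).fit_transform(X)

# An RBF kernel implicitly maps points into a high-dimensional space
# where the two rings become approximately linearly separable
X_kpca = KernelPCA(n_components=2, kernel="rbf",
                   gamma=10).fit_transform(X)
```

The extra cost is real: Kernel PCA works with an n×n kernel matrix, so memory and runtime grow with the square of the number of samples.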
Plotting variance explained versus PCA dimensions allows the identification of the optimal number of components that capture the most substantial data variation. This plot, often referred to as a scree plot or elbow curve, helps determine the point of diminishing returns where additional components contribute minimally to explained variance. Making informed decisions on dimensionality reduction enables reduced computational costs and improved model performance, as it simplifies the dataset while retaining meaningful information.
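The computation behind such a scree plot can be sketched in a few lines with scikit-learn (the digits dataset and the 95% threshold are illustrative choices, not part of the assignment):

```python
import numpy as np
from sklearn.datasets import load_digits
from sklearn.decomposition import PCA

X = load_digits().data            # 1797 samples x 64 pixel features
pca = PCA().fit(X)                # keep all components

# Cumulative fraction of variance explained as components are added
cumvar = np.cumsum(pca.explained_variance_ratio_)

# Smallest number of components that captures at least 95% of the variance
n_95 = int(np.searchsorted(cumvar, 0.95) + 1)

# Plotting cumvar against component count gives the scree/elbow curve, e.g.
# plt.plot(range(1, len(cumvar) + 1), cumvar)
```

The "elbow" is the point where `cumvar` flattens; components beyond it add little explained variance.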
Reconstructing data using varying numbers of PCA dimensions involves balancing the trade-off between data reconstruction accuracy and dimensionality reduction. Using fewer dimensions improves computational efficiency and reduces storage needs, but can lead to a higher Mean Squared Error (MSE) because critical information may be lost. Conversely, using more dimensions preserves more information and results in lower MSE, but at increased computational and storage expense. The challenge lies in selecting a dimensional threshold that minimizes MSE while maximizing the benefits of dimensionality reduction.
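This trade-off can be measured directly by projecting data down and back up again; the sketch below (digits dataset and component counts are illustrative assumptions) records the reconstruction MSE for several dimensionalities:

```python
import numpy as np
from sklearn.datasets import load_digits
from sklearn.decomposition import PCA

X = load_digits().data

mses = []
for k in (2, 8, 16, 32):
    pca = PCA(n_components=k).fit(X)
    # Project to k dimensions, then map back to the original 64-D space
    X_rec = pca.inverse_transform(pca.transform(X))
    mses.append(float(np.mean((X - X_rec) ** 2)))

# mses shrinks as more components are kept: more variance is retained,
# at the cost of less compression
```

Plotting MSE against `k` gives a curve that mirrors the scree plot: a steep initial drop followed by diminishing returns.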
Modifications to the learning rate, momentum, and the number of epochs can significantly impact a neural network's training effectiveness and efficiency. A properly selected learning rate facilitates rapid convergence, whereas a too-high rate may cause divergence or oscillation. Momentum accumulates an exponentially weighted average of past gradients, which accelerates progress through flat regions and damps oscillations, reducing convergence time. Furthermore, adjusting the number of epochs can ensure sufficient training duration for convergence but needs to be balanced to avoid overfitting. Experimenting with these parameters allows practitioners to optimize the training process for better performance and faster convergence.
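The effect of these hyperparameters is easiest to see on a toy objective; the sketch below (the quadratic loss and the specific lr/momentum/epoch values are illustrative assumptions) compares plain SGD against SGD with momentum:

```python
import torch

def train(lr, momentum, epochs):
    """Minimize f(w) = sum((w - 3)^2) and record the loss per epoch."""
    w = torch.zeros(5, requires_grad=True)
    opt = torch.optim.SGD([w], lr=lr, momentum=momentum)
    losses = []
    for _ in range(epochs):
        opt.zero_grad()
        loss = ((w - 3.0) ** 2).sum()
        loss.backward()
        opt.step()
        losses.append(loss.item())
    return losses

# Same learning rate and epoch budget; only momentum differs
slow = train(lr=0.01, momentum=0.0, epochs=50)
fast = train(lr=0.01, momentum=0.9, epochs=50)
# With momentum, the loss falls noticeably faster on this objective
```

Raising `epochs` would let the slow run catch up eventually, which is exactly the trade-off the paragraph describes: momentum and learning rate shape how far each epoch gets you.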
Selecting the appropriate number of clusters (k) in k-means clustering is crucial for accurately reflecting inherent data groupings and ensuring model validity. An incorrect k can lead to overfitting or underfitting, misrepresenting data structure. Effective methods to determine k include the Elbow Method, which analyzes the within-cluster sum of squares to identify diminishing returns, and the Silhouette Coefficient, which measures cluster separation and cohesion. These methods help achieve a balance between accuracy and simplicity, crucial for meaningful cluster interpretations.
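Both methods can be sketched together with scikit-learn (the three-blob synthetic data and the candidate range of k are illustrative assumptions):

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score

# Synthetic data with three well-separated clusters
X, _ = make_blobs(n_samples=300, centers=3,
                  cluster_std=0.6, random_state=0)

inertias, sil = {}, {}
for k in range(2, 7):
    km = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X)
    inertias[k] = km.inertia_              # elbow method: within-cluster SSE
    sil[k] = silhouette_score(X, km.labels_)

# Elbow: look for the k where inertia stops dropping sharply.
# Silhouette: pick the k with the highest score directly.
best_k = max(sil, key=sil.get)
```

On this data both criteria agree on k = 3; on messier real data they can disagree, which is why inspecting both is worthwhile.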
Transforming data using power, exponential, or log transformations helps stabilize variance, improve linear relationships among features, and normalize data distribution, which are critical for effective clustering. These transformations can make patterns more discernible by adjusting skewness and reducing the impact of outliers, thus enhancing the accuracy and reliability of clustering algorithms that assume certain statistical properties of the data, such as k-means.
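A quick NumPy sketch of the skew-reducing effect of a log transform (the lognormal sample is an illustrative assumption; in practice one might use `sklearn.preprocessing.PowerTransformer` to pick the transformation automatically):

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.lognormal(mean=0.0, sigma=1.0, size=5000)   # heavily right-skewed

def skewness(a):
    """Sample skewness: third standardized moment."""
    a = np.asarray(a, dtype=float)
    return float(np.mean((a - a.mean()) ** 3) / a.std() ** 3)

# log1p compresses the long right tail (and handles zeros safely)
x_log = np.log1p(x)

skew_before = skewness(x)       # large positive skew
skew_after = skewness(x_log)    # much closer to symmetric
```

After the transform, distance-based algorithms such as k-means are far less dominated by the few extreme values in the raw tail.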
Transfer learning with a pre-trained model like ResNet-18 involves using a model that has been previously trained on a large and diverse dataset (such as ImageNet). This approach utilizes the existing patterns and features learned by the model, thus requiring only fine-tuning on a smaller, task-specific dataset, such as the ants vs. bees classification. This significantly reduces computation time and the amount of data needed for training, compared to training a CNN from scratch, where the model must learn all patterns from the ground up, often necessitating a large dataset and more computational resources.