Image Classification using MNIST Dataset
Problem Statement:
The task is to build and train a deep learning model that can classify images from a given dataset.
Specifically, we are working with the MNIST dataset, which contains handwritten digits (0-9). The goal
is to create a Convolutional Neural Network (CNN) to classify images into the appropriate digit class
(from 0 to 9). The model will be evaluated on its ability to predict the correct class for test images, and
the performance will be measured using accuracy and loss metrics. The CNN will use techniques such
as convolution layers, max pooling, and dropout for regularization.
Learning Outcomes:
o Understand the key components of a deep learning model for image classification,
including CNNs and their layers (Convolutional, Pooling, Dense).
o Learn how to load, preprocess, and normalize image data for neural network models.
o Construct a Convolutional Neural Network (CNN) using Keras for classifying images.
o Use techniques such as convolution, max pooling, and dense layers in CNNs.
o Evaluate the model using test data and report on the loss and accuracy.
o Create visualizations to track the training and validation accuracy/loss over epochs
using Matplotlib.
o Use the trained model to make predictions on new images and visualize the results.
o Learn about the softmax activation function and how it is used to output probabilities
for each class.
Technologies Used:
1. Python:
o The core programming language used to implement the project and tie the libraries together.
2. Keras:
o A high-level neural networks API, written in Python and capable of running on top of
TensorFlow, used to construct the deep learning model.
3. TensorFlow:
o The deep learning framework that serves as the backend for Keras, performing the underlying numerical computations.
4. NumPy:
o A library for numerical computations, used for handling arrays and data manipulation.
5. Matplotlib:
o A Python plotting library used to visualize the performance of the model by plotting
training/validation accuracy and loss graphs.
6. MNIST Dataset:
o A dataset of 70,000 28x28 grayscale images of handwritten digits (0-9) used for
training and testing the model.
7. Image Preprocessing:
o Techniques like normalization and reshaping used to prepare images for feeding into
the neural network.
Key Concepts:
1. Convolutional Neural Network (CNN):
o A specialized deep learning architecture designed to process structured grid data like
images.
2. Convolutional Layers:
o Layers that apply a convolution operation to the input to extract features from the
image.
3. Max Pooling:
o A downsampling operation that reduces the spatial dimensions (width and height) of
the image, helping the model learn more abstract features.
4. Flattening:
o Converting the multi-dimensional outputs from the convolutional layers into a one-
dimensional vector before feeding it into fully connected (dense) layers.
5. Dense Layers:
o Fully connected layers that allow the network to learn complex representations from
the extracted features.
6. Dropout:
o A regularization technique that randomly deactivates a fraction of neurons during training to reduce overfitting.
7. Softmax Activation:
o A mathematical function used to produce a probability distribution over the classes
(0-9), which is used to predict the most likely class for an image (see the brief NumPy
illustration after the steps listed below).
8. One-Hot Encoding:
o A representation of each label as a binary vector; for example, the digit 3 becomes
[0, 0, 0, 1, 0, 0, 0, 0, 0, 0].
Implementation Steps:
o Reshape the images to match the input shape expected by the CNN.
o Compile the model with the Adam optimizer and categorical cross-entropy loss
function.
o Train the model using the training data and validate it using the test data.
o Plot the training and validation accuracy and loss graphs to assess how well the model
has learned over time.
o Display the image and print the predicted class along with the softmax probabilities.
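To make the softmax activation above concrete, here is a tiny NumPy illustration (the logit values are invented purely for demonstration):

import numpy as np

logits = np.array([1.2, 0.3, 0.1, 4.5, 0.0, 0.2, 0.1, 0.3, 0.5, 0.8])  # raw scores for digits 0-9
probs = np.exp(logits) / np.sum(np.exp(logits))  # softmax: exponentiate, then normalize
print(probs.sum())       # 1.0 -- the outputs form a probability distribution
print(np.argmax(probs))  # 3  -- the most likely digit class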
Conclusion:
o This project demonstrates how to build, train, and evaluate a simple Convolutional
Neural Network (CNN) for image classification using the MNIST dataset. The CNN
successfully classifies handwritten digits based on features extracted through
convolutional layers.
Future Work:
o You can experiment with different architectures, including more convolutional layers
or other types of neural networks like Fully Connected Networks (FNNs) or Recurrent
Neural Networks (RNNs) for different types of tasks.
o The model can be extended to classify more complex datasets like CIFAR-10 or custom
datasets.
Overfitting: It's important to monitor both training and validation performance to avoid
overfitting. Dropout layers and regularization techniques help in mitigating overfitting.
Hyperparameter Tuning: Experimenting with different architectures, learning rates, and other
hyperparameters can lead to better model performance.
Deployment: Once the model is trained and evaluated, you can deploy it using various
frameworks like Flask, Django, or Streamlit for real-time image classification applications.
Importing Necessary Libraries
Code Explanation
import numpy as np
import keras
import tensorflow as tf
Step-by-Step Breakdown
1. import numpy as np
o Why It's Needed: It is widely used in machine learning and deep learning for
handling arrays, performing mathematical operations, and manipulating image data
(as image data is often represented as arrays).
2. import keras
o Why It's Needed: Keras simplifies building and training deep learning models by
providing intuitive APIs.
3. import tensorflow as tf
o Why It's Needed: TensorFlow acts as the backend for Keras. It is responsible for
performing computations, especially on GPUs for faster processing.
4. The Keras layers module (keras.layers):
o Purpose: This module provides building blocks (e.g., dense layers, convolutional
layers, etc.) for constructing neural networks.
o Why It's Needed: Layers are the core components of any neural network. For
instance, a Convolutional Neural Network (CNN) is built by stacking convolutional,
pooling, and dense layers.
5. The Keras models module (keras.models):
o Why It's Needed: A model represents the structure of the neural network, specifying
how layers are connected and interact.
6. The MNIST dataset (keras.datasets.mnist):
o Purpose: The MNIST dataset is a benchmark dataset for image classification,
consisting of 28x28 grayscale images of handwritten digits (0-9).
It includes 60,000 training images and 10,000 test images, making it an ideal
starting point for beginners.
Ease of Development: Libraries like TensorFlow and Keras reduce the complexity of writing
deep learning models from scratch.
Standard Practice: These tools are industry-standard and widely accepted in academia and
research.
Efficient and Scalable: They enable the development of models that can scale to handle real-
world problems.
Ensure that these libraries are installed before running the code. Use the following
commands to install them if necessary:
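For example, the packages can typically be installed with pip (package names assumed from the libraries listed above):

pip install tensorflow keras numpy matplotlib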
Familiarize yourself with the MNIST dataset, as it is a common starting point for image
classification projects.
Data Preparation
Code Explanation
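A minimal sketch of the data-preparation code, consistent with the steps explained below (the variable names X_train, Y_train, X_test, and Y_test match the shapes quoted later in this section):

from keras.datasets import mnist
from keras.utils import to_categorical

# Load the MNIST training and test splits
(X_train, Y_train), (X_test, Y_test) = mnist.load_data()

# Reshape to 4D tensors (samples, 28, 28, 1) and scale pixel values to [0, 1]
X_train = X_train.reshape(-1, 28, 28, 1).astype('float32') / 255
X_test = X_test.reshape(-1, 28, 28, 1).astype('float32') / 255

# One-hot encode the labels into 10 classes (digits 0-9)
Y_train = to_categorical(Y_train, 10)
Y_test = to_categorical(Y_test, 10)

print(X_train.shape)  # (60000, 28, 28, 1)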
Step-by-Step Breakdown
o Why It's Needed: Neural networks work better when categorical labels are
represented as vectors (one-hot encoding). For example, the digit 3 becomes [0, 0, 0,
1, 0, 0, 0, 0, 0, 0].
o Purpose: Loads the MNIST dataset into training and testing sets.
o Key Details:
o Dataset Size: 60,000 training images and 10,000 test images.
o Output Example: (60000, 28, 28) shows there are 60,000 images of size 28x28.
o Key Steps:
Normalization: Divides each pixel value by 255 to scale them between 0 and
1. This helps the model converge faster during training.
o Purpose: Applies the same reshaping and normalization to the test dataset.
o Purpose: Converts the training labels into one-hot encoded format with 10 classes
(digits 0-9).
1. Reshaping:
o Neural networks expect input data to be in a specific shape, especially convolutional
layers that require 4D tensors.
o Adding the channel dimension makes the data compatible with the input format for
CNNs.
2. Normalization:
o Scaling pixel values to the range [0, 1] ensures numerical stability during
computations and faster convergence of the model.
3. One-Hot Encoding:
o This format is ideal for classification tasks, where the model outputs probabilities for
each class.
Output Shapes After Preprocessing
X_train: (60000, 28, 28, 1) — 60,000 images of 28x28 size with 1 channel.
X_test: (10000, 28, 28, 1) — 10,000 images of 28x28 size with 1 channel.
Validation: Always validate the shape of the data before proceeding to model building.
Generalization: The same preprocessing steps (reshaping, normalization, and encoding)
should be applied to any dataset when working with neural networks.
Model Building
Code Explanation
from keras.models import Sequential
from keras.layers import Conv2D, MaxPooling2D, Flatten, Dense, Dropout

model = Sequential()
model.add(Conv2D(16, kernel_size=(3, 3), activation='relu', input_shape=(28, 28, 1)))
model.add(MaxPooling2D(pool_size=(1,1)))
model.add(Flatten())
model.add(Dense(128, activation='relu'))
model.add(Dropout(0.5))
model.add(Dense(10, activation='softmax'))
model.compile(loss='categorical_crossentropy', optimizer='adam',
metrics=['accuracy'])
Step-by-Step Breakdown
1. Importing the Required Modules
Purpose: These modules provide the building blocks to define the CNN architecture:
o Conv2D: Convolutional layer to extract features from the input image.
o Flatten: Converts 2D feature maps into a 1D vector for the dense layers.
o Sequential: A linear stack of layers, where each layer has exactly one input tensor
and one output tensor.
2. Defining the CNN Architecture
Purpose: The Sequential model is used to build a neural network by stacking layers
sequentially.
Details:
o The network starts with a Conv2D layer (16 filters of size 3x3 applied to the 28x28x1
input), which produces the (None, 26, 26, 16) feature maps shown in the model
summary later in this document.
model.add(MaxPooling2D(pool_size=(1,1)))
Details:
o pool_size=(1,1): A pool size of (1,1) leaves the spatial dimensions of the feature map
unchanged; a larger pool size such as (2,2) is what is normally used to downsample.
model.add(Flatten())
Purpose: Converts the 2D feature maps into a 1D vector to feed into the fully connected
layers.
model.add(Dense(128, activation='relu'))
Details:
o 128: Number of neurons in this dense layer.
model.add(Dropout(0.5))
Details:
o 0.5: 50% of the neurons will be randomly deactivated during training.
Details:
o activation='softmax': Converts the outputs into probabilities for each class.
model.compile(loss='categorical_crossentropy', optimizer='adam',
metrics=['accuracy'])
Details:
o metrics=['accuracy']: Tracks the model's accuracy during training and evaluation.
o Convolutional layers extract relevant patterns and features from the input image.
o Pooling layers reduce computational complexity and help prevent overfitting.
3. Classification:
o The dense layers combine the extracted features, and the softmax output layer assigns
a probability to each of the 10 digit classes.
Experimentation: Try changing the number of filters, kernel size, or activation functions to
observe their impact.
Visualization: Plot the model architecture using model.summary() or external tools to
understand the layer-wise structure.
Code Explanation
history = model.fit(X_train, Y_train, batch_size=128,
epochs=30, validation_data=(X_test, Y_test))
Purpose of model.fit()
The fit() method is used to train the CNN model on the training dataset while evaluating its
performance on the validation dataset. Here's a breakdown of the parameters:
Parameters in model.fit()
1. X_train, Y_train:
o The preprocessed training images and their one-hot encoded labels.
2. batch_size=128:
o The number of samples processed before the model updates its weights.
o Larger batch size: Faster training but higher memory consumption.
3. epochs=30:
o Increasing epochs can improve performance but may lead to overfitting if too high.
history is an object that stores training and validation metrics (e.g., loss and accuracy) for
each epoch.
This can be used later for visualization or analysis of the training process.
2. Key Concepts
1. Batch Training:
o The training data is divided into smaller batches to efficiently train the model.
o Helps with memory constraints and smooths out updates to weights.
2. Epochs:
o Each epoch represents one full cycle through the training dataset.
o At the end of each epoch, the model is evaluated on the validation data.
3. Validation Data:
o Data not used for weight updates; here the test set is passed as validation data so that
generalization can be monitored after every epoch.
3. Monitoring Progress
o Loss: A measure of how far off the model's predictions are from the true labels.
o Validation Loss and Accuracy: Corresponding metrics for the validation dataset.
1. Batch Size:
o A batch size of 128 is a good starting point, but it can be adjusted depending on the
dataset size and system memory.
2. Epoch Count:
o 30 epochs work well for this task but can be increased if the model hasn't converged.
3. Analyze history:
o Use the history object to visualize the training and validation curves after training.
Epoch 1/30
Epoch 2/30
...
Epoch 30/30
What’s Next?
After training, visualize the training and validation curves to ensure the model is performing
well.
Code Explanation
model.summary()
1. Purpose of model.summary()
It outlines the layers of the model, the number of parameters, and the shapes of the
input/output tensors for each layer.
1. Layer Type:
o Displays the name and type of each layer (e.g., Conv2D, MaxPooling2D, Dense).
2. Output Shape:
o The shape of the tensor each layer produces (the leading None dimension is the batch size).
3. Number of Parameters:
o The count of trainable weights and biases in that layer.
4. Total Parameters:
o The sum over all layers, split into trainable and non-trainable parameters.
3. Example Output
_________________________________________________________________
 Layer (type)                 Output Shape              Param #
=================================================================
 conv2d (Conv2D)              (None, 26, 26, 16)        160
 max_pooling2d (MaxPooling2D) (None, 26, 26, 16)        0
 flatten (Flatten)            (None, 10816)             0
 dense (Dense)                (None, 128)               1,384,576
 dropout (Dropout)            (None, 128)               0
 dense_1 (Dense)              (None, 10)                1,290
=================================================================
Total params: 1,386,026
Trainable params: 1,386,026
Non-trainable params: 0
_________________________________________________________________
4. Detailed Breakdown
o The reduction from 28x28 to 26x26 is due to the filter size (3x3) and no padding
('valid').
Param #: 160:
o Computed as (filter width × filter height × input channels + 1) × number of filters,
i.e., (3 × 3 × 1 + 1) × 16 = 160.
Pooling Layer
Param #: 0:
o Pooling layers perform a fixed operation and have no trainable weights.
Flatten Layer
o Converts the 26x26x16 tensor into a flat vector with 26 × 26 × 16 = 10,816 features.
Dense Layer
o Param #: 1,384,576: computed as (10,816 + 1) × 128, i.e., one weight per flattened
input feature plus a bias for each of the 128 neurons.
Dropout Layer
Output Shape: (None, 128):
o The shape remains unchanged as dropout only deactivates neurons during training
without altering dimensions.
Output Layer
o Fully connected layer with 10 neurons (one for each digit class).
o Param #: 1,290: computed as (128 + 1) × 10, i.e., one weight per input from the
128-neuron dense layer plus a bias for each of the 10 output neurons.
1. Understanding Parameters:
o Trainable parameters are the weights and biases adjusted during training.
o Non-trainable parameters are fixed values, often in layers like batch normalization.
2. Total Parameters:
o The sum over all layers: 160 + 1,384,576 + 1,290 = 1,386,026, all of which are
trainable in this model.
3. Debugging:
o Use the summary to ensure layer outputs match the expected dimensions.
For larger models, visualize the architecture using external tools like TensorBoard or
plot_model from Keras.
Use the parameter count to assess the model’s complexity relative to the available
computational resources.
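Building on the plot_model tip above, a minimal sketch (plot_model additionally requires the optional pydot and graphviz packages to be installed):

from keras.utils import plot_model

# Write a diagram of the architecture, including layer output shapes, to an image file
plot_model(model, to_file='model.png', show_shapes=True)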
The output includes a breakdown of the model's parameters and their memory usage:
1. Total Parameters:
o All weights and biases in the model (1,386,026 for this architecture).
2. Trainable Parameters:
o Parameters (weights and biases) that the model learns during training.
3. Non-Trainable Parameters:
o Parameters that are not updated during training (0 in this model).
4. Optimizer Parameters:
o Extra state maintained by the optimizer; Adam keeps moment estimates for every
trainable parameter.
Memory Breakdown
The memory usage is calculated based on the number of parameters and their precision:
1. Trainable Parameters:
o Stored as 32-bit floats by default, so their memory footprint is roughly the parameter
count × 4 bytes.
2. Optimizer Parameters:
o Adam's two per-parameter moment estimates add roughly twice that amount again.
3. Model Complexity:
o Ensure the hardware (GPU/CPU) can handle the memory requirements, especially
for larger datasets or deeper models.
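As a rough worked example, assuming the default 32-bit (4-byte) floating-point precision and the parameter count from the model summary above:

total_params = 1_386_026               # total parameters reported by model.summary()
weight_mb = total_params * 4 / 1e6     # float32 weights: about 5.5 MB
adam_mb = total_params * 2 * 4 / 1e6   # Adam's two moment estimates: roughly 11.1 MB more
print(f"Weights: {weight_mb:.1f} MB, Adam optimizer state: {adam_mb:.1f} MB")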
Code Explanation
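A minimal sketch of the evaluation step described below (assuming the preprocessed X_test and Y_test arrays from the data-preparation section):

score = model.evaluate(X_test, Y_test, verbose=0)   # returns [test loss, test accuracy]
print("Test loss: {:.2f}%".format(score[0] * 100))
print("Test accuracy: {:.2f}%".format(score[1] * 100))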
1. Purpose of Evaluation
model.evaluate(): Measures how well the trained model performs on data it has never seen,
returning the test loss and accuracy.
1. model.evaluate():
o Parameters: The preprocessed test images (X_test) and their one-hot encoded labels
(Y_test).
o Output: A list containing the test loss followed by the metrics specified when the
model was compiled (here, accuracy).
o Multiplying the loss and accuracy by 100 converts them to percentages for better
readability.
3. Printing Results:
Loss:
o Example: 2.53% loss means the model performs well on unseen data.
Accuracy:
o Example: 98.14% accuracy means the model correctly classifies ~98% of test images.
o A significant gap between training and test accuracy suggests overfitting.
o Accuracy focuses on correctness, while loss gives a more nuanced view of how
confident the predictions are.
3. Performance Metric:
o High test accuracy reflects that the model generalizes well to unseen data.
1. Improving Accuracy:
o Experiment with deeper architectures, more filters, or additional training epochs (see
Future Work above).
2. Analyzing Loss:
o Monitor if the loss remains consistent across training, validation, and testing phases.
o Check for a widening gap between training and validation loss, which is a sign of
overfitting.
The provided code plots the accuracy achieved during training and validation over all epochs.
Visualization is essential to evaluate the model's performance and detect potential issues like
overfitting or underfitting.
1. Code Explanation
train_acc = history.history['accuracy']
val_acc = history.history['val_accuracy']
history.history:
o Keys: 'accuracy', 'val_accuracy', 'loss', and 'val_loss', each holding one value per epoch.
1. X-Axis: Epochs
2. Y-Axis: Accuracy
3. plt.plot():
o Plots the training and validation accuracy values against the epoch number.
o Labels: Each call is given a label so the legend can tell the two curves apart.
4. plt.title():
o Adds a title to the graph: "Training and Validation Accuracy".
5. plt.xlabel() / plt.ylabel():
o Label the axes ("Epochs" and "Accuracy").
6. plt.legend():
o Adds a legend to differentiate the training and validation curves.
7. plt.show():
o Displays the plot.
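Putting these calls together, a minimal sketch of the accuracy plot (the label strings are illustrative):

import matplotlib.pyplot as plt

train_acc = history.history['accuracy']
val_acc = history.history['val_accuracy']

plt.plot(train_acc, label='Training Accuracy')
plt.plot(val_acc, label='Validation Accuracy')
plt.title('Training and Validation Accuracy')
plt.xlabel('Epochs')
plt.ylabel('Accuracy')
plt.legend()
plt.show()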
4. Example Output
X-Axis (Epochs): Ranges from 1 to the total number of epochs (e.g., 30).
Two Lines: One for training accuracy and one for validation accuracy.
1. Ideal Scenario:
o Training and validation accuracy curves align closely, indicating good generalization.
2. Overfitting:
o Training accuracy keeps increasing while validation accuracy plateaus or drops.
3. Underfitting:
o Both training and validation accuracy remain low, suggesting the model has not
learned enough.
Always monitor training and validation accuracy to ensure the model is learning effectively.
Compare accuracy with the loss values to ensure consistent performance across metrics.
This code generates a plot comparing the training and validation loss throughout the training
process. Visualizing the loss helps evaluate model performance and identify issues like overfitting or
underfitting.
1. Code Explanation
train_loss = history.history['loss']
val_loss = history.history['val_loss']
history.history:
o This object stores the loss and accuracy values for both training and validation during
the training process.
o Keys: 'loss' and 'val_loss' hold the training and validation loss for each epoch.
plt.plot(train_loss, label='Training Loss')
plt.plot(val_loss, label='Validation Loss')
plt.title('Training and Validation Loss')
plt.xlabel('Epochs')
plt.ylabel('Loss')
plt.legend()
plt.show()
plt.title(): Adds a title to the plot: "Training and Validation Loss".
plt.legend(): Adds a legend to differentiate the lines for training and validation loss.
5. Example Output
X-Axis (Epochs): Ranges from 1 to the total number of epochs (e.g., 30).
Two Lines: One for training loss and one for validation loss.
1. Ideal Scenario:
o Validation loss should ideally be close to training loss, indicating good generalization.
2. Overfitting:
o Training loss continues to decrease, but validation loss starts increasing after a point.
3. Underfitting:
o Both training and validation loss remain high, indicating the model has not learned
the underlying patterns.
4. Convergence:
o If both training and validation loss converge to similar low values, it indicates the
model is generalizing well to the test data.
Always monitor both training and validation loss to detect overfitting or underfitting early.
If the validation loss starts increasing while training loss decreases, it’s a sign to stop training
or use regularization techniques.
Document and save these plots to assess model performance for future reference or in
project reports.
Summary:
Visualizing the loss for both training and validation data is crucial for understanding how well the
model is performing and detecting potential issues like overfitting.