Module 1

good

Uploaded by

Shantanu Sen

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views

Module 1

good

Uploaded by

Shantanu Sen

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

Module 1: Garbage Classification Preprocessing

Introduction:
The Garbage Classification Preprocessing module aims to prepare the dataset for garbage
classification into six classes: 'cardboard', 'glass', 'metal', 'paper', 'plastic', and 'trash'. This
module performs image data loading, resizing, conversion to grayscale, class distribution
visualization, and sample image visualization.

Dependencies:
- google.colab: The module assumes the code is run on Google Colab, as it mounts the Google
Drive to access the dataset.
- PIL (Python Imaging Library): Used for image processing tasks like resizing and converting to
grayscale.
- pathlib.Path: Utilized for handling file paths and directory operations.
- os: To work with file and directory paths.
- numpy: Used to handle counts for class distribution visualization.
- matplotlib.pyplot: For plotting class distribution and sample images.
- torchvision.datasets.ImageFolder: To load the dataset and apply transformations.

Functionality:

Data Loading and Image Resizing:

● The code imports the necessary libraries and mounts Google Drive to access the
dataset zip file located at
'/content/drive/MyDrive/Projects/Garbage/garbage_dataset.zip'.
● The zip file is extracted into the '/content/Garbage/original_images' directory.
● Images are loaded using ImageFolder from torchvision.datasets with resizing
transformation applied to ensure uniformity (32x32 pixels).

Class Distribution Visualization:

● The class distribution of the dataset is visualized using a bar plot. The counts of images
in each class are computed using os.listdir and len functions.
● Class names and corresponding image counts are stored in lists for plotting.
● A bar plot is generated using matplotlib.pyplot to visualize the class distribution, with
each bar representing a garbage class and its height corresponding to the number of
images.

Image Preprocessing:
● The convert_scale function is defined to handle image preprocessing tasks for each
class.
● For each garbage class, it creates paths for the original and processed image
directories.
● All image filenames in the original directory are retrieved using os.listdir.
● Images are loaded, resized to 32x32 pixels using Image.open and resize, and converted
to grayscale using convert('L').
● Processed grayscale images are saved to the corresponding processed image directory.
● Preprocessing is performed for all images in each class using a loop.

Sample Image Visualization:

● The get_image_paths function retrieves a sample of 5 image file paths from a given
folder path.
● The plot_sample_images function visualizes the sample images in a single row, using
matplotlib.pyplot.
● Sample images are loaded using Image.open and plotted in subplots using plt.imshow.
● The axes are hidden using plt.axis('off'), and the figure is displayed using plt.show().

Usage:
The Garbage Classification Preprocessing module simplifies the dataset preparation process by
automating image loading, resizing, conversion to grayscale, and class distribution visualization.
The code efficiently preprocesses the dataset, making it suitable for garbage classification
tasks. Additionally, the sample image visualization allows users to inspect the quality of the
processed images and gain insights into the variety of data available for training or testing the
classification model.

1. Import the necessary libraries and mount Google Drive to access the dataset.
2. Load the original images, resize them to 32x32 pixels, convert them to grayscale, and save
the processed images in '/content/Garbage/processed_images'.
3. Visualize the class distribution in the dataset using a bar plot to understand the data
distribution across different garbage classes.
4. Obtain a sample of 5 images from each class in both the original and processed datasets for
visual inspection.
5. Display the sample images in rows for each class to better understand the quality and
diversity of the data after preprocessing.

The Garbage Classification Preprocessing module ensures the dataset is adequately prepared
and ready for subsequent steps in the garbage classification process. The image preprocessing
steps enable the model to handle consistent input sizes and simplified feature representations.
Users can use the class distribution visualization to assess dataset balance and identify
potential data augmentation needs. By visualizing sample images, users gain insights into the
dataset's content, ensuring data quality and aiding model selection and design decisions.

Module 1: Air Quality Analysis

This module focuses on analyzing air quality data collected across various states in India. The
goal is to calculate the Air Quality Index (AQI) and gain insights into air pollution trends over
time. The analysis involves several technical details and preprocessing techniques.
Data Loading and Preprocessing:
- Import necessary libraries: NumPy, Pandas, Matplotlib, Seaborn, and Basemap.
- Load air quality data from the "../input/" directory using Pandas.
- Fill missing values in the data with zeros for consistent analysis.

Calculating Individual Pollutant Indices:

- Define functions to calculate individual pollutant indices for SO2, NO2, RSPM, and SPM.
- The indices are computed based on predefined ranges specified by Indian government
standards.

Calculating Air Quality Index (AQI):

- Calculate AQI based on individual pollutant indices for each data point.
- The AQI represents the overall air quality and is determined by the highest value among the
individual pollutant indices.

Visualization of AQI Across India:

- Use the Basemap library to plot a map of India.
- Fetch geographical locations (latitude and longitude) of the states from another dataset.
- Represent AQI values on the map using markers of varying sizes and colors.

Time Series Analysis and Visualization:

- Perform time series analysis on AQI trends over the years.
- Resample data to get monthly average AQI values.
- Use seasonal decomposition to observe patterns, trends, and seasonality in the data.

This air quality analysis involves extensive preprocessing techniques, including data cleaning,
calculating individual pollutant indices, and computing the AQI. The visualization techniques
provide valuable insights into air pollution patterns and trends across India. Policymakers can
use these insights to formulate effective strategies for improving air quality and public health.
The time series analysis helps in identifying seasonal patterns and long-term trends, enabling
better decision-making for environmental management.

Module 1: Pollution Estimation System - Advanced Data Preprocessing

Introduction:
The Pollution Estimation System's Advanced Data Preprocessing module aims to optimize the
dataset for accurate pollution level estimation based on smoke intensity in images. This
sophisticated module incorporates intricate image processing techniques, data augmentation,
and advanced data visualization to enhance the model's performance and robustness.

Dependencies:
- OpenCV (cv2): Utilized for image processing tasks like loading, resizing, and color space
conversion.
- NumPy: For numerical operations and array manipulations.
- Matplotlib.pyplot: Used for plotting class distribution and sample images.
- Skimage (scikit-image): Enables advanced image processing functionalities.

Functionality:

1. Data Loading and Image Resizing:

- Import necessary libraries: OpenCV and NumPy.
- Load images from the dataset directory.
- Resize all images to a consistent size, such as 224x224 pixels, using advanced interpolation
techniques (e.g., Lanczos).
- Apply data normalization to scale pixel values to a range suitable for the model (e.g., [0, 1]).

2. Image Augmentation:
- Implement data augmentation techniques using Skimage to enhance dataset diversity and
robustness.
- Apply random rotations, flips, and brightness adjustments to generate augmented images.
- Save the augmented images in a separate directory to be used during model training.

3. Class Distribution Visualization:

- Calculate the number of images in each class to visualize class distribution.
- Store class names and corresponding image counts in lists for plotting.
- Utilize Matplotlib.pyplot to generate a bar plot, representing each class and its image count.
- This visualization aids in understanding the data distribution across different pollution levels.

4. Image Preprocessing:
- Define complex image preprocessing functions to handle resizing, color space conversion, and
enhancement.
- Use advanced image processing techniques, such as histogram equalization and denoising.
- Implement region of interest (ROI) extraction to focus on relevant image areas containing
smoke.
- Apply edge detection algorithms to emphasize smoke boundaries and features.
- Save the preprocessed images in a separate directory to be used in the pollution level
estimation model.

Usage:
The Advanced Data Preprocessing module significantly optimizes the dataset for accurate and
robust pollution level estimation. The incorporation of complex image processing techniques,
data augmentation, and data visualization enhances the model's ability to handle diverse and
challenging environmental conditions. The advanced image preprocessing ensures that the
model receives meaningful and relevant input representations, resulting in improved estimation
accuracy.
1. Import necessary libraries and load images from the 'dataset' directory.
2. Resize all images to a consistent size (e.g., 224x224 pixels) using advanced interpolation
techniques.
3. Apply data normalization to scale pixel values to a suitable range (e.g., [0, 1]).
4. Implement data augmentation techniques to increase dataset diversity and robustness.
5. Visualize the class distribution using a bar plot to understand data distribution across different
pollution levels.
6. Implement complex image preprocessing functions to handle resizing, color space
conversion, enhancement, ROI extraction, and edge detection.
7. Save the preprocessed images in a separate directory for subsequent use.
8. Obtain a sample of 5 images from each class for visual inspection.
9. Display the sample images along with their augmented versions to assess dataset diversity.

The Advanced Data Preprocessing module significantly enhances the dataset's quality and
prepares it for subsequent steps in the pollution level estimation process. By applying advanced
image processing techniques and data augmentation, the module ensures that the model can
effectively handle various environmental scenarios and make accurate pollution level
predictions. Users can use the class distribution visualization to assess dataset balance and
identify potential data augmentation needs. The visualization of sample images provides
valuable insights into the dataset's diversity and prepares users for potential challenges in
model development.

Final Year Game Project Proposal
50% (4)
Final Year Game Project Proposal
11 pages
Report 3
No ratings yet
Report 3
7 pages
Speech M1
No ratings yet
Speech M1
4 pages
M Shahab 1710305075121
No ratings yet
M Shahab 1710305075121
21 pages
ai project 1
No ratings yet
ai project 1
10 pages
Project Manual - Team 591965
No ratings yet
Project Manual - Team 591965
27 pages
DL - Image Recognition System
No ratings yet
DL - Image Recognition System
3 pages
A.I Final Report
No ratings yet
A.I Final Report
39 pages
CENG 3264: Engin Des and Project MGMT - Lab 3 Matlab Tutorial II - Image Processing
No ratings yet
CENG 3264: Engin Des and Project MGMT - Lab 3 Matlab Tutorial II - Image Processing
5 pages
Anomaly Detection in Images CIFAR-10
No ratings yet
Anomaly Detection in Images CIFAR-10
9 pages
All Projects F23
No ratings yet
All Projects F23
145 pages
AIML Hard
No ratings yet
AIML Hard
22 pages
Assignment_1
No ratings yet
Assignment_1
6 pages
MIC Assignment4
No ratings yet
MIC Assignment4
9 pages
Project Progress Review Report[1] (Autorecovered)
No ratings yet
Project Progress Review Report[1] (Autorecovered)
5 pages
Traffic Signs Recognition
No ratings yet
Traffic Signs Recognition
12 pages
PPT Script
No ratings yet
PPT Script
10 pages
Low-level Design and Implementation Details
No ratings yet
Low-level Design and Implementation Details
13 pages
AI Work
No ratings yet
AI Work
11 pages
DIP Mini Project
100% (1)
DIP Mini Project
12 pages
ACFrOgCfX9ATrHm9ZSjs1HLKnJCXmmPcIwFi Y7hVAv6zU1Li3igjIXOOLtGhffODBql8a993YAsc3gM SE8bidlMJr2eFkl9eJB0BU8jcLD6iWrroxwbp1 X9yQtpQks6r8vMLEnR-ORk02lgVJ
No ratings yet
ACFrOgCfX9ATrHm9ZSjs1HLKnJCXmmPcIwFi Y7hVAv6zU1Li3igjIXOOLtGhffODBql8a993YAsc3gM SE8bidlMJr2eFkl9eJB0BU8jcLD6iWrroxwbp1 X9yQtpQks6r8vMLEnR-ORk02lgVJ
20 pages
Himanshu Gupta Configuration Manual
No ratings yet
Himanshu Gupta Configuration Manual
16 pages
DMV & ML Lab
No ratings yet
DMV & ML Lab
103 pages
AttiqAhmadAfsarMLAssignment3Flask
No ratings yet
AttiqAhmadAfsarMLAssignment3Flask
9 pages
Reverse Image Search Report
No ratings yet
Reverse Image Search Report
5 pages
Recolored Image Detection Via A Deep Discriminative Model: P.Yugandhar Sharma 1304-21-862-024
No ratings yet
Recolored Image Detection Via A Deep Discriminative Model: P.Yugandhar Sharma 1304-21-862-024
67 pages
E0 271: Graphics and Visualization Assignment #2: Goals
No ratings yet
E0 271: Graphics and Visualization Assignment #2: Goals
2 pages
Abhishek - Abhishek Bhattacharya
No ratings yet
Abhishek - Abhishek Bhattacharya
3 pages
It TG Graphics Creation 9626
No ratings yet
It TG Graphics Creation 9626
40 pages
Untitled Document
No ratings yet
Untitled Document
5 pages
BigData Assessment2 26230605
No ratings yet
BigData Assessment2 26230605
14 pages
Aiml Lab Week 5 2 (1)
No ratings yet
Aiml Lab Week 5 2 (1)
3 pages
Project Title
No ratings yet
Project Title
4 pages
CUSTOMER SEGMENTATION 2
No ratings yet
CUSTOMER SEGMENTATION 2
19 pages
Image classification
No ratings yet
Image classification
8 pages
FinalGrpPpt
No ratings yet
FinalGrpPpt
20 pages
Convolutional Neural Network Project On Image Classification
No ratings yet
Convolutional Neural Network Project On Image Classification
8 pages
Image Processing System: 1. Background
No ratings yet
Image Processing System: 1. Background
4 pages
DS_slide
No ratings yet
DS_slide
15 pages
Green Minimalist Professional Business Proposal Presentation
No ratings yet
Green Minimalist Professional Business Proposal Presentation
20 pages
Computer Graphics Lab Manual
No ratings yet
Computer Graphics Lab Manual
54 pages
Review Paper On Image Processing in Distributed Environment: Sandeep Kaur, Manisha Bhardwaj
No ratings yet
Review Paper On Image Processing in Distributed Environment: Sandeep Kaur, Manisha Bhardwaj
2 pages
ANN MINI Project1 - Removed
No ratings yet
ANN MINI Project1 - Removed
9 pages
Document from Deependra singh (1)
No ratings yet
Document from Deependra singh (1)
10 pages
Edutactile - A Tool For Rapid Generation of Accurate Guideline-Compliant Tactile Graphics For Science and Mathematics
No ratings yet
Edutactile - A Tool For Rapid Generation of Accurate Guideline-Compliant Tactile Graphics For Science and Mathematics
8 pages
Sketch4Match - Content-Based Image Retrieval System Using Sketches
No ratings yet
Sketch4Match - Content-Based Image Retrieval System Using Sketches
17 pages
CAPSTONE_PROJECT
No ratings yet
CAPSTONE_PROJECT
47 pages
UNITIV.BtechIot
No ratings yet
UNITIV.BtechIot
43 pages
Assignment 2
No ratings yet
Assignment 2
4 pages
Fundamentals of Image Processing Lab Manual 2014 PDF
No ratings yet
Fundamentals of Image Processing Lab Manual 2014 PDF
68 pages
Jangan Hapus 1
No ratings yet
Jangan Hapus 1
14 pages
Assignment_2
No ratings yet
Assignment_2
3 pages
Cahpter 3to 7
No ratings yet
Cahpter 3to 7
6 pages
Bandita Internship
No ratings yet
Bandita Internship
23 pages
Report23 24
No ratings yet
Report23 24
55 pages
npmini-1-4_merged_removed
No ratings yet
npmini-1-4_merged_removed
4 pages
NB4-07 PT II Data Augmentation
No ratings yet
NB4-07 PT II Data Augmentation
6 pages
Assignment 3
No ratings yet
Assignment 3
3 pages
CSL7620_A2
No ratings yet
CSL7620_A2
2 pages
ME P4252-II Semester - MACHINE LEARNING
No ratings yet
ME P4252-II Semester - MACHINE LEARNING
46 pages
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet
Java Training Philippines
No ratings yet
Java Training Philippines
9 pages
DSE 20.1F Computer Architecture and Networks
No ratings yet
DSE 20.1F Computer Architecture and Networks
3 pages
Life - Church Brand Guidelines: The Dot Is Silent
No ratings yet
Life - Church Brand Guidelines: The Dot Is Silent
61 pages
Artificial Intelligence: International Conference
No ratings yet
Artificial Intelligence: International Conference
4 pages
Actual vs. Budget Ytd Year 2021: G/L Code Account Title Actual Budget Remaining $ Remaining %
No ratings yet
Actual vs. Budget Ytd Year 2021: G/L Code Account Title Actual Budget Remaining $ Remaining %
4 pages
Javalog 20230109
No ratings yet
Javalog 20230109
3 pages
T4user Guide
No ratings yet
T4user Guide
226 pages
Self-Quiz Unit 2 - Attempt Review
No ratings yet
Self-Quiz Unit 2 - Attempt Review
5 pages
Service Interface User Guide, February 2022 DOCA0170EN-02
No ratings yet
Service Interface User Guide, February 2022 DOCA0170EN-02
64 pages
MVS
100% (1)
MVS
226 pages
Computer Viruses
No ratings yet
Computer Viruses
3 pages
PG1F05D Tok60e23a462b81d
No ratings yet
PG1F05D Tok60e23a462b81d
5 pages
This Study Resource Was: Chapter 16-Network Security
No ratings yet
This Study Resource Was: Chapter 16-Network Security
6 pages
Paper Dissection Lab - Playdough Mats A4 Twinkle
No ratings yet
Paper Dissection Lab - Playdough Mats A4 Twinkle
6 pages
Full Download Internet of Things with Raspberry Pi and Arduino 1st Edition Rajesh Singh (Author) PDF DOCX
100% (5)
Full Download Internet of Things with Raspberry Pi and Arduino 1st Edition Rajesh Singh (Author) PDF DOCX
65 pages
VS Code Keyboard Shortcuts
No ratings yet
VS Code Keyboard Shortcuts
14 pages
78788417 TIA Portal Und PLCSIM Advanced in Der Virtuellen Maschine En
No ratings yet
78788417 TIA Portal Und PLCSIM Advanced in Der Virtuellen Maschine En
11 pages
Android Development For C# and Visual Studio 2012
No ratings yet
Android Development For C# and Visual Studio 2012
2 pages
DB Present
No ratings yet
DB Present
22 pages
Tuto Audit
No ratings yet
Tuto Audit
1 page
Channel Coding For Underwater Acoustic Communication System
No ratings yet
Channel Coding For Underwater Acoustic Communication System
4 pages
Introduction To Computer Science I Course Syllabus: Barbara - Hecker@csueastbay - Edu
No ratings yet
Introduction To Computer Science I Course Syllabus: Barbara - Hecker@csueastbay - Edu
3 pages
S - AC0 - 52000888 Report of Vendor Bal PC Wise
No ratings yet
S - AC0 - 52000888 Report of Vendor Bal PC Wise
10 pages
XGB-U +manual V1.0+XBC-DN32UUAUP XBC-DR28UUAUP PDF
No ratings yet
XGB-U +manual V1.0+XBC-DN32UUAUP XBC-DR28UUAUP PDF
1,322 pages
Neural Networks
No ratings yet
Neural Networks
13 pages
Tips
No ratings yet
Tips
14 pages
Townsend Labs Sphere L22 Microphone System User Guide
No ratings yet
Townsend Labs Sphere L22 Microphone System User Guide
59 pages
Machine-Learning-Algorith 8800284 Powerpoint
No ratings yet
Machine-Learning-Algorith 8800284 Powerpoint
10 pages
Lab 1.6.2
No ratings yet
Lab 1.6.2
22 pages