AI Music Genre Classification Using MLP

Uploaded by

Mustafa Ali

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

148 views15 pages

AI Music Genre Classification Using MLP

Uploaded by

Mustafa Ali

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

Music Genre Classification

DONE BY:

MAHEK TAZEEN-1208
SYED MUSTAFA ALI-1240
NILOUFER KHANAM-1246
ABSTRACT:

Artificial Intelligence (AI) and Machine Learning stand as pivotal technological

advancements,
reshaping diverse domains such as computing, finance, healthcare, agriculture, music,
space, and tourism. This study focuses on the intricate realm of audio analysis within
AI, encompassing music information retrieval, generation, and classification.
Extracting meaningful features from music data poses a significant challenge, leading
to the exploration of various algorithms, including classical and hybrid neural
networks. Our investigation revolves around music genre classification, emphasizing
the effectiveness of a Multi-Layer Perceptron (MLP) compared to traditional methods
like Support Vector Classifier (SVC), Logistic Regression, and others. The evaluation
utilizes a subset of the Free Music Archive (FMA) dataset, with the proposed MLP
achieving an accuracy of 99.55% and a validation accuracy of 92.21%. The baseline
model, SVC, is replaced by MLP, showcasing its superior performance. The study
observes that image- based features outperform conventional audio-extracted features
in label classification. Integration of Flask web enables real-time music genre detection.
This paper provides an in- depth exploration of music genre classification methods,
emphasizing the efficacy of MLP and introducing a web-based detection approach.
Contents
 Introduction
 Scope of the project
 System Architecture
 Proposed system and Existing system
 Proposed Technique and Existing Technique
 Modules
 System Requirements
Introduction

Music is the art of combining various vocal and instrumental sounds to produce a melody or
rhythm. Music can be classified into a broad range of genres, some of the most popular ones
being blues, classical, country, disco, hip hop, jazz, metal, pop, reggae, rock. The way people
consume music has also developed with the advancement of technology. Music genre
classification, a subset of music information retrieval, is a challenging and progressive task in the
AI domain. It basically involves using machine learning concepts and algorithms to recognize
the genre of a particular music audio file, the style or category of the music. Classifying the genre
of a piece of music automatically has manifold uses in this modern world. The applications are
plenty. It can be used in an audio streaming platform or app (e.g. Soundcloud, Spotify, Gaana) for
categorization and music recommendation, which then can be used to curate playlists based on the
genre. The algorithm can be simply released as a product and can be used as a music identification
app (e.g. Shazam). It can also be used in smart bots like Alexa, Google Assistant, Siri present in
our smartphones to enhance the music listening experience of the user.
SCOPE OF THE PROJECT
The goal of this project is to explore the complex field of AI-driven audio analysis, with a
particular emphasis on the classification of musical genres via sophisticated machine
learning methods. The study broadens its scope to investigate the possibility of image
based features outperforming standard audio-extracted features by examining the
efficiency of a Multi-Layer Perceptron (MLP) in contrast to conventional methods. The
scope is further expanded with the use of Flask web technology, which allows for real-
time genre recognition of music and has useful applications in audio streaming platforms,
voice assistants, and music identification apps.
SYSTEM ARCHITECTURE
EXISTING SYSTEM:

• Recognizing the increasing significance of music in people's lives, the current Music Genre Classification Model attempts to classify
audio music into genres. But with the development of technology and the accessibility of the internet, a more precise and efficient
categorization model is desperately needed.
• The current method uses spectrograms and auditory characteristics to classify genres using machine learning approaches, namely
Support Vector Machine (SVM) and Random Forest Classifier.

PROPOSED SYSTEM

• Our suggested approach redefines the classification of musical genres by utilizing machine learning and artificial intelligence (AI). Our methodology,
which focuses on the Free Music Archive (FMA) dataset, is based on using a state-of-the-art Multi-Layer Perceptron (MLP) to get better validation
results and accuracy.
• This system surpasses conventional techniques by substituting MLP for the standard Support Vector Classifier (SVC), showcasing its ability to
transform the audio analysis field. The use of Flask web enhances the application's usability and accessibility by enabling real-time recognition
of music genres.
EXISTING TECHNIQUE: -
 Random Forest Classifier & Support Vector Machine (SVM):
 For the purpose of classifying music genres, the current method uses Random Forest
Classifier, works based on decision trees. Random forest creates uncorrelated forest
of trees whose prediction is more accurate than that of a single tree and Support
Vector Machine (SVM), creates a hyper plane that distinguish between two different
classes. The algorithm improves the complexity of the classifier by performing
structural risk minimization to achieve good generalization performance.
 These machine learning techniques classify music into genres by examining
spectrograms and acoustic characteristics. This method has helped create functioning
model, but as technology advances and more sophisticated methods are used.
Although the existing method is a good starting point, a more advanced and
comprehensive approach to music genre classification is necessary given how quickly
technology is advancing.
PROPOSED TECHNIQUE USED OR ALGORITHM USED:

 MULTI-LAYER PERCEPTRON (MLP):

 Our suggested method’s fundamental component is the categorization of musical genres using a Multi-Layer Perceptron
(MLP). This advanced neural network outperforms conventional techniques like logistic regression and support vector
classifier (SVC) in its ability to learn intricate patterns. Our MLP-based method, which was trained on the FMA dataset’s
complexities, achieves an astounding 99.55% accuracy and a validation accuracy of 92.21%.
 The choice to give characteristics precedence over traditional extracted features highlights how well MLP captures subtle
musical subtleties. The Flask-powered web-based detection capability makes the system more useful by allowing real-time
genre recognition for a better user experience.
MODULES

o Data Collection
o Data Preparation
o Model Selection
o Model Creation for MLP
o Model Evaluation and Saving
1) Data Collection:

The endeavour starts with gathering the necessary information to categories music genres. The
first audio file in the dataset is played, and its raw data and sample rate are shown using the
librosa library. The audio files in the dataset are located using the glob function. Furthermore,
facts on the dataset are loaded from a CSV file called "datasetinfo.csv," which
contains information on several characteristics that were taken out of the audio recordings.

2) Data Preparation:

To guarantee that the gathered dataset is suitable for model training, careful preparation is
done. This entails taking care of problems including mistakes, missing data, and duplication. To
find correlations between variables, exploratory data analysis approaches are used in
conjunction with data type conversions and normalization. Next, the dataset is judiciously
divided into training kand assessment sets.
3) Model Selection:

A Multi-Layer Perceptron is the algorithm of choice for classifying musical genres (MLP). The
tensorflow/ Keras library is used to define the model architecture, which consists of many dense
layers with dropout regularization. Sparse categorical cross entropy loss and the Adam optimizer
are used to assemble the model. The choice of MLP as an efficient method for identifying
patterns in the music data is highlighted in this section.

4) Model Creation for MLP:

The Multi-Layer Perceptron (MLP) model is built using the sequential model from the Keras
package. Multiple deep layers make up the architecture, which is intended to capture complex
patterns in the collection of musical genres. Five thick layers with different numbers of units and
activation functions make up this MLP model. In the first layer, input data of shape
(x_train.shape[1],) is fed into a ReLU activation function with 512 units. After every dense layer,
dropout layers (with a dropout rate of 0.2) are inserted deliberately to improve the model's
capacity to generalize and reduce over fitting. Finding the genre with the highest probability is
done by analyzing the probabilities for each class of music genres in the final dense layer, which
has 10 units and a softmax activation function. To make training easier, the sparse categorical
cross entropy loss and Adam optimizer are included in the model compilation.
5) Model Evaluation and Saving:

Analyzing the probabilities for each class of musical genres in the last dense layer—which has
10 units and a softmax activation function will help you determine which genre has the highest
likelihood. In order to facilitate training, the model compilation includes the Adam optimizer
and sparse categorical cross entropy loss.
SYSTEM REQUIREMENTS

HARDWARE REQUIREMENTS
The hardware requirements may serve as the basis for a contract for the implementation of the system
and should therefore be a complete and consistent specification of the whole system. They are used by software
engineers as the starting point for the system design. It should what the system do and not how it should be
implemented.
•
PROCESSOR : DUAL CORE 2 DUOS.
•RAM : 4GB DD RAM
•HARD DISK : 250 GB
SOFTWARE REQUIREMENTS

The software requirements document is the specification of the system. It should include both a definition
and a specification of requirements. It is a set of what the system should do rather than how it should do it.
The software requirements provide a basis for creating the software requirements specification. It is useful
in estimating cost, planning team activities, performing tasks and tracking the teams and tracking the
team’s progress throughout the development activity.

•Operating System : Windows 7/8/10

•Platform : Vs Code/ Spyder3
•Programming Language : Python
•Front End : HTML, CSS

Music Genre Classification Slides
No ratings yet
Music Genre Classification Slides
15 pages
Biomedical Signal Processing Course
No ratings yet
Biomedical Signal Processing Course
2 pages
Course Description - Biosignal Processing BME401 Fall-2022
No ratings yet
Course Description - Biosignal Processing BME401 Fall-2022
4 pages
FPGA-Based Neural Network for Arrhythmia Detection
No ratings yet
FPGA-Based Neural Network for Arrhythmia Detection
16 pages
Bioelectric Measurement Techniques
100% (1)
Bioelectric Measurement Techniques
13 pages
Elective-II Soft Computing
No ratings yet
Elective-II Soft Computing
3 pages
Solutions for Duda's Pattern Classification
No ratings yet
Solutions for Duda's Pattern Classification
77 pages
IIR Filters
100% (1)
IIR Filters
36 pages
CCS369 - TSS-Unit 5
No ratings yet
CCS369 - TSS-Unit 5
23 pages
Musical Signal Processing With Labview All Modules 3.9 PDF
No ratings yet
Musical Signal Processing With Labview All Modules 3.9 PDF
167 pages
Detailed Lesson Plan-Dsp
No ratings yet
Detailed Lesson Plan-Dsp
6 pages
Pattern Recognition Unit 1,2
No ratings yet
Pattern Recognition Unit 1,2
82 pages
FFT and DFT in Digital Signal Processing
No ratings yet
FFT and DFT in Digital Signal Processing
25 pages
Fundamental Steps of Digital Image Processing
No ratings yet
Fundamental Steps of Digital Image Processing
30 pages
ECG Signal Detection with MATLAB
100% (1)
ECG Signal Detection with MATLAB
10 pages
2.human Emotion Based Music Player Using OpenCV and Deep Learning Using Raspberry Pi
No ratings yet
2.human Emotion Based Music Player Using OpenCV and Deep Learning Using Raspberry Pi
2 pages
Computer Organization and Architecture: UNIT-2
No ratings yet
Computer Organization and Architecture: UNIT-2
29 pages
Tractable vs Intractable Problems in VLSI
No ratings yet
Tractable vs Intractable Problems in VLSI
6 pages
2 Binning Techniques in Data Mining With Examples
No ratings yet
2 Binning Techniques in Data Mining With Examples
10 pages
Multimedia Compression Question Bank
100% (4)
Multimedia Compression Question Bank
16 pages
Xc4000 Fpga Mod 4
No ratings yet
Xc4000 Fpga Mod 4
9 pages
Electrode Theory
No ratings yet
Electrode Theory
93 pages
Facets of Data Important
No ratings yet
Facets of Data Important
4 pages
Power Spectrum Estimation in MATLAB
No ratings yet
Power Spectrum Estimation in MATLAB
3 pages
Difference Between Mealy Machine and Moore Machine Gate Notes 46
No ratings yet
Difference Between Mealy Machine and Moore Machine Gate Notes 46
2 pages
Understanding Version Spaces in ML
No ratings yet
Understanding Version Spaces in ML
26 pages
Mi Unit 1
100% (1)
Mi Unit 1
43 pages
FFT Algorithms and DFT Basics
No ratings yet
FFT Algorithms and DFT Basics
37 pages
Designing A Learning System
100% (1)
Designing A Learning System
3 pages
4-Data Cleaning, Data Integration, Data Transformation, Data Reduction-03-02-2024
No ratings yet
4-Data Cleaning, Data Integration, Data Transformation, Data Reduction-03-02-2024
22 pages
1 Elements, Variables and Data Categorization
No ratings yet
1 Elements, Variables and Data Categorization
27 pages
ME 306 Class Notes - 31
No ratings yet
ME 306 Class Notes - 31
10 pages
MATLAB Array Basics and Lab Assignment
No ratings yet
MATLAB Array Basics and Lab Assignment
5 pages
Speech Processing Question Paper
No ratings yet
Speech Processing Question Paper
6 pages
Architecture: Simple Neural Nets For Pattern Classification
No ratings yet
Architecture: Simple Neural Nets For Pattern Classification
15 pages
Image Filtering Techniques Overview
0% (1)
Image Filtering Techniques Overview
56 pages
Fundamentals of Speech Recognitiony - Lawrence Rabiner - Biing-Hwang Juang PDF
No ratings yet
Fundamentals of Speech Recognitiony - Lawrence Rabiner - Biing-Hwang Juang PDF
546 pages
Computer Vision and Image Processing - Fundamentals and Applications
No ratings yet
Computer Vision and Image Processing - Fundamentals and Applications
46 pages
Finite Word Length Effects
No ratings yet
Finite Word Length Effects
31 pages
Advanced Machine Learning Course Overview
No ratings yet
Advanced Machine Learning Course Overview
7 pages
Smart Automation Audio Classification System
No ratings yet
Smart Automation Audio Classification System
6 pages
Digital System Design Using Verilog
No ratings yet
Digital System Design Using Verilog
2 pages
MFCC Computation for Speech Recognition
100% (2)
MFCC Computation for Speech Recognition
6 pages
Digital Signal Processing Prof. S. C. Dutta Roy Department of Electrical Engineering Indian Institute of Technology, Delhi Lecture - 38 FIR Design
No ratings yet
Digital Signal Processing Prof. S. C. Dutta Roy Department of Electrical Engineering Indian Institute of Technology, Delhi Lecture - 38 FIR Design
18 pages
Biomedical Engineering Exam Questions
No ratings yet
Biomedical Engineering Exam Questions
2 pages
BM3411 Set3
No ratings yet
BM3411 Set3
2 pages
Data Mining Assignment Analysis
No ratings yet
Data Mining Assignment Analysis
10 pages
Pattern Recognition and Classification Overview
No ratings yet
Pattern Recognition and Classification Overview
20 pages
AI Problem Solving Methods Overview
No ratings yet
AI Problem Solving Methods Overview
63 pages
Laboratory Manual: Faculty of Engineering and Technology Bachelor of Technology
No ratings yet
Laboratory Manual: Faculty of Engineering and Technology Bachelor of Technology
50 pages
DSP Applications in Telecommunication & Biomedicine
No ratings yet
DSP Applications in Telecommunication & Biomedicine
11 pages
Diagnostic Equipment Question Bank
No ratings yet
Diagnostic Equipment Question Bank
4 pages
Aai Report
No ratings yet
Aai Report
17 pages
Irjet Music Information Retrieval and Ge
No ratings yet
Irjet Music Information Retrieval and Ge
8 pages
Music Genre Classification with ML
No ratings yet
Music Genre Classification with ML
5 pages
Music Genre Classification For Indian Mu
No ratings yet
Music Genre Classification For Indian Mu
9 pages
Music Genre Classification with ML
No ratings yet
Music Genre Classification with ML
40 pages
Music Genre Classification with ML
No ratings yet
Music Genre Classification with ML
13 pages
Music Genre Classification with KNN
No ratings yet
Music Genre Classification with KNN
11 pages
A Comparative Study of Algorithmic Approaches For Automated Music Genre Classification
No ratings yet
A Comparative Study of Algorithmic Approaches For Automated Music Genre Classification
6 pages
GPT Assistant Training Insights
No ratings yet
GPT Assistant Training Insights
50 pages
General Guidelines For Individual Capstone Projects
No ratings yet
General Guidelines For Individual Capstone Projects
4 pages
Enhanced Heart Disease Diagnosis Model
No ratings yet
Enhanced Heart Disease Diagnosis Model
157 pages
IoT-Based Gas Leakage Detection System
No ratings yet
IoT-Based Gas Leakage Detection System
7 pages
Multi-Epoch Learning With Data Augmentation For Deep Click-Through Rate Prediction
No ratings yet
Multi-Epoch Learning With Data Augmentation For Deep Click-Through Rate Prediction
10 pages
2025-Query Rephrasing For Context Independence in Scene Knowledge-Guided Visual Grounding
No ratings yet
2025-Query Rephrasing For Context Independence in Scene Knowledge-Guided Visual Grounding
10 pages
Modeling Translationese in NMT Systems
No ratings yet
Modeling Translationese in NMT Systems
10 pages
Early CKD Detection with Machine Learning
No ratings yet
Early CKD Detection with Machine Learning
6 pages
DWM Lab Manual Updated Copy SH-25docx (1) - 250713 - 190808
No ratings yet
DWM Lab Manual Updated Copy SH-25docx (1) - 250713 - 190808
46 pages
A Global Inventory of Photovoltaic Solar Energy Generating Units
No ratings yet
A Global Inventory of Photovoltaic Solar Energy Generating Units
8 pages
Report Final
No ratings yet
Report Final
68 pages
1.1 Project Scope: CMRTC 1
No ratings yet
1.1 Project Scope: CMRTC 1
25 pages
Major Project Synopsis
No ratings yet
Major Project Synopsis
14 pages
ML MCQs Set
No ratings yet
ML MCQs Set
18 pages
Leveraging Large Language Models For Optimized Item Categorization Using UNSPSC Taxonomy
No ratings yet
Leveraging Large Language Models For Optimized Item Categorization Using UNSPSC Taxonomy
10 pages
Sign Language Detection Project Report
No ratings yet
Sign Language Detection Project Report
22 pages
Multi-Docker-Eval: A Shovel of The Gold Rush' Benchmark On Automatic Environment Building For Software Engineering
No ratings yet
Multi-Docker-Eval: A Shovel of The Gold Rush' Benchmark On Automatic Environment Building For Software Engineering
15 pages
Introduction to Machine Learning Course
No ratings yet
Introduction to Machine Learning Course
45 pages
PREDICTION AND ANALYSIS OF SOIL MACRONUTRIENTS USING MACHINE LEARNING TECHNIQUES Ijariie23123
No ratings yet
PREDICTION AND ANALYSIS OF SOIL MACRONUTRIENTS USING MACHINE LEARNING TECHNIQUES Ijariie23123
8 pages
IJIKMv 20 Art 004 Nuryani 11039
No ratings yet
IJIKMv 20 Art 004 Nuryani 11039
20 pages
Ensemble ML for Fake News Detection
No ratings yet
Ensemble ML for Fake News Detection
12 pages
Final DWDM
No ratings yet
Final DWDM
27 pages
Class 12 AI Unit-1 MCQ
No ratings yet
Class 12 AI Unit-1 MCQ
9 pages
Crack The Coding Interview
No ratings yet
Crack The Coding Interview
19 pages
Leveraging Infrared Spectroscopy For
No ratings yet
Leveraging Infrared Spectroscopy For
13 pages
Unit 3
No ratings yet
Unit 3
98 pages
Pump Cavitation Detection via ML
No ratings yet
Pump Cavitation Detection via ML
7 pages
001
No ratings yet
001
100 pages
ML Unit-1
No ratings yet
ML Unit-1
64 pages
1 s2.0 S0926580522002618 Main
No ratings yet
1 s2.0 S0926580522002618 Main
17 pages