SlideShare a Scribd company logo
1 of 4
Download to read offline
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 12 | Dec 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 2823
Music Genre Recognition Using Convolution Neural Network
Krishna Mohana A J1, Pramod Kumar P M1, Harivinod N1, Nagaraj K1
1Assistant Professor, Dept. of Computer Science & Engineering,
Vivekananda College of Engineering and Technology Puttur, Karnataka, India
---------------------------------------------------------------------***----------------------------------------------------------------------
Abstract – Music is an ever changing or evolving field. We
see a lot of songs, instrumentals, symphonies and other music
forms releasing every day. All this music is of different genre
moods and themes. Music streaming platformsneedtoclassify
these songs to improve User Experience (UX). We can use
various features (like vocals, instruments) of a song to classify
them into different forms. However, identifying these features
is a challenging task. In order to identify the features of a song
that can appropriately classify them into their respective
genre with reduced human interaction, we go for deep
learning techniques. The proposed project is based on these
techniques. Our project mainly focuses on the basis of
implementing music genre recognition using Convolution
Neural Network (CNN/ConvNet) model which trained using
labeled Mel Spectrogram is obtained from audio dataset.
Key Words: Genre recognition, ConvNet, Labeled Mel
Spectrogram, Deep learning.
1. INTRODUCTION
Music is one of the most fascinating and importantinvention
of the human era. Music, in a generic fashion, can be defined
as a rhythmic combinationof vocalsand/orsoundgenerated
by instruments. Right from the beginning, music hasbeen an
important part of people’s life. It is a doorway to the hearts
of people. Theoretically, music is a form of data which has
evolved to comprise a sophisticated pattern. The type of
music we listen is dependent on the mood we’re in i.e., the
type of music and the mood of the listener are closely
related. For emotions, we call it mood, however for music,
we call it genre. We have different forms of music across the
globe with drastically varying pattern, composed using
variety of musical instruments that may be percussions or
string instruments and unique singing patterns. There are
about 250 recognizable genres in the world of music. Not
every genre is totally different from one another. There are
genres that belong to a same group, however, within that
group, it can be differentiated based of several features. For
example, Rock genre is a broad group which can be further
differentiated into hard rock, progressive rock, soft rock,
punk rock, pop rock, folk rock and so on. Along with that,
there are thousands of songs which are created by
incorporating the musical melody of different genres into
one combined form. Classifying the songs into its respective
genres can be done in various ways. One method is to have
experts of each genre to identify. But this is a non-feasible
method. However, the technological advancement
particularly in the field of Computer Sciencehaveenabledus
to train a machine to work like human (basic ideology of
Machine Learning/Deep Learning) to do different tasks like
high scale computation, classification etc. In the light of that,
in our project, we propose a model that identifies the genre
of the music.
It is a challenging task given that the data we need to classify
is audio, which is a form of analog data. The direct digital
format of the audio file cannot be used to classify the audio.
Therefore, we need to represent the audio clips using more
suitable format. In our project, we use Mel Frequency
Spectrogram representation of the audio input. Mel
Spectrogram mathematically is a short-term power
spectrum of sound, based on a linear cosine transform of a
log power spectrum on a non-linear mel scale of frequency.
Mel Spectrogram has been standardized by the European
Telecommunication Standards Institute (ETSI).
Each genre of music has a recognizable pattern which in mel
spectrogram representation can be accurately identified by
the model. The intensity level of the pixels in mel
spectrogram representation is the deciding factor in
identifying the genre of the input audio file.
2. LITERATURE
Michael Haggblade, Yang Hong and Kenny Kao have
discussed the non-trivial problem of music genre
classification [1]. They compared the performance of 5
different classification techniques based on accuracy,
handling unstructured data and other parameters. It was
observed that among the 5 techniques namely Kullback-
Leiber (KL) divergence,k-NearestNeighbor(k-NN),k-Means,
Directed Acyclic Graph Support Vector Machine (DAG SVM)
and Artificial Neural Network (ANN), ANN was the best
technique. Even though k-NN technique had an accuracy of
96%, the accuracy dropped when the number of genres tobe
classified increased and there were difficulties in handling
the unstructured data format of the music.
Audio clip is an analog data form. In order to perform
classification of audio files, theaudiofilemustberepresented
in a digital format. Haggblade used Mel Frequency Cepstral
Coefficients (MFCC) to represent audio file.
Sharaj Panwar, Arun Das, Mehdi Roopaei,PaulRaddiscussed
the deep learning approach for mapping music genre [2].
Deep feature learning methods have been aggressively
applied in the field of music tagging retrieval. Genre
categorization, mood classification, and chord detection are
the most common tags from local spectral to temporal
structure. Convolutional Neural networks (CNNs) using
kernels extract the local features thatareindifferentlevelsof
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 12 | Dec 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 2824
hierarchy. CRNN architectures as a powerful music tagging.
The AUC-ROC index for the proposed architecture is 0.893
which shows itssuperiorityratherthantraditionalstructures
on the same database.
Narek Abroyan suggested Convolution Neural Network is
viable option forreal timeclassification problems[3].During
the last several years, artificial intelligence researchers and
specialists achieved notable results in visual, voice and
natural language processing tasks byusingnewmethodsand
approaches of deep learning, such as convolutional neural
networks. However, not much research is going on of usage
such networks for elaboration of real-time data. Abroyan
suggested that ConvNets can be trained and used for real-
time data classification, opening doors to a wide range of
areas where it can find applications.
Convolution Neural Network has been suggested as a viable
technique whenever the audio input comes into picture. Che
Wei Huang et al. used the ConvNet along with recurrent
neural network on speech audio clips to identify the emotion
of the speaker [4]. The method of discriminative feature
learning employed by convolution layers uses the mel
frequency spectrogram representation of audio clips and
identifies the emotion of the speaker.
Wootaek Lim et al. also proposed that ConvNet is the best
technique for speech emotion recognition [5]. He suggested
that even though at simpler form, traditional low-level
descriptors have high accuracy, these descriptors to be
trained with complex unstructured data and extracting good
features is a difficult task to accomplish.
3. METHODOLOGY
In order to identify the genre of a song, first we need to build
a ConvNet and train the same with labeled audiodataset. We
employ supervised learning method for the training of the
ConvNet. In our project, we considered10well known music
genres. For each genre, we consider about 100 audio files
belonging to that genre for the training. Fig 1 shows the first
phase of the project: building dataset. There are many audio
datasets available in the Internet like GTZAN, FMA etc.,
however we chose to build our own dataset.
Fig -1: Build Dataset
The built dataset is then used to train the ConvNet model
required for future genre recognition which is shown in Fig
2. We developed our project using Python machine learning
libraries. Python provides the necessary tools for Mel
spectrogram generation, building of ConvNet.
Fig -2: Train ConvNet Model
Fig 3 represents the trained model deployed for genre
recognition. The trained model uses its training experience
on the Mel spectrogram of test audio file and analyses it.
Convolution layers of the ConvNet map the test audio file to
the corresponding genre of the music.
Fig -3: Classify Music
4. ALGORITHM
In our project, we have used Convolution Neural Network
(CNN/ConvNet) model for the genre classification.
Convolution Neural Network is a deep learning algorithm
which is mainly used to differentiate images based on their
spatial features. The preprocessing required in a ConvNet is
much lower as compared to other classification algorithms.
ConvNet is able to successfully capture the spatial and
temporal characteristics in an image. The basic architecture
of a ConvNet consists of Inputlayer,Convolutionandpooling
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 12 | Dec 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 2825
layers, Fully Connected Activation layer. Convolution layer
comprises of various filters or kernels that aid in extracting
the major features of an image. The main objective is to
extract the high-level features such as edges and intensity,
from an input image. There can be numerous convolution
layers where the first filter captures low level features like
edges, color, gradient,orientationetc.,andsubsequentlayers
capture higher level features.
Pooling layer is responsible for reducing the spatial size of
the convolved feature in order to decrease the amount of
computational power required to process the data through
dimensionality reduction.Furthermore,itisusein extracting
dominant features of the input. There are two types of
pooling namely Max Pooling and Average pooling.
Conventionally Max pooling is preferred over Average
pooling because Max pooling essentially performs Noise
Suppression, which is a positive feature.
Fully Connected Activation layer is usually a cheap way of
learning non-linear combinations of the high-level features
as represented by the output of the convolution layers. Over
a series of epochs, the model is able to distinguish between
dominating and certain low-level features in the image and
classify them using the Softmax Classification technique.
Softmax function or Normalized exponential function is a
function that takes as input a vector of K real numbers, and
normalizes it into a probability distribution consisting of K
probabilities.
Fig -4: Convolution Neural Network Architecture
5. EXPERIMENTAL RESULT
Mel spectrogram is generated at both training phase as well
as the testing phase. In training phase, the Mel spectrogram
gives an insight to the model about the level of intensity
corresponding to that genre of the music. In testing phase,
Mel spectrogram of the test input enables the model to
classify it into corresponding genre.Notethateachgenre has
a specific level of intensity output in the spectrogram. This
feature is exploited to classify the music.
Fig 5 display the output of the model as observed in the web
interface built. The inputgivenwasthewell-knownpopsong
by Adele, “Someone Like You”. The corresponding output
shows Pop (green) as the major genre along with hip hop
and reggae as other genres.
Fig 6 shows the output of the model for a famous reggae
song by Bob Marley, “The Redemption Song”. The major
output shown is reggae with a minor mix of pop, hiphopand
blues.
Fig 7 provides the Mel spectrogram output for the pop song
previously mentioned
Fig -5: Screenshot of the output obtained for a pop song
Fig -6: Screenshot of the output obtained for a reggae song
Fig -7: Screenshot of the Mel spectrogram for pop song
6. CONCLUSIONS
Genre Recognition is interesting feature in multimedia
applications like music player, streaming applications etc.
The project developed is completely software oriented and
can be run in any PC with optimum hardware. We have
implemented a Convolution Neural Network which was
trained with spectrogram of audio files and the trained
model was deployed in a web interface. Despite there being
numerous classification algorithms and techniques, the
Convolution Neural Network has a better performance over
other mechanism.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 12 | Dec 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 2826
REFERENCES
[1] R. Islam, Mingxing Xu and Yuchao Fan, “Chinese
Traditional Opera database for Music Genre
Recognition”, 2015 International Conference Oriental
COCOSDA held jointly with 2015 Conference on Asian
Spoken Language Research and Evaluation (O-
COCOSDA/CASLRE), 2015, pp. 38-41, doi:
10.1109/ICSDA.2015.7357861.
[2] Sharaj Panwar, Arun Das, Mehdi Roopaei, Paul Rad, “A
deep learningapproachfor mappingmusic genres”,12th
System of Systems Engineering Conference (SoSE),
2017, pp. 1-5, doi: 10.1109/SYSOSE.2017.7994970.
[3] Narek Abroyan, “Convolutional and recurrent neural
networks for real-time data classification”, 2017
Seventh International Conference on Innovative
Computing Technology (INTECH), 2017, pp. 42-45, doi:
10.1109/INTECH.2017.8102422.
[4] Che-Wei Huang, Shrikanth Shri Narayanan, “Deep
convolutional recurrent neural network with attention
mechanism for robust speech emotionrecognition”,july
2017 IEEE International Conference on Multimedia and
Expo (ICME), 2017, pp. 538-588, doi:
10.1109/ICME.2017.8019296.
[5] Wootaek Lim, Daeyoung Jang and Taejin Lee, “Speech
Emotion RecognitionusingConvolutional andRecurrent
Neural Networks”, In 2016 Annual Conference of the
International Speech CommunicationAssociation,2016,
pp. 1-4.

More Related Content

What's hot

IRJET- Implementation of Emotion based Music Recommendation System using SVM ...
IRJET- Implementation of Emotion based Music Recommendation System using SVM ...IRJET- Implementation of Emotion based Music Recommendation System using SVM ...
IRJET- Implementation of Emotion based Music Recommendation System using SVM ...IRJET Journal
 
Music genre prediction
Music genre predictionMusic genre prediction
Music genre predictionAnusha Chavva
 
Culture-aware Music Recommendation
Culture-aware Music  RecommendationCulture-aware Music  Recommendation
Culture-aware Music RecommendationDaniel Wolff
 
IRJET- Implementing Musical Instrument Recognition using CNN and SVM
IRJET- Implementing Musical Instrument Recognition using CNN and SVMIRJET- Implementing Musical Instrument Recognition using CNN and SVM
IRJET- Implementing Musical Instrument Recognition using CNN and SVMIRJET Journal
 
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTSEFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTSijnlc
 
Teaching presentation
Teaching presentationTeaching presentation
Teaching presentationChoo Yang
 
Malayalam Isolated Digit Recognition using HMM and PLP cepstral coefficient
Malayalam Isolated Digit Recognition using HMM and PLP cepstral coefficientMalayalam Isolated Digit Recognition using HMM and PLP cepstral coefficient
Malayalam Isolated Digit Recognition using HMM and PLP cepstral coefficientijait
 
Multidimensional Patterns of Disturbance in Digital Social Networks
Multidimensional Patterns of Disturbance in Digital Social NetworksMultidimensional Patterns of Disturbance in Digital Social Networks
Multidimensional Patterns of Disturbance in Digital Social NetworksDimitar Denev
 
A novel speech enhancement technique
A novel speech enhancement techniqueA novel speech enhancement technique
A novel speech enhancement techniqueeSAT Publishing House
 
Deep Learning Meetup #5
Deep Learning Meetup #5Deep Learning Meetup #5
Deep Learning Meetup #5Aloïs Gruson
 
Identification of frequency domain using quantum based optimization neural ne...
Identification of frequency domain using quantum based optimization neural ne...Identification of frequency domain using quantum based optimization neural ne...
Identification of frequency domain using quantum based optimization neural ne...eSAT Publishing House
 
RecSys 2008: Social Ranking
RecSys 2008: Social RankingRecSys 2008: Social Ranking
RecSys 2008: Social RankingUCL-CS MobiSys
 

What's hot (14)

IRJET- Implementation of Emotion based Music Recommendation System using SVM ...
IRJET- Implementation of Emotion based Music Recommendation System using SVM ...IRJET- Implementation of Emotion based Music Recommendation System using SVM ...
IRJET- Implementation of Emotion based Music Recommendation System using SVM ...
 
1801 1805
1801 18051801 1805
1801 1805
 
Music genre prediction
Music genre predictionMusic genre prediction
Music genre prediction
 
Culture-aware Music Recommendation
Culture-aware Music  RecommendationCulture-aware Music  Recommendation
Culture-aware Music Recommendation
 
IRJET- Implementing Musical Instrument Recognition using CNN and SVM
IRJET- Implementing Musical Instrument Recognition using CNN and SVMIRJET- Implementing Musical Instrument Recognition using CNN and SVM
IRJET- Implementing Musical Instrument Recognition using CNN and SVM
 
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTSEFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
 
PHD_Sam_Karimian_Azari_E
PHD_Sam_Karimian_Azari_EPHD_Sam_Karimian_Azari_E
PHD_Sam_Karimian_Azari_E
 
Teaching presentation
Teaching presentationTeaching presentation
Teaching presentation
 
Malayalam Isolated Digit Recognition using HMM and PLP cepstral coefficient
Malayalam Isolated Digit Recognition using HMM and PLP cepstral coefficientMalayalam Isolated Digit Recognition using HMM and PLP cepstral coefficient
Malayalam Isolated Digit Recognition using HMM and PLP cepstral coefficient
 
Multidimensional Patterns of Disturbance in Digital Social Networks
Multidimensional Patterns of Disturbance in Digital Social NetworksMultidimensional Patterns of Disturbance in Digital Social Networks
Multidimensional Patterns of Disturbance in Digital Social Networks
 
A novel speech enhancement technique
A novel speech enhancement techniqueA novel speech enhancement technique
A novel speech enhancement technique
 
Deep Learning Meetup #5
Deep Learning Meetup #5Deep Learning Meetup #5
Deep Learning Meetup #5
 
Identification of frequency domain using quantum based optimization neural ne...
Identification of frequency domain using quantum based optimization neural ne...Identification of frequency domain using quantum based optimization neural ne...
Identification of frequency domain using quantum based optimization neural ne...
 
RecSys 2008: Social Ranking
RecSys 2008: Social RankingRecSys 2008: Social Ranking
RecSys 2008: Social Ranking
 

Similar to IRJET- Music Genre Recognition using Convolution Neural Network

Automatic Music Generation Using Deep Learning
Automatic Music Generation Using Deep LearningAutomatic Music Generation Using Deep Learning
Automatic Music Generation Using Deep LearningIRJET Journal
 
Extraction and Conversion of Vocals
Extraction and Conversion of VocalsExtraction and Conversion of Vocals
Extraction and Conversion of VocalsIRJET Journal
 
IRJET- A Survey on Sound Recognition
IRJET- A Survey on Sound RecognitionIRJET- A Survey on Sound Recognition
IRJET- A Survey on Sound RecognitionIRJET Journal
 
Recognition of music genres using deep learning.
Recognition of music genres using deep learning.Recognition of music genres using deep learning.
Recognition of music genres using deep learning.IRJET Journal
 
Nithin Xavier research_proposal
Nithin Xavier research_proposalNithin Xavier research_proposal
Nithin Xavier research_proposalNithin Xavier
 
CONTENT BASED AUDIO CLASSIFIER & FEATURE EXTRACTION USING ANN TECNIQUES
CONTENT BASED AUDIO CLASSIFIER & FEATURE EXTRACTION USING ANN TECNIQUESCONTENT BASED AUDIO CLASSIFIER & FEATURE EXTRACTION USING ANN TECNIQUES
CONTENT BASED AUDIO CLASSIFIER & FEATURE EXTRACTION USING ANN TECNIQUESAM Publications
 
Anvita Eusipco 2004
Anvita Eusipco 2004Anvita Eusipco 2004
Anvita Eusipco 2004guest6e7a1b1
 
IRJET- Machine Learning and Noise Reduction Techniques for Music Genre Classi...
IRJET- Machine Learning and Noise Reduction Techniques for Music Genre Classi...IRJET- Machine Learning and Noise Reduction Techniques for Music Genre Classi...
IRJET- Machine Learning and Noise Reduction Techniques for Music Genre Classi...IRJET Journal
 
AI THROUGH THE EYES OF ORGANISE SOUND
AI THROUGH THE EYES OF ORGANISE SOUNDAI THROUGH THE EYES OF ORGANISE SOUND
AI THROUGH THE EYES OF ORGANISE SOUNDJaideep Ghosh
 
Audio Classification using Artificial Neural Network with Denoising Algorithm...
Audio Classification using Artificial Neural Network with Denoising Algorithm...Audio Classification using Artificial Neural Network with Denoising Algorithm...
Audio Classification using Artificial Neural Network with Denoising Algorithm...IRJET Journal
 
Application of Recurrent Neural Networks paired with LSTM - Music Generation
Application of Recurrent Neural Networks paired with LSTM - Music GenerationApplication of Recurrent Neural Networks paired with LSTM - Music Generation
Application of Recurrent Neural Networks paired with LSTM - Music GenerationIRJET Journal
 
IRJET- Musical Instrument Recognition using CNN and SVM
IRJET-  	  Musical Instrument Recognition using CNN and SVMIRJET-  	  Musical Instrument Recognition using CNN and SVM
IRJET- Musical Instrument Recognition using CNN and SVMIRJET Journal
 
Literature Survey for Music Genre Classification Using Neural Network
Literature Survey for Music Genre Classification Using Neural NetworkLiterature Survey for Music Genre Classification Using Neural Network
Literature Survey for Music Genre Classification Using Neural NetworkIRJET Journal
 
IRJET - Audio Emotion Analysis
IRJET - Audio Emotion AnalysisIRJET - Audio Emotion Analysis
IRJET - Audio Emotion AnalysisIRJET Journal
 
Ijarcet vol-2-issue-4-1347-1351
Ijarcet vol-2-issue-4-1347-1351Ijarcet vol-2-issue-4-1347-1351
Ijarcet vol-2-issue-4-1347-1351Editor IJARCET
 
2012 a rebeloijmir
2012 a rebeloijmir2012 a rebeloijmir
2012 a rebeloijmirMiguel Ponce
 
Applsci 08-00606-v3
Applsci 08-00606-v3Applsci 08-00606-v3
Applsci 08-00606-v3IsraelEbonko
 

Similar to IRJET- Music Genre Recognition using Convolution Neural Network (20)

Automatic Music Generation Using Deep Learning
Automatic Music Generation Using Deep LearningAutomatic Music Generation Using Deep Learning
Automatic Music Generation Using Deep Learning
 
Extraction and Conversion of Vocals
Extraction and Conversion of VocalsExtraction and Conversion of Vocals
Extraction and Conversion of Vocals
 
IRJET- A Survey on Sound Recognition
IRJET- A Survey on Sound RecognitionIRJET- A Survey on Sound Recognition
IRJET- A Survey on Sound Recognition
 
T26123129
T26123129T26123129
T26123129
 
Recognition of music genres using deep learning.
Recognition of music genres using deep learning.Recognition of music genres using deep learning.
Recognition of music genres using deep learning.
 
Nithin Xavier research_proposal
Nithin Xavier research_proposalNithin Xavier research_proposal
Nithin Xavier research_proposal
 
CONTENT BASED AUDIO CLASSIFIER & FEATURE EXTRACTION USING ANN TECNIQUES
CONTENT BASED AUDIO CLASSIFIER & FEATURE EXTRACTION USING ANN TECNIQUESCONTENT BASED AUDIO CLASSIFIER & FEATURE EXTRACTION USING ANN TECNIQUES
CONTENT BASED AUDIO CLASSIFIER & FEATURE EXTRACTION USING ANN TECNIQUES
 
Anvita Eusipco 2004
Anvita Eusipco 2004Anvita Eusipco 2004
Anvita Eusipco 2004
 
Anvita Eusipco 2004
Anvita Eusipco 2004Anvita Eusipco 2004
Anvita Eusipco 2004
 
IRJET- Machine Learning and Noise Reduction Techniques for Music Genre Classi...
IRJET- Machine Learning and Noise Reduction Techniques for Music Genre Classi...IRJET- Machine Learning and Noise Reduction Techniques for Music Genre Classi...
IRJET- Machine Learning and Noise Reduction Techniques for Music Genre Classi...
 
AI THROUGH THE EYES OF ORGANISE SOUND
AI THROUGH THE EYES OF ORGANISE SOUNDAI THROUGH THE EYES OF ORGANISE SOUND
AI THROUGH THE EYES OF ORGANISE SOUND
 
Audio Classification using Artificial Neural Network with Denoising Algorithm...
Audio Classification using Artificial Neural Network with Denoising Algorithm...Audio Classification using Artificial Neural Network with Denoising Algorithm...
Audio Classification using Artificial Neural Network with Denoising Algorithm...
 
Application of Recurrent Neural Networks paired with LSTM - Music Generation
Application of Recurrent Neural Networks paired with LSTM - Music GenerationApplication of Recurrent Neural Networks paired with LSTM - Music Generation
Application of Recurrent Neural Networks paired with LSTM - Music Generation
 
IRJET- Musical Instrument Recognition using CNN and SVM
IRJET-  	  Musical Instrument Recognition using CNN and SVMIRJET-  	  Musical Instrument Recognition using CNN and SVM
IRJET- Musical Instrument Recognition using CNN and SVM
 
1801 1805
1801 18051801 1805
1801 1805
 
Literature Survey for Music Genre Classification Using Neural Network
Literature Survey for Music Genre Classification Using Neural NetworkLiterature Survey for Music Genre Classification Using Neural Network
Literature Survey for Music Genre Classification Using Neural Network
 
IRJET - Audio Emotion Analysis
IRJET - Audio Emotion AnalysisIRJET - Audio Emotion Analysis
IRJET - Audio Emotion Analysis
 
Ijarcet vol-2-issue-4-1347-1351
Ijarcet vol-2-issue-4-1347-1351Ijarcet vol-2-issue-4-1347-1351
Ijarcet vol-2-issue-4-1347-1351
 
2012 a rebeloijmir
2012 a rebeloijmir2012 a rebeloijmir
2012 a rebeloijmir
 
Applsci 08-00606-v3
Applsci 08-00606-v3Applsci 08-00606-v3
Applsci 08-00606-v3
 

More from IRJET Journal

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...IRJET Journal
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTUREIRJET Journal
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...IRJET Journal
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsIRJET Journal
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...IRJET Journal
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...IRJET Journal
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...IRJET Journal
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...IRJET Journal
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASIRJET Journal
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...IRJET Journal
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProIRJET Journal
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...IRJET Journal
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemIRJET Journal
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesIRJET Journal
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web applicationIRJET Journal
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...IRJET Journal
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.IRJET Journal
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...IRJET Journal
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignIRJET Journal
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...IRJET Journal
 

More from IRJET Journal (20)

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
 

Recently uploaded

APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024Mark Billinghurst
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
 

Recently uploaded (20)

APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
 
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 

IRJET- Music Genre Recognition using Convolution Neural Network

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 12 | Dec 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 2823 Music Genre Recognition Using Convolution Neural Network Krishna Mohana A J1, Pramod Kumar P M1, Harivinod N1, Nagaraj K1 1Assistant Professor, Dept. of Computer Science & Engineering, Vivekananda College of Engineering and Technology Puttur, Karnataka, India ---------------------------------------------------------------------***---------------------------------------------------------------------- Abstract – Music is an ever changing or evolving field. We see a lot of songs, instrumentals, symphonies and other music forms releasing every day. All this music is of different genre moods and themes. Music streaming platformsneedtoclassify these songs to improve User Experience (UX). We can use various features (like vocals, instruments) of a song to classify them into different forms. However, identifying these features is a challenging task. In order to identify the features of a song that can appropriately classify them into their respective genre with reduced human interaction, we go for deep learning techniques. The proposed project is based on these techniques. Our project mainly focuses on the basis of implementing music genre recognition using Convolution Neural Network (CNN/ConvNet) model which trained using labeled Mel Spectrogram is obtained from audio dataset. Key Words: Genre recognition, ConvNet, Labeled Mel Spectrogram, Deep learning. 1. INTRODUCTION Music is one of the most fascinating and importantinvention of the human era. Music, in a generic fashion, can be defined as a rhythmic combinationof vocalsand/orsoundgenerated by instruments. Right from the beginning, music hasbeen an important part of people’s life. It is a doorway to the hearts of people. Theoretically, music is a form of data which has evolved to comprise a sophisticated pattern. The type of music we listen is dependent on the mood we’re in i.e., the type of music and the mood of the listener are closely related. For emotions, we call it mood, however for music, we call it genre. We have different forms of music across the globe with drastically varying pattern, composed using variety of musical instruments that may be percussions or string instruments and unique singing patterns. There are about 250 recognizable genres in the world of music. Not every genre is totally different from one another. There are genres that belong to a same group, however, within that group, it can be differentiated based of several features. For example, Rock genre is a broad group which can be further differentiated into hard rock, progressive rock, soft rock, punk rock, pop rock, folk rock and so on. Along with that, there are thousands of songs which are created by incorporating the musical melody of different genres into one combined form. Classifying the songs into its respective genres can be done in various ways. One method is to have experts of each genre to identify. But this is a non-feasible method. However, the technological advancement particularly in the field of Computer Sciencehaveenabledus to train a machine to work like human (basic ideology of Machine Learning/Deep Learning) to do different tasks like high scale computation, classification etc. In the light of that, in our project, we propose a model that identifies the genre of the music. It is a challenging task given that the data we need to classify is audio, which is a form of analog data. The direct digital format of the audio file cannot be used to classify the audio. Therefore, we need to represent the audio clips using more suitable format. In our project, we use Mel Frequency Spectrogram representation of the audio input. Mel Spectrogram mathematically is a short-term power spectrum of sound, based on a linear cosine transform of a log power spectrum on a non-linear mel scale of frequency. Mel Spectrogram has been standardized by the European Telecommunication Standards Institute (ETSI). Each genre of music has a recognizable pattern which in mel spectrogram representation can be accurately identified by the model. The intensity level of the pixels in mel spectrogram representation is the deciding factor in identifying the genre of the input audio file. 2. LITERATURE Michael Haggblade, Yang Hong and Kenny Kao have discussed the non-trivial problem of music genre classification [1]. They compared the performance of 5 different classification techniques based on accuracy, handling unstructured data and other parameters. It was observed that among the 5 techniques namely Kullback- Leiber (KL) divergence,k-NearestNeighbor(k-NN),k-Means, Directed Acyclic Graph Support Vector Machine (DAG SVM) and Artificial Neural Network (ANN), ANN was the best technique. Even though k-NN technique had an accuracy of 96%, the accuracy dropped when the number of genres tobe classified increased and there were difficulties in handling the unstructured data format of the music. Audio clip is an analog data form. In order to perform classification of audio files, theaudiofilemustberepresented in a digital format. Haggblade used Mel Frequency Cepstral Coefficients (MFCC) to represent audio file. Sharaj Panwar, Arun Das, Mehdi Roopaei,PaulRaddiscussed the deep learning approach for mapping music genre [2]. Deep feature learning methods have been aggressively applied in the field of music tagging retrieval. Genre categorization, mood classification, and chord detection are the most common tags from local spectral to temporal structure. Convolutional Neural networks (CNNs) using kernels extract the local features thatareindifferentlevelsof
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 12 | Dec 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 2824 hierarchy. CRNN architectures as a powerful music tagging. The AUC-ROC index for the proposed architecture is 0.893 which shows itssuperiorityratherthantraditionalstructures on the same database. Narek Abroyan suggested Convolution Neural Network is viable option forreal timeclassification problems[3].During the last several years, artificial intelligence researchers and specialists achieved notable results in visual, voice and natural language processing tasks byusingnewmethodsand approaches of deep learning, such as convolutional neural networks. However, not much research is going on of usage such networks for elaboration of real-time data. Abroyan suggested that ConvNets can be trained and used for real- time data classification, opening doors to a wide range of areas where it can find applications. Convolution Neural Network has been suggested as a viable technique whenever the audio input comes into picture. Che Wei Huang et al. used the ConvNet along with recurrent neural network on speech audio clips to identify the emotion of the speaker [4]. The method of discriminative feature learning employed by convolution layers uses the mel frequency spectrogram representation of audio clips and identifies the emotion of the speaker. Wootaek Lim et al. also proposed that ConvNet is the best technique for speech emotion recognition [5]. He suggested that even though at simpler form, traditional low-level descriptors have high accuracy, these descriptors to be trained with complex unstructured data and extracting good features is a difficult task to accomplish. 3. METHODOLOGY In order to identify the genre of a song, first we need to build a ConvNet and train the same with labeled audiodataset. We employ supervised learning method for the training of the ConvNet. In our project, we considered10well known music genres. For each genre, we consider about 100 audio files belonging to that genre for the training. Fig 1 shows the first phase of the project: building dataset. There are many audio datasets available in the Internet like GTZAN, FMA etc., however we chose to build our own dataset. Fig -1: Build Dataset The built dataset is then used to train the ConvNet model required for future genre recognition which is shown in Fig 2. We developed our project using Python machine learning libraries. Python provides the necessary tools for Mel spectrogram generation, building of ConvNet. Fig -2: Train ConvNet Model Fig 3 represents the trained model deployed for genre recognition. The trained model uses its training experience on the Mel spectrogram of test audio file and analyses it. Convolution layers of the ConvNet map the test audio file to the corresponding genre of the music. Fig -3: Classify Music 4. ALGORITHM In our project, we have used Convolution Neural Network (CNN/ConvNet) model for the genre classification. Convolution Neural Network is a deep learning algorithm which is mainly used to differentiate images based on their spatial features. The preprocessing required in a ConvNet is much lower as compared to other classification algorithms. ConvNet is able to successfully capture the spatial and temporal characteristics in an image. The basic architecture of a ConvNet consists of Inputlayer,Convolutionandpooling
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 12 | Dec 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 2825 layers, Fully Connected Activation layer. Convolution layer comprises of various filters or kernels that aid in extracting the major features of an image. The main objective is to extract the high-level features such as edges and intensity, from an input image. There can be numerous convolution layers where the first filter captures low level features like edges, color, gradient,orientationetc.,andsubsequentlayers capture higher level features. Pooling layer is responsible for reducing the spatial size of the convolved feature in order to decrease the amount of computational power required to process the data through dimensionality reduction.Furthermore,itisusein extracting dominant features of the input. There are two types of pooling namely Max Pooling and Average pooling. Conventionally Max pooling is preferred over Average pooling because Max pooling essentially performs Noise Suppression, which is a positive feature. Fully Connected Activation layer is usually a cheap way of learning non-linear combinations of the high-level features as represented by the output of the convolution layers. Over a series of epochs, the model is able to distinguish between dominating and certain low-level features in the image and classify them using the Softmax Classification technique. Softmax function or Normalized exponential function is a function that takes as input a vector of K real numbers, and normalizes it into a probability distribution consisting of K probabilities. Fig -4: Convolution Neural Network Architecture 5. EXPERIMENTAL RESULT Mel spectrogram is generated at both training phase as well as the testing phase. In training phase, the Mel spectrogram gives an insight to the model about the level of intensity corresponding to that genre of the music. In testing phase, Mel spectrogram of the test input enables the model to classify it into corresponding genre.Notethateachgenre has a specific level of intensity output in the spectrogram. This feature is exploited to classify the music. Fig 5 display the output of the model as observed in the web interface built. The inputgivenwasthewell-knownpopsong by Adele, “Someone Like You”. The corresponding output shows Pop (green) as the major genre along with hip hop and reggae as other genres. Fig 6 shows the output of the model for a famous reggae song by Bob Marley, “The Redemption Song”. The major output shown is reggae with a minor mix of pop, hiphopand blues. Fig 7 provides the Mel spectrogram output for the pop song previously mentioned Fig -5: Screenshot of the output obtained for a pop song Fig -6: Screenshot of the output obtained for a reggae song Fig -7: Screenshot of the Mel spectrogram for pop song 6. CONCLUSIONS Genre Recognition is interesting feature in multimedia applications like music player, streaming applications etc. The project developed is completely software oriented and can be run in any PC with optimum hardware. We have implemented a Convolution Neural Network which was trained with spectrogram of audio files and the trained model was deployed in a web interface. Despite there being numerous classification algorithms and techniques, the Convolution Neural Network has a better performance over other mechanism.
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 12 | Dec 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 2826 REFERENCES [1] R. Islam, Mingxing Xu and Yuchao Fan, “Chinese Traditional Opera database for Music Genre Recognition”, 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O- COCOSDA/CASLRE), 2015, pp. 38-41, doi: 10.1109/ICSDA.2015.7357861. [2] Sharaj Panwar, Arun Das, Mehdi Roopaei, Paul Rad, “A deep learningapproachfor mappingmusic genres”,12th System of Systems Engineering Conference (SoSE), 2017, pp. 1-5, doi: 10.1109/SYSOSE.2017.7994970. [3] Narek Abroyan, “Convolutional and recurrent neural networks for real-time data classification”, 2017 Seventh International Conference on Innovative Computing Technology (INTECH), 2017, pp. 42-45, doi: 10.1109/INTECH.2017.8102422. [4] Che-Wei Huang, Shrikanth Shri Narayanan, “Deep convolutional recurrent neural network with attention mechanism for robust speech emotionrecognition”,july 2017 IEEE International Conference on Multimedia and Expo (ICME), 2017, pp. 538-588, doi: 10.1109/ICME.2017.8019296. [5] Wootaek Lim, Daeyoung Jang and Taejin Lee, “Speech Emotion RecognitionusingConvolutional andRecurrent Neural Networks”, In 2016 Annual Conference of the International Speech CommunicationAssociation,2016, pp. 1-4.