SlideShare a Scribd company logo
1 of 26
*Automated genre classification is the process
by which a musical piece is associated to a
genre to allow users to search, browse, and
organize their music catalogues; through
machine learning and advanced algorithms.
*In simple terms, your songs are sorted,
according to genre, without any intervention
or effort on your part.
*FEATURE EXTRACTION
*CLASSIFICATION
*Each digital audio file has some features.
These are extracted for the purpose of genre
identification.
*These features can be classified into three
categories, namely, timbre, pitch and rhythm.
*Timbre – the quality that distinguishes
different types of sound production, such as
voices and musical instruments, string
instruments, wind instruments, and percussion
instruments.
*Pitch – the perception-based quality that
allows ordering of sound on a frequency-
related scale.
*Rhythm – the timing of musical sounds and
silences on a human scale.
* A list of features
of audio file
*Some formulae and procedures used to calculate
features
*Zero Crossings Rate
for (int samp = 0; samp < samples.length - 1; samp++)
{
if (samples[samp] > 0.0 && samples[samp + 1] < 0.0)
count++;
else if (samples[samp] < 0.0 && samples[samp + 1] > 0.0)
count++;
else if (samples[samp] == 0.0 && samples[samp + 1] != 0.0)
count++;
}
*Beat Sum
double sum = 0.0;
for (int i = 0; i < beat_histogram.length; i++)
sum += beat_histogram[i];
double[] result = new double[1];
result[0] = sum;
return result;
*Strongest Frequency Via Zero Crossings
result = (zero_crossings / 2.0) * (sampling_rate / (double) samples.length)
*The above extracted features are then used to
identify genre using one or more clustering
algorithms.
*Many approaches are used for the above,
including Unsupervised and Supervised
approach.
*Unsupervised Approaches have no
knowledge about genres. Classifier can
observe the data position in the feature
space, but do not know what the genre
cluster of the data is.
*Unsupervised classifiers:
K-means, Agglomerative hierarchical clustering,
Self-organizing Map (SOM), Growing hierarchical Self-
organizing Map (GHSOM).
*In Supervised Approaches, the system is
trained by manually labeling the data at
first, then, when unlabeled data (new
coming data) comes, the trained system
is used to classify it into a known genre.
*Supervised classifiers:
K-nearest neighbor (KNN), Gaussian Mixture Model
(GMM), Linear Discriminant Analysis (LDA), Support Vector
Machines (SVMs), Artificial Neural Networks (ANNs).
*A fuzzy inference system is implemented.
*It is a supervised classifier.
*Rules are manually created.
*The rules are, then, implemented on two
feature sets, and the output evaluated.
*Feature set 1 = (Zero Crossings, Beat Sum,
Strongest Frequency)
*Feature set 2 =(MFCC)
*Classification results
Accuracy Hits Ratio
Feature Set 1 (ZCR + BS +
SF)
85.0% 65.38%
Feature Set 2 (MFCC) 72.5% 65.9%
*The “front-end” of my project.
*The Music Matrix is a NxN matrix where each
cell represents a list of song(s) which are
placed in one or more genres, in a fuzzy
manner.
*This system clearly demonstrates multi-label
songs.
*For example, choosing a cell in the following
matrix may cause a list of songs to be played,
that are 60%-70% classic, and 10%-15% pop.
*Huge size of genre (and sub-genre) list.
*Non-Agreement on Taxonomies – Well-known
websites like Allmusic (http://www.allmusic.com—
531genres), Amazon (http://www.amazon.com—719
genres), and Mp3 (http://www.mp3.com—430
genres).
*Trivialization of music art.
*Classification Basis
*Fuzzy definition of genres
*Differences in human perception
*Scalability of any AMC system
*Automated Genre Classification is a non-trivial
task.
*Emotion and music-matching is subjective.
*The problems of genre taxonomy are carried
onto Automated Genre Classification.
*Extraction of all features of an audio file is not only
unnecessary, but also counterproductive.
*Different combinations of extracted features and
various classification algorithms yield different
results, of different accuracy.
*A combination of low-level signal properties such as
zero-crossing rate, spectral centroid and skewness,
mean energy, etc. and perception-based features
such as MFCCs, beat histograms, etc. may be the
most appropriate set.
*Multi-label classification is the most
appropriate for real world.
*A fuzzy classification algorithm must be used to
allow for multi-label songs.
*A lot of novelty functions have been created,
but, sadly, they return results of lesser
accuracy.
*Practices used for Automated Genre
Classification can also be used to sieve similar
songs. It may help in copyright and IPR
protection.
Ref: http://www.thatsongsoundslike.com/
Music Matrix - A Fuzzy Automated Genre Classification

More Related Content

Recently uploaded

call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYKayeClaireEstoconing
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptxSherlyMaeNeri
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxDr.Ibrahim Hassaan
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSJoshuaGantuangco2
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Celine George
 
Q4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxQ4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxnelietumpap1
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...Nguyen Thanh Tu Collection
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomnelietumpap1
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceSamikshaHamane
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 

Recently uploaded (20)

call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptx
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptx
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Q4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxQ4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptx
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choom
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 

Featured

PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at WorkGetSmarter
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...DevGAMM Conference
 
Barbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationBarbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationErica Santiago
 

Featured (20)

PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
 
Barbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationBarbie - Brand Strategy Presentation
Barbie - Brand Strategy Presentation
 

Music Matrix - A Fuzzy Automated Genre Classification

  • 1.
  • 2.
  • 3. *Automated genre classification is the process by which a musical piece is associated to a genre to allow users to search, browse, and organize their music catalogues; through machine learning and advanced algorithms. *In simple terms, your songs are sorted, according to genre, without any intervention or effort on your part.
  • 5.
  • 6. *Each digital audio file has some features. These are extracted for the purpose of genre identification. *These features can be classified into three categories, namely, timbre, pitch and rhythm.
  • 7. *Timbre – the quality that distinguishes different types of sound production, such as voices and musical instruments, string instruments, wind instruments, and percussion instruments. *Pitch – the perception-based quality that allows ordering of sound on a frequency- related scale. *Rhythm – the timing of musical sounds and silences on a human scale.
  • 8. * A list of features of audio file
  • 9. *Some formulae and procedures used to calculate features *Zero Crossings Rate for (int samp = 0; samp < samples.length - 1; samp++) { if (samples[samp] > 0.0 && samples[samp + 1] < 0.0) count++; else if (samples[samp] < 0.0 && samples[samp + 1] > 0.0) count++; else if (samples[samp] == 0.0 && samples[samp + 1] != 0.0) count++; }
  • 10. *Beat Sum double sum = 0.0; for (int i = 0; i < beat_histogram.length; i++) sum += beat_histogram[i]; double[] result = new double[1]; result[0] = sum; return result;
  • 11. *Strongest Frequency Via Zero Crossings result = (zero_crossings / 2.0) * (sampling_rate / (double) samples.length)
  • 12. *The above extracted features are then used to identify genre using one or more clustering algorithms. *Many approaches are used for the above, including Unsupervised and Supervised approach.
  • 13. *Unsupervised Approaches have no knowledge about genres. Classifier can observe the data position in the feature space, but do not know what the genre cluster of the data is. *Unsupervised classifiers: K-means, Agglomerative hierarchical clustering, Self-organizing Map (SOM), Growing hierarchical Self- organizing Map (GHSOM).
  • 14. *In Supervised Approaches, the system is trained by manually labeling the data at first, then, when unlabeled data (new coming data) comes, the trained system is used to classify it into a known genre. *Supervised classifiers: K-nearest neighbor (KNN), Gaussian Mixture Model (GMM), Linear Discriminant Analysis (LDA), Support Vector Machines (SVMs), Artificial Neural Networks (ANNs).
  • 15. *A fuzzy inference system is implemented. *It is a supervised classifier. *Rules are manually created. *The rules are, then, implemented on two feature sets, and the output evaluated. *Feature set 1 = (Zero Crossings, Beat Sum, Strongest Frequency) *Feature set 2 =(MFCC)
  • 16. *Classification results Accuracy Hits Ratio Feature Set 1 (ZCR + BS + SF) 85.0% 65.38% Feature Set 2 (MFCC) 72.5% 65.9%
  • 17. *The “front-end” of my project. *The Music Matrix is a NxN matrix where each cell represents a list of song(s) which are placed in one or more genres, in a fuzzy manner. *This system clearly demonstrates multi-label songs.
  • 18. *For example, choosing a cell in the following matrix may cause a list of songs to be played, that are 60%-70% classic, and 10%-15% pop.
  • 19. *Huge size of genre (and sub-genre) list. *Non-Agreement on Taxonomies – Well-known websites like Allmusic (http://www.allmusic.com— 531genres), Amazon (http://www.amazon.com—719 genres), and Mp3 (http://www.mp3.com—430 genres). *Trivialization of music art. *Classification Basis
  • 20. *Fuzzy definition of genres *Differences in human perception *Scalability of any AMC system
  • 21.
  • 22. *Automated Genre Classification is a non-trivial task. *Emotion and music-matching is subjective. *The problems of genre taxonomy are carried onto Automated Genre Classification.
  • 23. *Extraction of all features of an audio file is not only unnecessary, but also counterproductive. *Different combinations of extracted features and various classification algorithms yield different results, of different accuracy. *A combination of low-level signal properties such as zero-crossing rate, spectral centroid and skewness, mean energy, etc. and perception-based features such as MFCCs, beat histograms, etc. may be the most appropriate set.
  • 24. *Multi-label classification is the most appropriate for real world. *A fuzzy classification algorithm must be used to allow for multi-label songs. *A lot of novelty functions have been created, but, sadly, they return results of lesser accuracy.
  • 25. *Practices used for Automated Genre Classification can also be used to sieve similar songs. It may help in copyright and IPR protection. Ref: http://www.thatsongsoundslike.com/