Genre Classification
and Analysis
Anat Gilboa
Yanjun Qi, PhD
The Journey
Music Information Retrieval 101
Constructing approaches to a not-so-well-defined problem
Finding good data
Simplifying the problem
Data visualization
Finding not-so-good data
Fall 2014
Today!
Iterating
Machine Learning 101
Spring 2015
The Problem
Let’s find out…
• What makes one song similar to another?
• What are the characteristics by which we can “classify” the genre of a song?
Machine Learning 101
• Supervised
  • Regression (Linear, Logistic, Ridge)
  • Classification (Bagging, Naive Bayes, SVM, NN, KNN)
• Unsupervised
  • Dimension Reduction (PCA)
  • Clustering (K-means, GMM/EM, Hierarchical)
Music Information Retrieval 101
• Aims to extend the understanding and usefulness of music data through research, development and application of computational approaches and tools
• Combines concepts and techniques from music, computer science, signal processing and cognition
• Music information: bibliographical, surveys, tags, scores, MIDI, audio, etc.
Adoption
• USPOP2002
• Magnatagatune
• CAL500
• RWC MDB
• International Society for Music Information Retrieval (ISMIR) 2011 Dataset
The Million Song Dataset
• Collection of audio features and metadata for 1,000,000 contemporary popular music tracks
• 44,745 unique artists with dated tracks starting from 1922
• 10,000-song subset (1%, 1.8 GB)
• Each song has a number of features…
Song Fields
analysis sample rate
artist 7digitalid
artist familiarity
artist hotttnesss
artist id
artist latitude
artist location
artist longitude
artist mbid
artist mbtags
artist mbtags count
artist name
artist playmeid
artist terms
artist terms freq
artist terms weight
audio md5
bars confidence
bars start
beats confidence
beats start
danceability
duration
end of fade in
energy
key
key confidence
loudness
mode
mode confidence
release
release 7digitalid
sections confidence
sections start
segments confidence
segments loudness max
segments loudness max time
segments loudness max start
segments pitches
segments start
segments timbre
similar artists
song hotttnesss
song id
start of fade out
tatums confidence
tatums start
tempo
time signature
time signature confidence
title
track id
track 7digitalid
year
Numerical Features
Danceability - how danceable a song is. 0 is least danceable, 100 is most danceable.
Duration - the length of the song in seconds.
Energy - the overall energy of the song. 0 is least, 100 is most.
Hotttnesss - the popularity of the song. 0 is least, 100 is most.
Key - the key of the song. 0 is C, 1 is C# and so on.
Liveness - the likelihood that a song was performed in front of an audience. Above 80 is usually live.
Loudness - the overall loudness of the song in decibels.
Mode - the mode of the song, where minor is 0 and major is 1.
Speechiness - how much spoken word is in the song. 0 is least, 100 is most.
Tempo - the most frequently occurring tempo in the song, in beats per minute.
Time signature - the number of beats per measure in the song.
Acousticness - how acoustic vs. electric the song is.
Valence - how positive or negative the mood of the song is.
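The key encoding and the mixed scales in this list can be made concrete with a short Python sketch. It assumes the 0 to 11 pitch-class numbering described for Key, and it rescales a feature column so that tempo (in BPM) does not swamp key in later distance calculations. The helper names `key_name` and `min_max_scale` are hypothetical and are not part of any Echo Nest or Million Song Dataset API.

```python
# Pitch-class names for the integer key codes (0 is C, 1 is C#, and so on).
PITCH_CLASSES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def key_name(key):
    """Translate an integer key code (0-11) into a pitch-class name."""
    if not 0 <= key < len(PITCH_CLASSES):
        raise ValueError(f"key must be in 0..11, got {key}")
    return PITCH_CLASSES[key]


def min_max_scale(values):
    """Rescale a list of numbers to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:  # constant column: avoid dividing by zero
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


if __name__ == "__main__":
    print(key_name(0), key_name(1), key_name(11))  # prints: C C# B
    print(min_max_scale([60.0, 120.0, 180.0]))     # prints: [0.0, 0.5, 1.0]
```

Min-max scaling is only one possible choice here; standardizing to zero mean and unit variance would serve the same purpose.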
Inspiration came from…
The Facts
• 8,761 songs
  • (thanks, API request timeouts & rate limiting)
• 307 genres, extracted from the Artist API
  • k-means centroids
• 3,944 artists
  • Between 1 and 11 appearances in the set
Tasks
• Use K-means to create centroids for each genre
  • Hypothesis: If there are 307 genres represented, would the songs of each genre fall in the same cluster?
• Create a K-nearest neighbor tool to fetch the k nearest songs to some specified datapoint
  • f(Tempo, Key, K)
K-Means
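The slides do not show how the clustering was implemented, so the following is only a minimal sketch of plain K-means (Lloyd's algorithm) in Python. It assumes songs are represented as (tempo, key) pairs already rescaled to [0, 1]. The data points, the function names, and the choice to pass the starting centroids explicitly are all illustrative assumptions.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean_point(points):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(points)
    return tuple(sum(p[i] for p in points) / n for i in range(len(points[0])))


def k_means(points, initial_centroids, max_iterations=100):
    """Lloyd's algorithm: alternate assignment and centroid update steps."""
    centroids = list(initial_centroids)
    assignments = None
    for _ in range(max_iterations):
        # Assignment step: label each point with its nearest centroid.
        new_assignments = [
            min(range(len(centroids)),
                key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        if new_assignments == assignments:
            break  # no label changed, so the centroids are stable
        assignments = new_assignments
        # Update step: move each centroid to the mean of its members.
        for c in range(len(centroids)):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = mean_point(members)
    return centroids, assignments


# Hypothetical (tempo, key) pairs, already rescaled to [0, 1].
songs = [(0.10, 0.00), (0.15, 0.09), (0.12, 0.18),
         (0.80, 0.82), (0.90, 0.91), (0.85, 1.00)]
centroids, labels = k_means(songs, initial_centroids=[songs[0], songs[3]])
print(labels)  # prints: [0, 0, 0, 1, 1, 1]
```

K-means is sensitive to its starting centroids and can settle in a poor local optimum, which is why the starting points are explicit here. Comparing the resulting labels against the genre tags is one way to probe the hypothesis about genres and clusters.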
K-NN
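A nearest-neighbor lookup of the form f(Tempo, Key, K) might look like the Python sketch below. It assumes each song is a dictionary with `tempo` and `key` entries and uses plain Euclidean distance. The function name `nearest_songs` and the catalogue entries are made up for illustration and do not come from the original project.

```python
import math


def nearest_songs(songs, tempo, key, k):
    """Return the k songs closest to the query point (tempo, key)."""
    def distance(song):
        return math.hypot(song["tempo"] - tempo, song["key"] - key)

    # sorted() is stable, so ties keep their original catalogue order.
    return sorted(songs, key=distance)[:k]


# Hypothetical catalogue; titles and values are made up for illustration.
catalogue = [
    {"title": "Song A", "tempo": 120.0, "key": 0},
    {"title": "Song B", "tempo": 90.0, "key": 7},
    {"title": "Song C", "tempo": 124.0, "key": 2},
    {"title": "Song D", "tempo": 180.0, "key": 11},
]
for song in nearest_songs(catalogue, tempo=121.0, key=0, k=2):
    print(song["title"])  # prints "Song A", then "Song C"
```

Two caveats apply to this sketch. Unscaled tempo dominates the distance because its range is far wider than that of key. Key is also circular (B sits next to C), which straight subtraction ignores.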
The Future
• There’s a long way to go…
• No one can predict the future…
• MIR is awesome and powerful
• But seriously, K-fold cross-validation
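For readers unfamiliar with K-fold cross-validation, here is a minimal Python sketch of the index splitting it relies on: the data is cut into K contiguous folds, and each fold takes one turn as the held-out test set. The function name `k_fold_indices` is hypothetical. In practice the indices would usually be shuffled first, which this sketch omits.

```python
def k_fold_indices(n_samples, n_folds):
    """Yield (train_indices, test_indices) pairs, one per fold."""
    if not 2 <= n_folds <= n_samples:
        raise ValueError("n_folds must be between 2 and n_samples")
    indices = list(range(n_samples))
    # Spread the remainder over the first folds so sizes differ by at most 1.
    base, extra = divmod(n_samples, n_folds)
    start = 0
    for fold in range(n_folds):
        size = base + (1 if fold < extra else 0)
        test = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        yield train, test
        start += size


folds = list(k_fold_indices(10, 3))
for train, test in folds:
    print(test)
# prints:
# [0, 1, 2, 3]
# [4, 5, 6]
# [7, 8, 9]
```

A model would be fitted on each `train` split and scored on the matching `test` split, with the K scores averaged to estimate how well it generalizes.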
–Anat
“Inspirational Quote”
Sources
• http://developer.echonest.com/docs/v4/_static/AnalyzeDocumentation.pdf
• https://github.com/echonest
• https://github.com/tbertinmahieux/MSongsDB
Thank You!


Editor's Notes

  1. It’s been a long ride to get here.
  2. Expectation-maximization Hierarchical
  3. ISMIR is a non-profit organisation which, among other things, oversees the organisation of the ISMIR Conference. The ISMIR conference is held annually and is the world's leading research forum on processing, searching, organising and accessing music-related data. The RWC Music Database comprises six original collections: the Popular Music Database (100 songs), Royalty-Free Music Database (15 songs), Classical Music Database (50 pieces), Jazz Music Database (50 pieces), Music Genre Database (100 pieces), and Musical Instrument Sound Database (50 instruments).
  4. MSD is a freely available collection of audio features and metadata for a million contemporary popular music tracks. By the way, this is metadata… I didn’t casually download 10,000 songs and make a Hadoop cluster to compute, although this could potentially go there… Each song has a number of features but we’re interested in
  5. I met an engineer who represented Spotify,
  6. Not entirely sure why Aerosmith and Red Hot Chili Peppers have 11 songs, but maybe it’s because they came out with more songs, too.