SlideShare a Scribd company logo
Genre Classification
and Analysis
Anat Gilboa
Yanjun Qi, PhD
The Journey
Music Information
Retrieval 101
Constructing approaches
to a not-so-well-defined
problem
Finding good data
Simplifying the
problem
Data visualization
Finding not-so-
good data
Fall 2014
Today!Iterating
Machine
Learning
101
Spring 2015
Let’s find out…
• What makes one song similar to another?
• What are the characteristics by which we can “classify” the genre of a song?
The Problem
• Supervised
• Regression (Linear, Logistic, Ridge)
• Classification (Bagging, Naive Bayes, SVM, NN, KNN)
• Unsupervised
• Dimension Reduction (PCA)
• Clustering (K-means, GMM/EM, Hierarchical)
Machine Learning 101
Music Information Retrieval
101
• Aims to extend the understanding and usefulness of
music data, through research, development and
application of computational approaches and tools
• Combines concepts and techniques from music,
computer science, signal processing and cognition
• Music information: bibliographical, surveys, tags,
scores, MIDI, audio, etc
Adoption
• USPOP2002
• Magnatagatune
• CAL500
• RWC MDB
• International Society for Music Information Retrieval
(ISMIR) 2011 Dataset
• Collection of audio features and metadata for 1,000,000 contemporary popular
music tracks.
• 44,745 unique artists w/dated tracks starting from 1922
• 10,000 song subset (1%, 1.8 gb)
• Each song has a number of features…
The Million Song Dataset
loudness
mode
mode confidence
release
release 7digitalid
sections confidence
sections start
segments confidence
segments loudness max
segments loudness max time
segments loudness max start
segments pitches
segments start
segments timbre
similar artists
song hotttnesss
song id
start of fade out
tatums confidence
tatums start
tempo
time signature
time signature confidence
title
track id
track 7digitalid
year
analysis sample rate
artist 7digitalid
artist familiarity
artist hotttnesss
artist id
artist latitude
artist location
artist longitude
artist mbid
artist mbtags
artist mbtags count
artist name
artist playmeid
artist terms
artist terms freq
artist terms weight
audio md5
bars confidence
bars start
beats confidence
beats start
danceability
duration
end of fade in
energy
key
key confidence
key
tempo
Song Fields
Numerical Features
Danceability - how danceable a song is. 0 is least danceable, 100 is most danceable.
Duration - the length of the song in seconds.
Energy - the overall energy of the song, 0 is least, 100 is most.
Hotttnesss - the popularity of the song, 0 is least, 100 is most.
Key - the key the song. 0 is C, 1 is C# and so on.
Liveness - the likelihood that a song was performed in front of an audience. Above 80 is usually live.
Loudness - the overall loudness of the song in decibels.
Mode - the mode of the song where major is 0 and minor is 1.
Speechiness - how much spoken word is in the song. 0 is least, 100 is most
Tempo - the most frequently occurring tempo in the song, in beats-per-minute.
Time signature - the number of beats per measure in the song.
Acousticness how acoustic vs. electric is the song
Valence how positive or negative is the mood of the song
Inspiration came from…
• 8,761 songs
• (ty, API request timeouts & rate limiting)
• 307 genres-extracted from the Artist API
• k-means centroids
• 3,944 artists
• Between 1 - 11 appearances in the set
The Facts
• Use K-means to create centroids for each genre
• Hypothesis: If there are 307 genres
represented, would each be in the same
cluster?
• Create K-nearest neighbor tool to fetch k nearest
songs to some specified datapoint
• f(Tempo, Key, K)
Tasks
K-Means
K-NN
The Future
• There’s a long way to go…
• No one can predict the future…
• MIR is awesome and powerful
• But seriously, K-fold cross validation
–Anat
“Inspirational Quote”
Sources
• http://developer.echonest.com/docs/v4/_static/AnalyzeDocumentation.pdf
• https://github.com/echonest
• https://github.com/tbertinmahieux/MSongsDB
Thank You!

More Related Content

What's hot

Image processing fundamentals
Image processing fundamentalsImage processing fundamentals
Image processing fundamentals
A B Shinde
 
Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...
Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...
Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...
Simplilearn
 
A Comparison of Block-Matching Motion Estimation Algorithms
A Comparison of Block-Matching Motion Estimation AlgorithmsA Comparison of Block-Matching Motion Estimation Algorithms
A Comparison of Block-Matching Motion Estimation Algorithms
Multimedia and Vision Laboratory at Universidad del Valle
 
Crime Analysis & Prediction System
Crime Analysis & Prediction SystemCrime Analysis & Prediction System
Crime Analysis & Prediction System
BigDataCloud
 
What is Machine Learning | Introduction to Machine Learning | Machine Learnin...
What is Machine Learning | Introduction to Machine Learning | Machine Learnin...What is Machine Learning | Introduction to Machine Learning | Machine Learnin...
What is Machine Learning | Introduction to Machine Learning | Machine Learnin...
Simplilearn
 
phase 1 ppt dal adulteration.pptx
phase 1 ppt dal adulteration.pptxphase 1 ppt dal adulteration.pptx
phase 1 ppt dal adulteration.pptx
bharatt7
 
Machine Learning for Disease Prediction
Machine Learning for Disease PredictionMachine Learning for Disease Prediction
Machine Learning for Disease Prediction
Mustafa Oğuz
 
Face recognition ppt
Face recognition pptFace recognition ppt
Face recognition ppt
Santosh Kumar
 
IRJET- Music Genre Classification using Machine Learning Algorithms: A Compar...
IRJET- Music Genre Classification using Machine Learning Algorithms: A Compar...IRJET- Music Genre Classification using Machine Learning Algorithms: A Compar...
IRJET- Music Genre Classification using Machine Learning Algorithms: A Compar...
IRJET Journal
 
An Introduction to Image Processing and Artificial Intelligence
An Introduction to Image Processing and Artificial IntelligenceAn Introduction to Image Processing and Artificial Intelligence
An Introduction to Image Processing and Artificial Intelligence
Wasif Altaf
 
Speech emotion recognition
Speech emotion recognitionSpeech emotion recognition
Speech emotion recognition
saniya shaikh
 
Indian-Sign-Language-Recognition
Indian-Sign-Language-RecognitionIndian-Sign-Language-Recognition
Indian-Sign-Language-Recognition
Sajanraj T D
 
Crop Yield Prediction using Machine Learning
Crop Yield Prediction using Machine LearningCrop Yield Prediction using Machine Learning
Crop Yield Prediction using Machine Learning
IRJET Journal
 
Web mining
Web mining Web mining
Web mining
TeklayBirhane
 
Emotion Speech Recognition - Convolutional Neural Network Capstone Project
Emotion Speech Recognition - Convolutional Neural Network Capstone ProjectEmotion Speech Recognition - Convolutional Neural Network Capstone Project
Emotion Speech Recognition - Convolutional Neural Network Capstone Project
Diego Rios
 
Data Mining: an Introduction
Data Mining: an IntroductionData Mining: an Introduction
Data Mining: an Introduction
Ali Abbasi
 
3D reconstruction
3D reconstruction3D reconstruction
3D reconstruction
Jorge Leandro, Ph.D.
 
Machine model to classify dogs and cat
Machine model to classify dogs and catMachine model to classify dogs and cat
Machine model to classify dogs and cat
Akash Parui
 
Bakhrabad gas field assistant manager question solution
Bakhrabad gas field assistant manager question  solutionBakhrabad gas field assistant manager question  solution
Bakhrabad gas field assistant manager question solution
Itmona
 

What's hot (20)

Image processing fundamentals
Image processing fundamentalsImage processing fundamentals
Image processing fundamentals
 
Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...
Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...
Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...
 
A Comparison of Block-Matching Motion Estimation Algorithms
A Comparison of Block-Matching Motion Estimation AlgorithmsA Comparison of Block-Matching Motion Estimation Algorithms
A Comparison of Block-Matching Motion Estimation Algorithms
 
Crime Analysis & Prediction System
Crime Analysis & Prediction SystemCrime Analysis & Prediction System
Crime Analysis & Prediction System
 
What is Machine Learning | Introduction to Machine Learning | Machine Learnin...
What is Machine Learning | Introduction to Machine Learning | Machine Learnin...What is Machine Learning | Introduction to Machine Learning | Machine Learnin...
What is Machine Learning | Introduction to Machine Learning | Machine Learnin...
 
phase 1 ppt dal adulteration.pptx
phase 1 ppt dal adulteration.pptxphase 1 ppt dal adulteration.pptx
phase 1 ppt dal adulteration.pptx
 
undergraduate_transcript
undergraduate_transcriptundergraduate_transcript
undergraduate_transcript
 
Machine Learning for Disease Prediction
Machine Learning for Disease PredictionMachine Learning for Disease Prediction
Machine Learning for Disease Prediction
 
Face recognition ppt
Face recognition pptFace recognition ppt
Face recognition ppt
 
IRJET- Music Genre Classification using Machine Learning Algorithms: A Compar...
IRJET- Music Genre Classification using Machine Learning Algorithms: A Compar...IRJET- Music Genre Classification using Machine Learning Algorithms: A Compar...
IRJET- Music Genre Classification using Machine Learning Algorithms: A Compar...
 
An Introduction to Image Processing and Artificial Intelligence
An Introduction to Image Processing and Artificial IntelligenceAn Introduction to Image Processing and Artificial Intelligence
An Introduction to Image Processing and Artificial Intelligence
 
Speech emotion recognition
Speech emotion recognitionSpeech emotion recognition
Speech emotion recognition
 
Indian-Sign-Language-Recognition
Indian-Sign-Language-RecognitionIndian-Sign-Language-Recognition
Indian-Sign-Language-Recognition
 
Crop Yield Prediction using Machine Learning
Crop Yield Prediction using Machine LearningCrop Yield Prediction using Machine Learning
Crop Yield Prediction using Machine Learning
 
Web mining
Web mining Web mining
Web mining
 
Emotion Speech Recognition - Convolutional Neural Network Capstone Project
Emotion Speech Recognition - Convolutional Neural Network Capstone ProjectEmotion Speech Recognition - Convolutional Neural Network Capstone Project
Emotion Speech Recognition - Convolutional Neural Network Capstone Project
 
Data Mining: an Introduction
Data Mining: an IntroductionData Mining: an Introduction
Data Mining: an Introduction
 
3D reconstruction
3D reconstruction3D reconstruction
3D reconstruction
 
Machine model to classify dogs and cat
Machine model to classify dogs and catMachine model to classify dogs and cat
Machine model to classify dogs and cat
 
Bakhrabad gas field assistant manager question solution
Bakhrabad gas field assistant manager question  solutionBakhrabad gas field assistant manager question  solution
Bakhrabad gas field assistant manager question solution
 

Similar to Genre Classification and Analysis

MIR
MIRMIR
Using mashup technology to improve findability
Using mashup technology to improve findabilityUsing mashup technology to improve findability
Using mashup technology to improve findability
Sten Govaerts
 
20211026 taicca 1 intro to mir
20211026 taicca 1 intro to mir20211026 taicca 1 intro to mir
20211026 taicca 1 intro to mir
Yi-Hsuan Yang
 
楊奕軒/音樂資料檢索
楊奕軒/音樂資料檢索楊奕軒/音樂資料檢索
楊奕軒/音樂資料檢索
台灣資料科學年會
 
Understanding Music Playlists
Understanding Music PlaylistsUnderstanding Music Playlists
Understanding Music Playlists
Keunwoo Choi
 
Music Information Retrieval: Overview and Current Trends 2008
Music Information Retrieval: Overview and Current Trends 2008Music Information Retrieval: Overview and Current Trends 2008
Music Information Retrieval: Overview and Current Trends 2008
Rui Pedro Paiva
 
Introduction musictech
Introduction musictechIntroduction musictech
Introduction musictech
Jia Liu
 
Multi modal music mood classification
Multi modal music mood classificationMulti modal music mood classification
Multi modal music mood classification
xiaohusmile
 
Capturing Themed Evidence, a Hybrid Approach
Capturing Themed Evidence, a Hybrid ApproachCapturing Themed Evidence, a Hybrid Approach
Capturing Themed Evidence, a Hybrid Approach
Enrico Daga
 
Denktank 2010
Denktank 2010Denktank 2010
Denktank 2010
ocor203
 
Let the Computer Do the Work
Let the Computer Do the WorkLet the Computer Do the Work
Let the Computer Do the Work
WGBH Media Library and Archives
 
Creating an Entertaining and Informative Music Visualization
Creating an Entertaining and Informative Music VisualizationCreating an Entertaining and Informative Music Visualization
Creating an Entertaining and Informative Music Visualization
icchp2012
 
[221]똑똑한 인공지능 dj 비서 clova music
[221]똑똑한 인공지능 dj 비서 clova music[221]똑똑한 인공지능 dj 비서 clova music
[221]똑똑한 인공지능 dj 비서 clova music
NAVER D2
 
Deep Learning Meetup #5
Deep Learning Meetup #5Deep Learning Meetup #5
Deep Learning Meetup #5
Aloïs Gruson
 
Big Data Research - Pandara
Big Data Research - Pandara Big Data Research - Pandara
Big Data Research - Pandara
Leeya Ressom
 
Dancing Machine to Doesn't Really Matter: Analysis of pop music
Dancing Machine to Doesn't Really Matter: Analysis of pop musicDancing Machine to Doesn't Really Matter: Analysis of pop music
Dancing Machine to Doesn't Really Matter: Analysis of pop music
JAYALAKSHMI NAIR
 
Music Objects to Social Machines
Music Objects to Social MachinesMusic Objects to Social Machines
Music Objects to Social Machines
David De Roure
 
Enfuse_QS.pdf
Enfuse_QS.pdfEnfuse_QS.pdf
Enfuse_QS.pdf
ElioLaureano1
 
Searching for the right feelings: Emotional metadata in music
Searching for the right feelings: Emotional metadata in musicSearching for the right feelings: Emotional metadata in music
Searching for the right feelings: Emotional metadata in music
Diane Rasmussen Pennington
 
Setlistör Demo Slides
Setlistör Demo SlidesSetlistör Demo Slides
Setlistör Demo Slides
Doga Kerestecioglu
 

Similar to Genre Classification and Analysis (20)

MIR
MIRMIR
MIR
 
Using mashup technology to improve findability
Using mashup technology to improve findabilityUsing mashup technology to improve findability
Using mashup technology to improve findability
 
20211026 taicca 1 intro to mir
20211026 taicca 1 intro to mir20211026 taicca 1 intro to mir
20211026 taicca 1 intro to mir
 
楊奕軒/音樂資料檢索
楊奕軒/音樂資料檢索楊奕軒/音樂資料檢索
楊奕軒/音樂資料檢索
 
Understanding Music Playlists
Understanding Music PlaylistsUnderstanding Music Playlists
Understanding Music Playlists
 
Music Information Retrieval: Overview and Current Trends 2008
Music Information Retrieval: Overview and Current Trends 2008Music Information Retrieval: Overview and Current Trends 2008
Music Information Retrieval: Overview and Current Trends 2008
 
Introduction musictech
Introduction musictechIntroduction musictech
Introduction musictech
 
Multi modal music mood classification
Multi modal music mood classificationMulti modal music mood classification
Multi modal music mood classification
 
Capturing Themed Evidence, a Hybrid Approach
Capturing Themed Evidence, a Hybrid ApproachCapturing Themed Evidence, a Hybrid Approach
Capturing Themed Evidence, a Hybrid Approach
 
Denktank 2010
Denktank 2010Denktank 2010
Denktank 2010
 
Let the Computer Do the Work
Let the Computer Do the WorkLet the Computer Do the Work
Let the Computer Do the Work
 
Creating an Entertaining and Informative Music Visualization
Creating an Entertaining and Informative Music VisualizationCreating an Entertaining and Informative Music Visualization
Creating an Entertaining and Informative Music Visualization
 
[221]똑똑한 인공지능 dj 비서 clova music
[221]똑똑한 인공지능 dj 비서 clova music[221]똑똑한 인공지능 dj 비서 clova music
[221]똑똑한 인공지능 dj 비서 clova music
 
Deep Learning Meetup #5
Deep Learning Meetup #5Deep Learning Meetup #5
Deep Learning Meetup #5
 
Big Data Research - Pandara
Big Data Research - Pandara Big Data Research - Pandara
Big Data Research - Pandara
 
Dancing Machine to Doesn't Really Matter: Analysis of pop music
Dancing Machine to Doesn't Really Matter: Analysis of pop musicDancing Machine to Doesn't Really Matter: Analysis of pop music
Dancing Machine to Doesn't Really Matter: Analysis of pop music
 
Music Objects to Social Machines
Music Objects to Social MachinesMusic Objects to Social Machines
Music Objects to Social Machines
 
Enfuse_QS.pdf
Enfuse_QS.pdfEnfuse_QS.pdf
Enfuse_QS.pdf
 
Searching for the right feelings: Emotional metadata in music
Searching for the right feelings: Emotional metadata in musicSearching for the right feelings: Emotional metadata in music
Searching for the right feelings: Emotional metadata in music
 
Setlistör Demo Slides
Setlistör Demo SlidesSetlistör Demo Slides
Setlistör Demo Slides
 

Recently uploaded

Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
Zilliz
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
Zilliz
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
Edge AI and Vision Alliance
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
IndexBug
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
Daiki Mogmet Ito
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 

Recently uploaded (20)

Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 

Genre Classification and Analysis

  • 2. The Journey Music Information Retrieval 101 Constructing approaches to a not-so-well-defined problem Finding good data Simplifying the problem Data visualization Finding not-so- good data Fall 2014 Today!Iterating Machine Learning 101 Spring 2015
  • 3. Let’s find out… • What makes one song similar to another? • What are the characteristics by which we can “classify” the genre of a song? The Problem
  • 4. • Supervised • Regression (Linear, Logistic, Ridge) • Classification (Bagging, Naive Bayes, SVM, NN, KNN) • Unsupervised • Dimension Reduction (PCA) • Clustering (K-means, GMM/EM, Hierarchical) Machine Learning 101
  • 5. Music Information Retrieval 101 • Aims to extend the understanding and usefulness of music data, through research, development and application of computational approaches and tools • Combines concepts and techniques from music, computer science, signal processing and cognition • Music information: bibliographical, surveys, tags, scores, MIDI, audio, etc
  • 6. Adoption • USPOP2002 • Magnatagatune • CAL500 • RWC MDB • International Society for Music Information Retrieval (ISMIR) 2011 Dataset
  • 7. • Collection of audio features and metadata for 1,000,000 contemporary popular music tracks. • 44,745 unique artists w/dated tracks starting from 1922 • 10,000 song subset (1%, 1.8 gb) • Each song has a number of features… The Million Song Dataset
  • 8. loudness mode mode confidence release release 7digitalid sections confidence sections start segments confidence segments loudness max segments loudness max time segments loudness max start segments pitches segments start segments timbre similar artists song hotttnesss song id start of fade out tatums confidence tatums start tempo time signature time signature confidence title track id track 7digitalid year analysis sample rate artist 7digitalid artist familiarity artist hotttnesss artist id artist latitude artist location artist longitude artist mbid artist mbtags artist mbtags count artist name artist playmeid artist terms artist terms freq artist terms weight audio md5 bars confidence bars start beats confidence beats start danceability duration end of fade in energy key key confidence key tempo Song Fields
  • 9. Numerical Features Danceability - how danceable a song is. 0 is least danceable, 100 is most danceable. Duration - the length of the song in seconds. Energy - the overall energy of the song, 0 is least, 100 is most. Hotttnesss - the popularity of the song, 0 is least, 100 is most. Key - the key the song. 0 is C, 1 is C# and so on. Liveness - the likelihood that a song was performed in front of an audience. Above 80 is usually live. Loudness - the overall loudness of the song in decibels. Mode - the mode of the song where major is 0 and minor is 1. Speechiness - how much spoken word is in the song. 0 is least, 100 is most Tempo - the most frequently occurring tempo in the song, in beats-per-minute. Time signature - the number of beats per measure in the song. Acousticness how acoustic vs. electric is the song Valence how positive or negative is the mood of the song
  • 11. • 8,761 songs • (ty, API request timeouts & rate limiting) • 307 genres-extracted from the Artist API • k-means centroids • 3,944 artists • Between 1 - 11 appearances in the set The Facts
  • 12. • Use K-means to create centroids for each genre • Hypothesis: If there are 307 genres represented, would each be in the same cluster? • Create K-nearest neighbor tool to fetch k nearest songs to some specified datapoint • f(Tempo, Key, K) Tasks
  • 14. K-NN
  • 15. The Future • There’s a long way to go… • No one can predict the future… • MIR is awesome and powerful • But seriously, K-fold cross validation

Editor's Notes

  1. It’s been a long ride to get here.
  2. Expectation-maximization Hierarchical
  3. non-profit organisation which, among other things, oversees the organisation of the ISMIR Conference. The ISMIR conference is held annually and is the world's leading research forum on processing, searching, organising and accessing music-related data. six original collections: the Popular Music Database (100 songs), Royalty-Free Music Database (15 songs), Classical Music Database (50 pieces), Jazz Music Database (50 pieces), Music Genre Database (100 pieces), and Musical Instrument Sound Database (50 instruments)
  4. MSD is a freely-available collection of audio features and metadata for a million contemporary popular music tracks. By the way, this is metadata…I didn’t casually download 10,000 songs and make a hadoop cluster to compute, although this could potentially go there… Each song has a number of features but we’re interested in
  5. I met an engineer who represented Spotify,
  6. Not entirely sure why Aerosmith and Red Hot Chilly Peppers have 11 songs, but maybe it’s because they came out with more songs, too.
  7. Not entirely sure why Aerosmith and Red Hot Chilly Peppers have 11 songs, but maybe it’s because they came out with more songs, too.