SlideShare a Scribd company logo
Daniel Wolff and Tillman Weyde
ADAPTING METRICS FOR MUSIC SIMILARITY                                                                           Music Informatics Research Group
                                                                                                                Northampton Square, EC1V 0HB London

          USING COMPARATIVE RATINGS                                                                             daniel.wolff.1@soi.city.ac.uk
                                                                                                                t.e.weyde@soi.city.ac.uk



                         Motivation                                                         Similarity Data                                                           Experiments
This poster presents a machine learning approach for analysing      • TagATune gamers have to                                              • 5-fold cross validation with test-sets of ~106 binary
user data that specifies song similarity. Understanding how         agree on the “outlier” clip out of 3                                   rankings, evaluate fulfilled rankings
we relate and compare music has been a topic of great
interest in musicology as well as for business applications, such                                                                          Test Set:
as music recommender systems. The way music is compared             • Data for 533 clip triplets
seems to vary between different cultures. Adapting a generic           Avg. 14 votes per triplet                                            MLR:     82%
model to user ratings is useful for personalisation and can help       1019 clips included                                                  mlrDiag:71%
to better understand such differences.                                                                                                      SVM:     70%
                                                                    Postprocessing:                                                         Eucl.:   67%
In our experiments we find that a significant amount of             • Consider the triplet histograms as voting                             (Wij = δij)
information can be gained from comparative similarity ratings,         Determine winning outlier (B) where possible
allowing for an improved similarity estimation on seen and                                                                     B
unseen data.                                                           Discard votings featuring no clear winner                   C
                                                                                                                  A                        Training Set:
                                                                    • Derive relative clip similarity constraints:                          MLR: Best, but
           Audio and Similarity Dataset:                               (A, B, C), B being the outlier implies                                bad for <50
          MagnaTagATune [E. Law et al. 2009]                           sim(A, C) > sim(A, B) AND sim(A, C) > sim(B, C)                       constr.
                                                                                                                                            mlrDiag: weak
Online Song excerpts from the Magnatune label                       • Derive binary rankings                                                  adaptation
• 30 seconds long, can be divided into 4 broad categories:             Alternative representation of constraints                            SVM: good on training data, bad generalisation
   ”electronica” (30%), ”classical” (28%), ”world” (15%) and           Inconsistent constraints are removed (where clips are similar
   ”rock” (17%)                                                        and dissimilar at the same time)
• Annotation data (user tags) and similarity ratings from the
                                                                                                                                                Feature dimension / PCA feature test
human computation game „TagATune“
                                                                                                                                           • Features reduced to 20 –dim using Principal Component
                                                                              Similarity Model and Adaptation                              Analysis (PCA)
                          Features                                  • Mahalanobis metric for measuring clip similarity:
                                                                                                                                           MLR:     77%
The clips in our database are described using a combination of                    dW ( x , y ) = ( x − y ) T W ( x − y )                   mlrDiag: 76%
content-based and genre features:
                                                                                                                                           SVM:     76%
                                                                       Matrix W defines the similarity measure, clip feature
Chroma and timbre features precomputed by “TheEchoNest”                 vectors x and y
• Postprocessing:
   K-means: 4 clusters per clip and feature type,                      Generalised Euclidean metric
   12-dim. chroma features are transposed to root note C                allows for geometric interpretation
   12-dim. timbre features are clipped                                  psychological validity has been questioned                                                     Conclusion
   Both normalised to a maximum value of 1
                                                                                                                                           • Similarity constraints contain generalisable information,
                                                                    • We compare two different algorithms for optimising dW
                                                                                                                                           which can be trained using the tested methods.
2-3 genres per clip are annotated in the Magnatune                     1. MLR: [McFee and Lanckriet 2010] optimise a full W to
                                                                                                                                           • MLR works well on both feature types tested
catalogue                                                              binary rankings
                                                                                                                                           • mlrDiag tradeoff for regularisation and constraints has to
• Each clip is assigned a 44-dim. binary genre vector                  1.1. mlrDiag: MLR variant restrained to a diagonal matrix W
                                                                                                                                           be investigated
                                                                       2. SVM: [Schultz and Joachims 2003] optimise a weighted
                                                                                                                                           • Faster SVM works comparably well for low-dimensional
Chroma and timbre centroid information and genre features are          Euclidean metric using a diagonal matrix W                          feature space
combined into one 148-dim. vector per clip
                                                                                                                                           For references and details, please ask or see our paper in the proceedings.

More Related Content

Viewers also liked

Monumento en ruinas
Monumento en ruinasMonumento en ruinas
Monumento en ruinas
Rafael Medina
 
Project f.s copy
Project f.s   copyProject f.s   copy
Project f.s copy
Karishma Shah
 
Reconocimiento 1
Reconocimiento 1Reconocimiento 1
Reconocimiento 1
xeronimo
 
Presentación1
Presentación1Presentación1
Presentación1RaquelGar
 
IT Resonance
IT Resonance IT Resonance
IT Resonance
mkpchandran
 
Sunrise House Garden
Sunrise House GardenSunrise House Garden
Sunrise House Garden
Novos Lançamentos no Rio
 
Propuesta para cuidar el aparato digestivo
Propuesta para cuidar el aparato digestivoPropuesta para cuidar el aparato digestivo
Propuesta para cuidar el aparato digestivo
UO
 
Independence
IndependenceIndependence
Independence
Moinak Sarma Roy
 
PALM - business idea sweet spot
PALM - business idea sweet spotPALM - business idea sweet spot
PALM - business idea sweet spot
David Bozward
 
Merican revolution
Merican revolution Merican revolution
Merican revolution
mdjanes75
 
Mini zine-folding-guide
Mini zine-folding-guideMini zine-folding-guide
Mini zine-folding-guide
Havusto
 
My Heart Beats for Palestine
My Heart Beats for PalestineMy Heart Beats for Palestine
My Heart Beats for Palestine
Nubia **
 
JnNURM Bus Financing - Delhi Experience
JnNURM Bus Financing - Delhi ExperienceJnNURM Bus Financing - Delhi Experience
JnNURM Bus Financing - Delhi Experience
Jaspal Singh
 

Viewers also liked (16)

Utopia / Si has begut o consumit drogues, no agafis la moto
Utopia / Si has begut o consumit drogues, no agafis la motoUtopia / Si has begut o consumit drogues, no agafis la moto
Utopia / Si has begut o consumit drogues, no agafis la moto
 
Monumento en ruinas
Monumento en ruinasMonumento en ruinas
Monumento en ruinas
 
Project f.s copy
Project f.s   copyProject f.s   copy
Project f.s copy
 
Reconocimiento 1
Reconocimiento 1Reconocimiento 1
Reconocimiento 1
 
Presentación1
Presentación1Presentación1
Presentación1
 
IT Resonance
IT Resonance IT Resonance
IT Resonance
 
Sunrise House Garden
Sunrise House GardenSunrise House Garden
Sunrise House Garden
 
Propuesta para cuidar el aparato digestivo
Propuesta para cuidar el aparato digestivoPropuesta para cuidar el aparato digestivo
Propuesta para cuidar el aparato digestivo
 
Independence
IndependenceIndependence
Independence
 
PALM - business idea sweet spot
PALM - business idea sweet spotPALM - business idea sweet spot
PALM - business idea sweet spot
 
Lektiecafe
LektiecafeLektiecafe
Lektiecafe
 
Merican revolution
Merican revolution Merican revolution
Merican revolution
 
Mini zine-folding-guide
Mini zine-folding-guideMini zine-folding-guide
Mini zine-folding-guide
 
My Heart Beats for Palestine
My Heart Beats for PalestineMy Heart Beats for Palestine
My Heart Beats for Palestine
 
JnNURM Bus Financing - Delhi Experience
JnNURM Bus Financing - Delhi ExperienceJnNURM Bus Financing - Delhi Experience
JnNURM Bus Financing - Delhi Experience
 
Mitglieder
MitgliederMitglieder
Mitglieder
 

Similar to ADAPTING METRICS FOR MUSIC SIMILARITY USING COMPARATIVE RATINGS

Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
AIST
 
Ph d colloquium 2
Ph d colloquium 2Ph d colloquium 2
Ph d colloquium 2
Rishi Roy
 
SVD and the Netflix Dataset
SVD and the Netflix DatasetSVD and the Netflix Dataset
SVD and the Netflix Dataset
Ben Mabey
 
Comparison of image segmentation
Comparison of image segmentationComparison of image segmentation
Comparison of image segmentation
Haitham Ahmed
 
Extending Recommendation Systems With Semantics And Context Awareness
Extending Recommendation Systems With Semantics And Context AwarenessExtending Recommendation Systems With Semantics And Context Awareness
Extending Recommendation Systems With Semantics And Context Awareness
Victor Codina
 
AGE AND GENDER DETECTION.pptx
AGE AND GENDER DETECTION.pptxAGE AND GENDER DETECTION.pptx
AGE AND GENDER DETECTION.pptx
ssuserb4a9ba
 
ageandgenderdetection-220802061020-9ee5a2cd.pptx
ageandgenderdetection-220802061020-9ee5a2cd.pptxageandgenderdetection-220802061020-9ee5a2cd.pptx
ageandgenderdetection-220802061020-9ee5a2cd.pptx
dhaliwalharsh055
 
Keynote at IWLS 2017
Keynote at IWLS 2017Keynote at IWLS 2017
Keynote at IWLS 2017
Manish Pandey
 
論文紹介:Learning With Neighbor Consistency for Noisy Labels
論文紹介:Learning With Neighbor Consistency for Noisy Labels論文紹介:Learning With Neighbor Consistency for Noisy Labels
論文紹介:Learning With Neighbor Consistency for Noisy Labels
Toru Tamaki
 
Machine learning for_finance
Machine learning for_financeMachine learning for_finance
Machine learning for_finance
Stefan Duprey
 
A scalable collaborative filtering framework based on co clustering
A scalable collaborative filtering framework based on co clusteringA scalable collaborative filtering framework based on co clustering
A scalable collaborative filtering framework based on co clustering
AllenWu
 
Blind Source Camera Identification
Blind Source Camera Identification Blind Source Camera Identification
Blind Source Camera Identification
Sudhanshu Patel
 
Responsible AI in Industry: Practical Challenges and Lessons Learned
Responsible AI in Industry: Practical Challenges and Lessons LearnedResponsible AI in Industry: Practical Challenges and Lessons Learned
Responsible AI in Industry: Practical Challenges and Lessons Learned
Krishnaram Kenthapadi
 
Cluster analysis
Cluster analysisCluster analysis
Cluster analysis
Acad
 
Throttling Malware Families in 2D
Throttling Malware Families in 2DThrottling Malware Families in 2D
Throttling Malware Families in 2D
Mohamed Nassar
 
Graph Based Machine Learning with Applications to Media Analytics
Graph Based Machine Learning with Applications to Media AnalyticsGraph Based Machine Learning with Applications to Media Analytics
Graph Based Machine Learning with Applications to Media Analytics
NYC Predictive Analytics
 
Generative Adversarial Networks and Their Applications in Medical Imaging
Generative Adversarial Networks  and Their Applications in Medical ImagingGenerative Adversarial Networks  and Their Applications in Medical Imaging
Generative Adversarial Networks and Their Applications in Medical Imaging
Sanghoon Hong
 
poster_silhouette
poster_silhouetteposter_silhouette
The importance of model fairness and interpretability in AI systems
The importance of model fairness and interpretability in AI systemsThe importance of model fairness and interpretability in AI systems
The importance of model fairness and interpretability in AI systems
Francesca Lazzeri, PhD
 
Mattar_PhD_Thesis
Mattar_PhD_ThesisMattar_PhD_Thesis
Mattar_PhD_Thesis
Marwan Mattar
 

Similar to ADAPTING METRICS FOR MUSIC SIMILARITY USING COMPARATIVE RATINGS (20)

Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
 
Ph d colloquium 2
Ph d colloquium 2Ph d colloquium 2
Ph d colloquium 2
 
SVD and the Netflix Dataset
SVD and the Netflix DatasetSVD and the Netflix Dataset
SVD and the Netflix Dataset
 
Comparison of image segmentation
Comparison of image segmentationComparison of image segmentation
Comparison of image segmentation
 
Extending Recommendation Systems With Semantics And Context Awareness
Extending Recommendation Systems With Semantics And Context AwarenessExtending Recommendation Systems With Semantics And Context Awareness
Extending Recommendation Systems With Semantics And Context Awareness
 
AGE AND GENDER DETECTION.pptx
AGE AND GENDER DETECTION.pptxAGE AND GENDER DETECTION.pptx
AGE AND GENDER DETECTION.pptx
 
ageandgenderdetection-220802061020-9ee5a2cd.pptx
ageandgenderdetection-220802061020-9ee5a2cd.pptxageandgenderdetection-220802061020-9ee5a2cd.pptx
ageandgenderdetection-220802061020-9ee5a2cd.pptx
 
Keynote at IWLS 2017
Keynote at IWLS 2017Keynote at IWLS 2017
Keynote at IWLS 2017
 
論文紹介:Learning With Neighbor Consistency for Noisy Labels
論文紹介:Learning With Neighbor Consistency for Noisy Labels論文紹介:Learning With Neighbor Consistency for Noisy Labels
論文紹介:Learning With Neighbor Consistency for Noisy Labels
 
Machine learning for_finance
Machine learning for_financeMachine learning for_finance
Machine learning for_finance
 
A scalable collaborative filtering framework based on co clustering
A scalable collaborative filtering framework based on co clusteringA scalable collaborative filtering framework based on co clustering
A scalable collaborative filtering framework based on co clustering
 
Blind Source Camera Identification
Blind Source Camera Identification Blind Source Camera Identification
Blind Source Camera Identification
 
Responsible AI in Industry: Practical Challenges and Lessons Learned
Responsible AI in Industry: Practical Challenges and Lessons LearnedResponsible AI in Industry: Practical Challenges and Lessons Learned
Responsible AI in Industry: Practical Challenges and Lessons Learned
 
Cluster analysis
Cluster analysisCluster analysis
Cluster analysis
 
Throttling Malware Families in 2D
Throttling Malware Families in 2DThrottling Malware Families in 2D
Throttling Malware Families in 2D
 
Graph Based Machine Learning with Applications to Media Analytics
Graph Based Machine Learning with Applications to Media AnalyticsGraph Based Machine Learning with Applications to Media Analytics
Graph Based Machine Learning with Applications to Media Analytics
 
Generative Adversarial Networks and Their Applications in Medical Imaging
Generative Adversarial Networks  and Their Applications in Medical ImagingGenerative Adversarial Networks  and Their Applications in Medical Imaging
Generative Adversarial Networks and Their Applications in Medical Imaging
 
poster_silhouette
poster_silhouetteposter_silhouette
poster_silhouette
 
The importance of model fairness and interpretability in AI systems
The importance of model fairness and interpretability in AI systemsThe importance of model fairness and interpretability in AI systems
The importance of model fairness and interpretability in AI systems
 
Mattar_PhD_Thesis
Mattar_PhD_ThesisMattar_PhD_Thesis
Mattar_PhD_Thesis
 

Recently uploaded

Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
Alex Pruden
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
Brandon Minnick, MBA
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
AstuteBusiness
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
Hiroshi SHIBATA
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Alpen-Adria-Universität
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge GraphGraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
Neo4j
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
saastr
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
Fwdays
 
AppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSFAppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSF
Ajin Abraham
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
Fwdays
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
Miro Wengner
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
c5vrf27qcz
 
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
Jason Yip
 

Recently uploaded (20)

Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge GraphGraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
 
AppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSFAppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSF
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
 
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
 
Artificial Intelligence and Electronic Warfare
Artificial Intelligence and Electronic WarfareArtificial Intelligence and Electronic Warfare
Artificial Intelligence and Electronic Warfare
 

ADAPTING METRICS FOR MUSIC SIMILARITY USING COMPARATIVE RATINGS

  • 1. Daniel Wolff and Tillman Weyde ADAPTING METRICS FOR MUSIC SIMILARITY Music Informatics Research Group Northampton Square, EC1V 0HB London USING COMPARATIVE RATINGS daniel.wolff.1@soi.city.ac.uk t.e.weyde@soi.city.ac.uk Motivation Similarity Data Experiments This poster presents a machine learning approach for analysing • TagATune gamers have to • 5-fold cross validation with test-sets of ~106 binary user data that specifies song similarity. Understanding how agree on the “outlier” clip out of 3 rankings, evaluate fulfilled rankings we relate and compare music has been a topic of great interest in musicology as well as for business applications, such Test Set: as music recommender systems. The way music is compared • Data for 533 clip triplets seems to vary between different cultures. Adapting a generic Avg. 14 votes per triplet MLR: 82% model to user ratings is useful for personalisation and can help 1019 clips included mlrDiag:71% to better understand such differences. SVM: 70% Postprocessing: Eucl.: 67% In our experiments we find that a significant amount of • Consider the triplet histograms as voting (Wij = δij) information can be gained from comparative similarity ratings, Determine winning outlier (B) where possible allowing for an improved similarity estimation on seen and B unseen data. Discard votings featuring no clear winner C A Training Set: • Derive relative clip similarity constraints: MLR: Best, but Audio and Similarity Dataset: (A, B, C), B being the outlier implies bad for <50 MagnaTagATune [E. Law et al. 2009] sim(A, C) > sim(A, B) AND sim(A, C) > sim(B, C) constr. mlrDiag: weak Online Song excerpts from the Magnatune label • Derive binary rankings adaptation • 30 seconds long, can be divided into 4 broad categories: Alternative representation of constraints SVM: good on training data, bad generalisation ”electronica” (30%), ”classical” (28%), ”world” (15%) and Inconsistent constraints are removed (where clips are similar ”rock” (17%) and dissimilar at the same time) • Annotation data (user tags) and similarity ratings from the Feature dimension / PCA feature test human computation game „TagATune“ • Features reduced to 20 –dim using Principal Component Similarity Model and Adaptation Analysis (PCA) Features • Mahalanobis metric for measuring clip similarity: MLR: 77% The clips in our database are described using a combination of dW ( x , y ) = ( x − y ) T W ( x − y ) mlrDiag: 76% content-based and genre features: SVM: 76% Matrix W defines the similarity measure, clip feature Chroma and timbre features precomputed by “TheEchoNest” vectors x and y • Postprocessing: K-means: 4 clusters per clip and feature type, Generalised Euclidean metric 12-dim. chroma features are transposed to root note C allows for geometric interpretation 12-dim. timbre features are clipped psychological validity has been questioned Conclusion Both normalised to a maximum value of 1 • Similarity constraints contain generalisable information, • We compare two different algorithms for optimising dW which can be trained using the tested methods. 2-3 genres per clip are annotated in the Magnatune 1. MLR: [McFee and Lanckriet 2010] optimise a full W to • MLR works well on both feature types tested catalogue binary rankings • mlrDiag tradeoff for regularisation and constraints has to • Each clip is assigned a 44-dim. binary genre vector 1.1. mlrDiag: MLR variant restrained to a diagonal matrix W be investigated 2. SVM: [Schultz and Joachims 2003] optimise a weighted • Faster SVM works comparably well for low-dimensional Chroma and timbre centroid information and genre features are Euclidean metric using a diagonal matrix W feature space combined into one 148-dim. vector per clip For references and details, please ask or see our paper in the proceedings.