SlideShare a Scribd company logo
Understanding music with machine learning
From signal processing to representation learning
Jimena Royo-Letelier
Research Scientist
R&D team
WiMLDS Paris
November 23th, 2017
2
Deezer
● 180+ countries
● 40+ millions of track
● 16 millions active users
❖ 50+ curators
❖ Flow: continuous
recommender system
Data playground everywhere, many teams working on ML
> only 22% women in them
3
Automatic catalog structuration
R&D mission
analyze, describe and categorize Deezer catalog
4
An ever evolving, large catalog
● 40+ millions of track
● ~16 millions of artists
● 20k tracks ingested every day
● Audio Content
Music
roughly any commercial music ever recorded
any genre, period, quality, context
Speech
audiobooks / podcasts / radio shows / sport talks
● Usage
playlist
listening sessions
● Textual data
lyrics
biographies
reviews
● Images
album covers
artist photos
of heterogeneous multimodal data
Deezer
5
described primarily by metadata
but with inherent to music indexation problems
● cultural ambiguities
● missing / wrong / ambiguous metadata
with hidden musical relations
● covers / versions / samples / remixes
An ever evolving, large catalog
Deezer
6
R&D mission
analyze, describe and categorize Deezer catalog
(i.e.: a great part of music ever produced)
> automatic extraction of information & relations
> signal processing and machine learning (ML)
natural languages processing
image processing
audio signal processing
music information retrieval (MIR)
representation learning
Automatic catalog structuration
7
Music Information Retrieval
8
Music Information Retrieval
Main Idea
> Study of algorithms and system for extracting musical meaningful parameters from audio signals,
e.g.: beats, chords, structure, mood, etc.
Fourier analysis features engineering machine learning
audio
descriptors
(e.g.: chromagram)
raw audio
decision
system
(e.g.: SVM)
time-frequency
representation
(e.g.: spectrogram)
Principle
9
Music Information Retrieval
Main Idea
> Study of algorithms and system for extracting auditory meaningful parameters from audio signals
Nowadays end-to-end trend
audio
descriptors
(e.g.: chromagram)
raw audio
time-frequency
representation
machine (deep) learning
decision
system
10
Applications @ Deezer
multimodal (audio+lyrics) mood estimation
audio quality discrimination
Towards End-to-end Multimodal Music Mood Detection Based On Audio And Lyrics
R. Delbouys, et al. - Submitted
Music Information Retrieval
11
Representational Learning
12
Representation Learning
Main Idea
> Create a representation space (embedding) by directly learning a parametric map
from input to representation
> Learn a low dimensional space that represents high-level characteristics of audio
content in which proximity may be interpreted as audio similarity.
Principle
map f
(e.g.: convolutional/recurrent neural network)
x = low level feature embedding f(x)
R^d, d small
13
Application @ Deezer
> Artist * Disambiguation
Representation Learning
○ homonym artists (collision)
(same name, different artist)
○ artists unclear naming
(different name, same artist)
Bills Evans, jazz pianist
Bills Evans, jazz saxophonist
Inti-Illimani vs Inti Illimani
Youssou N’Dour vs Youssou Ndour
Cat Stevens vs Yusuf Islam
* not a completely well-posed notion
Metric learning for music artist disambiguation from audio
J. Royo-Letelier, et al. - Submitted
14
Triplet loss mechanism
L(X) =
with triplet X = (x*, x+, x-).
x* x*x+ x-
same artist
different artist from first two
Sample triplets X and leave out
not “hard” triplets: L(X) = 0
hard not-hard
Metric Learning
x = spectrogram f = convolutional neural network f(x)
15
“Do or Die” homonyme artists
● Embedding system learn how to discriminate artists.
● Artists characteristics learned by the system
generalize to data (tracks/albums/artist)
not seen during learning phase.
Qualitative results
Album level generalisation
Artist level generalisation
16
Take Away
❖ Large multimodal music related data (audio/text /images/usage)
❖ Great challenges in automatically understanding music
➢ Big scalability constraints
➢ Big variability constraints
➢ Hidden relations
➢ Possible ambiguities
❖ ML and MIR useful for many applications concerning music indexation and characterisation
❖ ML system are able to learn high-level characteristics of musical audio content
such artist membership
❖ Too bad no more women tackling these challenges !
> https://wimir.wordpress.com/mentoring-program/
17
Thanks !

More Related Content

What's hot

Anya- Ancillary task draft
Anya- Ancillary task draftAnya- Ancillary task draft
Anya- Ancillary task draft
Anya Szelewska
 
Art of Music Sampling
Art of Music SamplingArt of Music Sampling
Art of Music Sampling
Andrew Chando Yu
 
Jamzion prj mix solo video tapes [IDEA]
Jamzion prj mix solo video tapes [IDEA]Jamzion prj mix solo video tapes [IDEA]
Jamzion prj mix solo video tapes [IDEA]
David Buitrago Arenas
 
Celebrate Music - Yendif music share
Celebrate Music - Yendif music shareCelebrate Music - Yendif music share
Celebrate Music - Yendif music share
celyn thomas
 
Careers in disc jockeying
Careers in   disc jockeyingCareers in   disc jockeying
Careers in disc jockeying
entranzz123
 
Careers in disc jockeying
Careers in   disc jockeyingCareers in   disc jockeying
Careers in disc jockeying
entranzz123
 

What's hot (9)

Soundtracks
SoundtracksSoundtracks
Soundtracks
 
Anya- Ancillary task draft
Anya- Ancillary task draftAnya- Ancillary task draft
Anya- Ancillary task draft
 
Art of Music Sampling
Art of Music SamplingArt of Music Sampling
Art of Music Sampling
 
Jamzion prj mix solo video tapes [IDEA]
Jamzion prj mix solo video tapes [IDEA]Jamzion prj mix solo video tapes [IDEA]
Jamzion prj mix solo video tapes [IDEA]
 
Celebrate Music - Yendif music share
Celebrate Music - Yendif music shareCelebrate Music - Yendif music share
Celebrate Music - Yendif music share
 
CV Alexey Shmariovich
CV Alexey ShmariovichCV Alexey Shmariovich
CV Alexey Shmariovich
 
Careers in disc jockeying
Careers in   disc jockeyingCareers in   disc jockeying
Careers in disc jockeying
 
Careers in disc jockeying
Careers in   disc jockeyingCareers in   disc jockeying
Careers in disc jockeying
 
SWOT
SWOTSWOT
SWOT
 

Similar to Understang Music with Machine Learning by Jimena Royo-Letelier

Introduction to Music Information Retrieval
Introduction to Music Information RetrievalIntroduction to Music Information Retrieval
Introduction to Music Information Retrieval
Andrea Gazzarini
 
Introduction to Music Information Retrieval
Introduction to Music Information RetrievalIntroduction to Music Information Retrieval
Introduction to Music Information Retrieval
Sease
 
The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...
The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...
The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...
multimediaeval
 
Finbar m usic industry vision 2014
Finbar m usic industry vision 2014Finbar m usic industry vision 2014
Finbar m usic industry vision 2014Finbar O'Hanlon
 
Challenges and Opportunities in Digital Musicology
Challenges and Opportunities in Digital Musicology Challenges and Opportunities in Digital Musicology
Challenges and Opportunities in Digital Musicology
Eleanor Selfridge-Field
 
MLA Workshop: Cataloging Music Audiovisual Materials Using RDA
MLA Workshop: Cataloging Music Audiovisual Materials Using RDAMLA Workshop: Cataloging Music Audiovisual Materials Using RDA
MLA Workshop: Cataloging Music Audiovisual Materials Using RDA
ALATechSource
 
Machine Learning for Creative AI Applications in Music (2018 May)
Machine Learning for Creative AI Applications in Music (2018 May)Machine Learning for Creative AI Applications in Music (2018 May)
Machine Learning for Creative AI Applications in Music (2018 May)
Yi-Hsuan Yang
 
Denktank 2010
Denktank 2010Denktank 2010
Denktank 2010
ocor203
 
O.Ogunyemi MANUSCRIPT2016
O.Ogunyemi MANUSCRIPT2016O.Ogunyemi MANUSCRIPT2016
O.Ogunyemi MANUSCRIPT2016Titus Ogunyemi
 
Decoding Digital Audio: Visualizing and Annotating Linear Time-Based Media 2015
Decoding Digital Audio: Visualizing and Annotating Linear Time-Based Media 2015Decoding Digital Audio: Visualizing and Annotating Linear Time-Based Media 2015
Decoding Digital Audio: Visualizing and Annotating Linear Time-Based Media 2015
Philip Desenne
 
20211026 taicca 1 intro to mir
20211026 taicca 1 intro to mir20211026 taicca 1 intro to mir
20211026 taicca 1 intro to mir
Yi-Hsuan Yang
 
Musical Information Retrieval Take 2: Interval Hashing Based Ranking
Musical Information Retrieval Take 2: Interval Hashing Based RankingMusical Information Retrieval Take 2: Interval Hashing Based Ranking
Musical Information Retrieval Take 2: Interval Hashing Based Ranking
Sease
 
Interval Hashing Based Ranking
Interval Hashing Based RankingInterval Hashing Based Ranking
Interval Hashing Based Ranking
Andrea Gazzarini
 
Deep learning for audio-based music recommendation
Deep learning for audio-based music recommendationDeep learning for audio-based music recommendation
Deep learning for audio-based music recommendation
Russia.AI
 
J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...
J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...
J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...
MusicNet
 
Towards a musical Semantic Web
Towards a musical Semantic WebTowards a musical Semantic Web
Towards a musical Semantic Web
Yves Raimond
 
Thunderbolt paper
Thunderbolt paperThunderbolt paper
Thunderbolt paper
Manoj Manduva
 
Music Generation with Deep Learning
Music Generation with Deep LearningMusic Generation with Deep Learning
Music Generation with Deep Learning
Levi Guo
 

Similar to Understang Music with Machine Learning by Jimena Royo-Letelier (19)

Introduction to Music Information Retrieval
Introduction to Music Information RetrievalIntroduction to Music Information Retrieval
Introduction to Music Information Retrieval
 
Introduction to Music Information Retrieval
Introduction to Music Information RetrievalIntroduction to Music Information Retrieval
Introduction to Music Information Retrieval
 
The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...
The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...
The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...
 
Finbar m usic industry vision 2014
Finbar m usic industry vision 2014Finbar m usic industry vision 2014
Finbar m usic industry vision 2014
 
Challenges and Opportunities in Digital Musicology
Challenges and Opportunities in Digital Musicology Challenges and Opportunities in Digital Musicology
Challenges and Opportunities in Digital Musicology
 
MLA Workshop: Cataloging Music Audiovisual Materials Using RDA
MLA Workshop: Cataloging Music Audiovisual Materials Using RDAMLA Workshop: Cataloging Music Audiovisual Materials Using RDA
MLA Workshop: Cataloging Music Audiovisual Materials Using RDA
 
Machine Learning for Creative AI Applications in Music (2018 May)
Machine Learning for Creative AI Applications in Music (2018 May)Machine Learning for Creative AI Applications in Music (2018 May)
Machine Learning for Creative AI Applications in Music (2018 May)
 
Denktank 2010
Denktank 2010Denktank 2010
Denktank 2010
 
O.Ogunyemi MANUSCRIPT2016
O.Ogunyemi MANUSCRIPT2016O.Ogunyemi MANUSCRIPT2016
O.Ogunyemi MANUSCRIPT2016
 
Decoding Digital Audio: Visualizing and Annotating Linear Time-Based Media 2015
Decoding Digital Audio: Visualizing and Annotating Linear Time-Based Media 2015Decoding Digital Audio: Visualizing and Annotating Linear Time-Based Media 2015
Decoding Digital Audio: Visualizing and Annotating Linear Time-Based Media 2015
 
Mnri web page
Mnri web pageMnri web page
Mnri web page
 
20211026 taicca 1 intro to mir
20211026 taicca 1 intro to mir20211026 taicca 1 intro to mir
20211026 taicca 1 intro to mir
 
Musical Information Retrieval Take 2: Interval Hashing Based Ranking
Musical Information Retrieval Take 2: Interval Hashing Based RankingMusical Information Retrieval Take 2: Interval Hashing Based Ranking
Musical Information Retrieval Take 2: Interval Hashing Based Ranking
 
Interval Hashing Based Ranking
Interval Hashing Based RankingInterval Hashing Based Ranking
Interval Hashing Based Ranking
 
Deep learning for audio-based music recommendation
Deep learning for audio-based music recommendationDeep learning for audio-based music recommendation
Deep learning for audio-based music recommendation
 
J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...
J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...
J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...
 
Towards a musical Semantic Web
Towards a musical Semantic WebTowards a musical Semantic Web
Towards a musical Semantic Web
 
Thunderbolt paper
Thunderbolt paperThunderbolt paper
Thunderbolt paper
 
Music Generation with Deep Learning
Music Generation with Deep LearningMusic Generation with Deep Learning
Music Generation with Deep Learning
 

More from Paris Women in Machine Learning and Data Science

Sequential and reinforcement learning for demand side management by Margaux B...
Sequential and reinforcement learning for demand side management by Margaux B...Sequential and reinforcement learning for demand side management by Margaux B...
Sequential and reinforcement learning for demand side management by Margaux B...
Paris Women in Machine Learning and Data Science
 
How and why AI should fight cybersexism, by Chloe Daudier
How and why AI should fight cybersexism, by Chloe DaudierHow and why AI should fight cybersexism, by Chloe Daudier
How and why AI should fight cybersexism, by Chloe Daudier
Paris Women in Machine Learning and Data Science
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
Paris Women in Machine Learning and Data Science
 
Managing international tech teams, by Natasha Dimban
Managing international tech teams, by Natasha DimbanManaging international tech teams, by Natasha Dimban
Managing international tech teams, by Natasha Dimban
Paris Women in Machine Learning and Data Science
 
Optimizing GenAI apps, by N. El Mawass and Maria Knorps
Optimizing GenAI apps, by N. El Mawass and Maria KnorpsOptimizing GenAI apps, by N. El Mawass and Maria Knorps
Optimizing GenAI apps, by N. El Mawass and Maria Knorps
Paris Women in Machine Learning and Data Science
 
Perspectives, by M. Pannegeon
Perspectives, by M. PannegeonPerspectives, by M. Pannegeon
Evaluation strategies for dealing with partially labelled or unlabelled data
Evaluation strategies for dealing with partially labelled or unlabelled dataEvaluation strategies for dealing with partially labelled or unlabelled data
Evaluation strategies for dealing with partially labelled or unlabelled data
Paris Women in Machine Learning and Data Science
 
Combinatorial Optimisation with Policy Adaptation using latent Space Search, ...
Combinatorial Optimisation with Policy Adaptation using latent Space Search, ...Combinatorial Optimisation with Policy Adaptation using latent Space Search, ...
Combinatorial Optimisation with Policy Adaptation using latent Space Search, ...
Paris Women in Machine Learning and Data Science
 
An age-old question, by Caroline Jean-Pierre
An age-old question, by Caroline Jean-PierreAn age-old question, by Caroline Jean-Pierre
An age-old question, by Caroline Jean-Pierre
Paris Women in Machine Learning and Data Science
 
Applying Churn Prediction Approaches to the Telecom Industry, by Joëlle Lautré
Applying Churn Prediction Approaches to the Telecom Industry, by Joëlle LautréApplying Churn Prediction Approaches to the Telecom Industry, by Joëlle Lautré
Applying Churn Prediction Approaches to the Telecom Industry, by Joëlle Lautré
Paris Women in Machine Learning and Data Science
 
How to supervise a thesis in NLP in the ChatGPT era? By Laure Soulier
How to supervise a thesis in NLP in the ChatGPT era? By Laure SoulierHow to supervise a thesis in NLP in the ChatGPT era? By Laure Soulier
How to supervise a thesis in NLP in the ChatGPT era? By Laure Soulier
Paris Women in Machine Learning and Data Science
 
Global Ambitions Local Realities, by Anna Abreu
Global Ambitions Local Realities, by Anna AbreuGlobal Ambitions Local Realities, by Anna Abreu
Global Ambitions Local Realities, by Anna Abreu
Paris Women in Machine Learning and Data Science
 
Plug-and-Play methods for inverse problems in imagine, by Julie Delon
Plug-and-Play methods for inverse problems in imagine, by Julie DelonPlug-and-Play methods for inverse problems in imagine, by Julie Delon
Plug-and-Play methods for inverse problems in imagine, by Julie Delon
Paris Women in Machine Learning and Data Science
 
Sales Forecasting as a Data Product by Francesca Iannuzzi
Sales Forecasting as a Data Product by Francesca IannuzziSales Forecasting as a Data Product by Francesca Iannuzzi
Sales Forecasting as a Data Product by Francesca Iannuzzi
Paris Women in Machine Learning and Data Science
 
Identifying and mitigating bias in machine learning, by Ruta Binkyte
Identifying and mitigating bias in machine learning, by Ruta BinkyteIdentifying and mitigating bias in machine learning, by Ruta Binkyte
Identifying and mitigating bias in machine learning, by Ruta Binkyte
Paris Women in Machine Learning and Data Science
 
“Turning your ML algorithms into full web apps in no time with Python" by Mar...
“Turning your ML algorithms into full web apps in no time with Python" by Mar...“Turning your ML algorithms into full web apps in no time with Python" by Mar...
“Turning your ML algorithms into full web apps in no time with Python" by Mar...
Paris Women in Machine Learning and Data Science
 
Nature Language Processing for proteins by Amélie Héliou, Software Engineer @...
Nature Language Processing for proteins by Amélie Héliou, Software Engineer @...Nature Language Processing for proteins by Amélie Héliou, Software Engineer @...
Nature Language Processing for proteins by Amélie Héliou, Software Engineer @...
Paris Women in Machine Learning and Data Science
 
Sandrine Henry presents the BechdelAI project
Sandrine Henry presents the BechdelAI projectSandrine Henry presents the BechdelAI project
Sandrine Henry presents the BechdelAI project
Paris Women in Machine Learning and Data Science
 
Anastasiia Tryputen_War in Ukraine or how extraordinary courage reshapes geop...
Anastasiia Tryputen_War in Ukraine or how extraordinary courage reshapes geop...Anastasiia Tryputen_War in Ukraine or how extraordinary courage reshapes geop...
Anastasiia Tryputen_War in Ukraine or how extraordinary courage reshapes geop...
Paris Women in Machine Learning and Data Science
 
Khrystyna Grynko WiMLDS - From marketing to Tech.pdf
Khrystyna Grynko WiMLDS - From marketing to Tech.pdfKhrystyna Grynko WiMLDS - From marketing to Tech.pdf
Khrystyna Grynko WiMLDS - From marketing to Tech.pdf
Paris Women in Machine Learning and Data Science
 

More from Paris Women in Machine Learning and Data Science (20)

Sequential and reinforcement learning for demand side management by Margaux B...
Sequential and reinforcement learning for demand side management by Margaux B...Sequential and reinforcement learning for demand side management by Margaux B...
Sequential and reinforcement learning for demand side management by Margaux B...
 
How and why AI should fight cybersexism, by Chloe Daudier
How and why AI should fight cybersexism, by Chloe DaudierHow and why AI should fight cybersexism, by Chloe Daudier
How and why AI should fight cybersexism, by Chloe Daudier
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Managing international tech teams, by Natasha Dimban
Managing international tech teams, by Natasha DimbanManaging international tech teams, by Natasha Dimban
Managing international tech teams, by Natasha Dimban
 
Optimizing GenAI apps, by N. El Mawass and Maria Knorps
Optimizing GenAI apps, by N. El Mawass and Maria KnorpsOptimizing GenAI apps, by N. El Mawass and Maria Knorps
Optimizing GenAI apps, by N. El Mawass and Maria Knorps
 
Perspectives, by M. Pannegeon
Perspectives, by M. PannegeonPerspectives, by M. Pannegeon
Perspectives, by M. Pannegeon
 
Evaluation strategies for dealing with partially labelled or unlabelled data
Evaluation strategies for dealing with partially labelled or unlabelled dataEvaluation strategies for dealing with partially labelled or unlabelled data
Evaluation strategies for dealing with partially labelled or unlabelled data
 
Combinatorial Optimisation with Policy Adaptation using latent Space Search, ...
Combinatorial Optimisation with Policy Adaptation using latent Space Search, ...Combinatorial Optimisation with Policy Adaptation using latent Space Search, ...
Combinatorial Optimisation with Policy Adaptation using latent Space Search, ...
 
An age-old question, by Caroline Jean-Pierre
An age-old question, by Caroline Jean-PierreAn age-old question, by Caroline Jean-Pierre
An age-old question, by Caroline Jean-Pierre
 
Applying Churn Prediction Approaches to the Telecom Industry, by Joëlle Lautré
Applying Churn Prediction Approaches to the Telecom Industry, by Joëlle LautréApplying Churn Prediction Approaches to the Telecom Industry, by Joëlle Lautré
Applying Churn Prediction Approaches to the Telecom Industry, by Joëlle Lautré
 
How to supervise a thesis in NLP in the ChatGPT era? By Laure Soulier
How to supervise a thesis in NLP in the ChatGPT era? By Laure SoulierHow to supervise a thesis in NLP in the ChatGPT era? By Laure Soulier
How to supervise a thesis in NLP in the ChatGPT era? By Laure Soulier
 
Global Ambitions Local Realities, by Anna Abreu
Global Ambitions Local Realities, by Anna AbreuGlobal Ambitions Local Realities, by Anna Abreu
Global Ambitions Local Realities, by Anna Abreu
 
Plug-and-Play methods for inverse problems in imagine, by Julie Delon
Plug-and-Play methods for inverse problems in imagine, by Julie DelonPlug-and-Play methods for inverse problems in imagine, by Julie Delon
Plug-and-Play methods for inverse problems in imagine, by Julie Delon
 
Sales Forecasting as a Data Product by Francesca Iannuzzi
Sales Forecasting as a Data Product by Francesca IannuzziSales Forecasting as a Data Product by Francesca Iannuzzi
Sales Forecasting as a Data Product by Francesca Iannuzzi
 
Identifying and mitigating bias in machine learning, by Ruta Binkyte
Identifying and mitigating bias in machine learning, by Ruta BinkyteIdentifying and mitigating bias in machine learning, by Ruta Binkyte
Identifying and mitigating bias in machine learning, by Ruta Binkyte
 
“Turning your ML algorithms into full web apps in no time with Python" by Mar...
“Turning your ML algorithms into full web apps in no time with Python" by Mar...“Turning your ML algorithms into full web apps in no time with Python" by Mar...
“Turning your ML algorithms into full web apps in no time with Python" by Mar...
 
Nature Language Processing for proteins by Amélie Héliou, Software Engineer @...
Nature Language Processing for proteins by Amélie Héliou, Software Engineer @...Nature Language Processing for proteins by Amélie Héliou, Software Engineer @...
Nature Language Processing for proteins by Amélie Héliou, Software Engineer @...
 
Sandrine Henry presents the BechdelAI project
Sandrine Henry presents the BechdelAI projectSandrine Henry presents the BechdelAI project
Sandrine Henry presents the BechdelAI project
 
Anastasiia Tryputen_War in Ukraine or how extraordinary courage reshapes geop...
Anastasiia Tryputen_War in Ukraine or how extraordinary courage reshapes geop...Anastasiia Tryputen_War in Ukraine or how extraordinary courage reshapes geop...
Anastasiia Tryputen_War in Ukraine or how extraordinary courage reshapes geop...
 
Khrystyna Grynko WiMLDS - From marketing to Tech.pdf
Khrystyna Grynko WiMLDS - From marketing to Tech.pdfKhrystyna Grynko WiMLDS - From marketing to Tech.pdf
Khrystyna Grynko WiMLDS - From marketing to Tech.pdf
 

Recently uploaded

Tutorial for 16S rRNA Gene Analysis with QIIME2.pdf
Tutorial for 16S rRNA Gene Analysis with QIIME2.pdfTutorial for 16S rRNA Gene Analysis with QIIME2.pdf
Tutorial for 16S rRNA Gene Analysis with QIIME2.pdf
aqil azizi
 
Forklift Classes Overview by Intella Parts
Forklift Classes Overview by Intella PartsForklift Classes Overview by Intella Parts
Forklift Classes Overview by Intella Parts
Intella Parts
 
Unbalanced Three Phase Systems and circuits.pptx
Unbalanced Three Phase Systems and circuits.pptxUnbalanced Three Phase Systems and circuits.pptx
Unbalanced Three Phase Systems and circuits.pptx
ChristineTorrepenida1
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
Kamal Acharya
 
CME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional ElectiveCME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional Elective
karthi keyan
 
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
ydteq
 
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdfGoverning Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
WENKENLI1
 
Basic Industrial Engineering terms for apparel
Basic Industrial Engineering terms for apparelBasic Industrial Engineering terms for apparel
Basic Industrial Engineering terms for apparel
top1002
 
14 Template Contractual Notice - EOT Application
14 Template Contractual Notice - EOT Application14 Template Contractual Notice - EOT Application
14 Template Contractual Notice - EOT Application
SyedAbiiAzazi1
 
Railway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdfRailway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdf
TeeVichai
 
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdfTop 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Teleport Manpower Consultant
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
bakpo1
 
space technology lecture notes on satellite
space technology lecture notes on satellitespace technology lecture notes on satellite
space technology lecture notes on satellite
ongomchris
 
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
obonagu
 
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTSHeap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Soumen Santra
 
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
Amil Baba Dawood bangali
 
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&BDesign and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Sreedhar Chowdam
 
English lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdfEnglish lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdf
BrazilAccount1
 
DESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docxDESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docx
FluxPrime1
 
road safety engineering r s e unit 3.pdf
road safety engineering  r s e unit 3.pdfroad safety engineering  r s e unit 3.pdf
road safety engineering r s e unit 3.pdf
VENKATESHvenky89705
 

Recently uploaded (20)

Tutorial for 16S rRNA Gene Analysis with QIIME2.pdf
Tutorial for 16S rRNA Gene Analysis with QIIME2.pdfTutorial for 16S rRNA Gene Analysis with QIIME2.pdf
Tutorial for 16S rRNA Gene Analysis with QIIME2.pdf
 
Forklift Classes Overview by Intella Parts
Forklift Classes Overview by Intella PartsForklift Classes Overview by Intella Parts
Forklift Classes Overview by Intella Parts
 
Unbalanced Three Phase Systems and circuits.pptx
Unbalanced Three Phase Systems and circuits.pptxUnbalanced Three Phase Systems and circuits.pptx
Unbalanced Three Phase Systems and circuits.pptx
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
 
CME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional ElectiveCME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional Elective
 
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
 
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdfGoverning Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
 
Basic Industrial Engineering terms for apparel
Basic Industrial Engineering terms for apparelBasic Industrial Engineering terms for apparel
Basic Industrial Engineering terms for apparel
 
14 Template Contractual Notice - EOT Application
14 Template Contractual Notice - EOT Application14 Template Contractual Notice - EOT Application
14 Template Contractual Notice - EOT Application
 
Railway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdfRailway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdf
 
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdfTop 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
 
space technology lecture notes on satellite
space technology lecture notes on satellitespace technology lecture notes on satellite
space technology lecture notes on satellite
 
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
 
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTSHeap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
 
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
 
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&BDesign and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
 
English lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdfEnglish lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdf
 
DESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docxDESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docx
 
road safety engineering r s e unit 3.pdf
road safety engineering  r s e unit 3.pdfroad safety engineering  r s e unit 3.pdf
road safety engineering r s e unit 3.pdf
 

Understang Music with Machine Learning by Jimena Royo-Letelier

  • 1. Understanding music with machine learning From signal processing to representation learning Jimena Royo-Letelier Research Scientist R&D team WiMLDS Paris November 23th, 2017
  • 2. 2 Deezer ● 180+ countries ● 40+ millions of track ● 16 millions active users ❖ 50+ curators ❖ Flow: continuous recommender system Data playground everywhere, many teams working on ML > only 22% women in them
  • 3. 3 Automatic catalog structuration R&D mission analyze, describe and categorize Deezer catalog
  • 4. 4 An ever evolving, large catalog ● 40+ millions of track ● ~16 millions of artists ● 20k tracks ingested every day ● Audio Content Music roughly any commercial music ever recorded any genre, period, quality, context Speech audiobooks / podcasts / radio shows / sport talks ● Usage playlist listening sessions ● Textual data lyrics biographies reviews ● Images album covers artist photos of heterogeneous multimodal data Deezer
  • 5. 5 described primarily by metadata but with inherent to music indexation problems ● cultural ambiguities ● missing / wrong / ambiguous metadata with hidden musical relations ● covers / versions / samples / remixes An ever evolving, large catalog Deezer
  • 6. 6 R&D mission analyze, describe and categorize Deezer catalog (i.e.: a great part of music ever produced) > automatic extraction of information & relations > signal processing and machine learning (ML) natural languages processing image processing audio signal processing music information retrieval (MIR) representation learning Automatic catalog structuration
  • 8. 8 Music Information Retrieval Main Idea > Study of algorithms and system for extracting musical meaningful parameters from audio signals, e.g.: beats, chords, structure, mood, etc. Fourier analysis features engineering machine learning audio descriptors (e.g.: chromagram) raw audio decision system (e.g.: SVM) time-frequency representation (e.g.: spectrogram) Principle
  • 9. 9 Music Information Retrieval Main Idea > Study of algorithms and system for extracting auditory meaningful parameters from audio signals Nowadays end-to-end trend audio descriptors (e.g.: chromagram) raw audio time-frequency representation machine (deep) learning decision system
  • 10. 10 Applications @ Deezer multimodal (audio+lyrics) mood estimation audio quality discrimination Towards End-to-end Multimodal Music Mood Detection Based On Audio And Lyrics R. Delbouys, et al. - Submitted Music Information Retrieval
  • 12. 12 Representation Learning Main Idea > Create a representation space (embedding) by directly learning a parametric map from input to representation > Learn a low dimensional space that represents high-level characteristics of audio content in which proximity may be interpreted as audio similarity. Principle map f (e.g.: convolutional/recurrent neural network) x = low level feature embedding f(x) R^d, d small
  • 13. 13 Application @ Deezer > Artist * Disambiguation Representation Learning ○ homonym artists (collision) (same name, different artist) ○ artists unclear naming (different name, same artist) Bills Evans, jazz pianist Bills Evans, jazz saxophonist Inti-Illimani vs Inti Illimani Youssou N’Dour vs Youssou Ndour Cat Stevens vs Yusuf Islam * not a completely well-posed notion Metric learning for music artist disambiguation from audio J. Royo-Letelier, et al. - Submitted
  • 14. 14 Triplet loss mechanism L(X) = with triplet X = (x*, x+, x-). x* x*x+ x- same artist different artist from first two Sample triplets X and leave out not “hard” triplets: L(X) = 0 hard not-hard Metric Learning x = spectrogram f = convolutional neural network f(x)
  • 15. 15 “Do or Die” homonyme artists ● Embedding system learn how to discriminate artists. ● Artists characteristics learned by the system generalize to data (tracks/albums/artist) not seen during learning phase. Qualitative results Album level generalisation Artist level generalisation
  • 16. 16 Take Away ❖ Large multimodal music related data (audio/text /images/usage) ❖ Great challenges in automatically understanding music ➢ Big scalability constraints ➢ Big variability constraints ➢ Hidden relations ➢ Possible ambiguities ❖ ML and MIR useful for many applications concerning music indexation and characterisation ❖ ML system are able to learn high-level characteristics of musical audio content such artist membership ❖ Too bad no more women tackling these challenges ! > https://wimir.wordpress.com/mentoring-program/