SlideShare a Scribd company logo
CSTalks

   Similarity Measures in Music Information
               Retrieval Systems




             Speaker:    Zhonghua Li
             Supervisor:    Ye Wang
           lizhongh@comp.nus.edu.sg
                                              1
Where do you search for music?
 Music is an important part of our way of life.
 Music stores
 Online music




                                  Mufin




                                                   2
3
Outline
 Music Information Retrieval (MIR)
   Definition
   Applications – Search
   Applications -- Recommendation
 Similarity Measure Methods
   Text-based Method
   Audio Feature-based Method
   Semantic Concept-based Method
   Multimodal Fusion Method
 Conclusion and Future Directions


                                      4
Definition
 Music Information Retrieval
   Is the process of searching for, and finding, music objects, or
    part of objects, via a query framed musically and/or in musical
    terms.
     Music objects: Recordings (wav, mp3, etc.), scores, parts, etc.
     Musically framed query: Singing, humming, keyboad, notation-based,
      MIDI files, sound files, etc.
     Music terms: Genre, style, tempo, bibliography, etc.
   Applications
     Music search, recommendation, identification ,etc.




                                                                           5
Outline
 Music Information Retrieval (MIR)
   Definition
   Applications – Search
   Applications -- Recommendation
 Music Similarity Measure
   Text-based Method
   Audio Feature-based Method
   Semantic Concept-based Method
   Multimodal Fusion Method
 Conclusion and Future Directions


                                      6
Applications -- Search
 Text-based Music Search
   Compare a textual query with the metadata
   Is adopted by most existing systems.
   Examples: Last.fm, Musicovery, …




                                                7
Applications -- Search
 Content-based Music Search
   Compare audio query with audio content
   Query-by-humming/singing/recording: midomi




                                                 8
Applications -- Search
 Content-based Music Search
   Compare rhythm tapped with audio content
   Query-by-tapping: SongTapper




                                               9
Applications -- Search
 User’s information need                                            Online     Offline


  (intention) :                                          Query


  Explicit Query: text, audio, etc.    Intention Gap
                                                                   Query
                                                                 Formation


 Similarity measure:                                  Descriptors                         Documents


  Query  Music documents in the                                 Match
                                                                                                   Descriptor
                                                                                                   Extraction
   Database                                                                     Indexing
                                       Semantic Gap      Index                             Descriptors
 Ranking: relevant documents by
  domain specific criterions (no. of                             Ranking


  hits ).                                              Ranked List


                                                                 Presentation


                                                         Results



                                                                                                         10
Outline
 Music Information Retrieval (MIR)
   Definition
   Applications – Search
   Applications -- Recommendation
 Similarity Measure Methods
   Text-based Method
   Audio Feature-based Method
   Semantic Concept-based Method
   Multimodal Fusion Method
 Conclusion and Future Directions


                                      11
Applications – Recommendation
 Collaborative-Filtering-based Recommendation
   Last.fm: what you (and others ) listen to and like,
   Amazon: customers who shopped for … also shopped for …




                                                             12
Applications – Recommendation
 Collaborative-Filtering-based Recommendation
   Last.fm: what you (and others ) listen to and like,
   Amazon: customers who shopped for … also shopped for …
   Example:
     Users: A, B, and C
     Music: 1, 2, …, 8

                          Small similarity               Large similarity
                  C                              A                      B

                                                          Similarity
                  4
                             Similarity      1            Measure
                                                                       1
                             Measure
                                                 2                          2
                      6                              3                          3
              8                                  4                     5

                                                         Recommend to user A
                                                                                    13
Applications -- Recommendation
 Audio Content-based Recommendation
  Recommend songs which have similar audio content to the
   songs that you like.
  Pandora:
                                       Music database              Music Experts
               User
                                                          Listen



                                          Instrument:
       Instrument:        Similarity        Instrument:
                                          Vocal:
       Vocal:                               Vocal:
                                          Structure:
                          Measure
       Structure:                         …Structure:
       …                                    …
                                       400 Attributes/song

                     Recommendations
                                                                                   14
Applications -- Recommendation
 User’s information need                              Online / Offline        Offline


  (intention):                                       User Profile


  Implicit user profiles: ratings,   Intention Gap
                                                                  Profile
                                                                 Capture
    listening history, etc.
                                                     Descriptors                          Documents
 Similarity measure:
                                                                                                  Descriptor
  User profiles Music                                          Match                             Extraction

   documents/other user profiles     Semantic Gap       Index
                                                                               Indexing
                                                                                          Descriptors


 Ranking: relevant documents                                   Ranking

  by some domain specific
                                                     Ranked List
  criterions (no. of hits).
                                                                Presentation


                                                       Results


                                                                                                        15
Similarity Measure
 one of the most fundamental concepts in MIR
                                                   Online / Offline        Offline
 Closely related to
                                                 User Profile
                                                    /query
   What information music                                    Profile/query
                                 Intention Gap
                                                             Capture
    contains.
   How this information is
                                                 Descriptors                          Documents



   represented.                                             Match
                                                                                              Descriptor
                                                                                              Extraction


   How to match between themSemantic Gap
                                                                           Indexing
                                                    Index                             Descriptors


                                                            Ranking


                                                 Ranked List


                                                            Presentation


                                                   Results

                                                                                                    16
Music Information Plane
                              Similarity can be measure
                              from different aspects.

                                         Song1: New favorite -
                                         Alison Krauss
                                         Song2: She is Beautiful -
                                         Andrew W.K.
                                                Song1          Song2
                                                Female         Male
                          Dissimilar            Gentle         Aggressive
                                                Slow           fast

                                                   Guitar
                          Similar                  Tempo: ~162 BPM
                                                   (Beat Per Minute)

                  * O. C. Herrada. Music recommendation and discovery in
       Music      the long tail. PhD thsis. 2008.                           17
Outline
 Music Information Retrieval (MIR)
   Definition
   Applications – Search
   Applications -- Recommendation
 Similarity Measure Methods
   Text-based Method
   Audio Feature-based Method
   Semantic Concept-based Method
   Multimodal Fusion Method
 Conclusion and Future Directions


                                      18
Similarity Measure Methods
 Text-based Method: Okapi BM-25 Ranking
   Given: queryQ, containing keywords q1, …, qn, music documents: bag of words.
   BM25 ranking function can be formulated as:




   f(qi, D) is qi’s term frequency (tf) in document D.
   |D| is the length of document D in words.
   avgdl is the average document length in the collection.
   k1 and b are free parameters, usually set as k1=2.0 and b=0.75.
   IDF(qi) is the inverse document frequency (idf), calculated as:



. The query term appears in this document frequently. f (qi, D)
. And it doesn’t appear in other document. IDF
                                                                                   19
Similarity Measure Methods
 Text-based Method:
   Pros
     Simple & efficient
   Cons
     Affected by noisy/wrong texts
     Songs with no text cannot be retrieved
     Require high-level domain knowledge to create good metadata
     “Text retrieval on audio metadata” not pure music retrieval




                                                                    20
Outline
 Music Information Retrieval (MIR)
   Definition
   Applications – Search
   Applications -- Recommendation
 Similarity Measure Methods
   Text-based Method
   Audio Feature-based Method
   Semantic Concept-based Method
   Multimodal Fusion Method
 Conclusion and Future Directions


                                      21
Similarity Measure Methods
 Audio Feature-based Method




           Audio feature   Distribution     Model
  Music
            extraction      modeling      comparison




                                                       22
Similarity Measure Methods
 Audio Feature-based Method
   Audio Feature extraction -> Distribution modeling -> Model Comparison




                                                                    frame



            Feature
            Vector             …


                                                                            23
Existing Works
 Audio Feature-based Method
   Audio Feature extraction  Distribution modeling  Model Comparison

     Use low-level feature directly
          Pitch, loudness, MFCC (Blum et al.[3], 1999)
          Histogram of MFCC (Foote[4], 1997)
          Spectrum, rhythm, chord changesingleVector (Tzanetakis [5], 2002)

     Low-level features  higher-level features.
        Cluster MFCC=>model comparison (Aucouturier[6], 2002)
        MFCC => Gaussian Mixture Models => model comparison
        MFCC =>“anchor space”, compare probability models (Berenzweig et al.
         [7], 2003)


                                                                                24
Similarity Measure Methods
 Audio Feature-based Method
  Audio Feature extraction  Distribution modeling  Model Comparison

    Euclidean /Cosine distance (uniform-length feature vectors)


    Distance between two probability distributions
         Kullback-Leibler divergence (KL Distance) / relative entropy

    No closed form for Gaussian Distributions
       Centroid distance: Euclidean distance between the overall means;
       Sampling based method: compute the likelihood of one model given points
        sampled from another; very computationally expensive;
       Earth-Mover’s distance


              Berenzweig, A., Logan, B., Ellis, D. P., and Whitman, B. P., A Large-Scale Evaluation of Acoustic and Subjective Music-Similarity
              Measures. Computer Music Journal. 28, 2004,                                                                                         25
Similarity Measure Methods
 Audio Feature-based Method
   Pros
     Can deal with new songs with no or few texts.
     Save human labors from annotating each song manually
   Cons
     Time complexity is relatively high.
     Features ≠ audio piece: Two songs with very similar features may sounds
      very different.
     The average performance is reaching the glass ceiling of around 65% in
      accuracy.




                                                                                26
Outline
 Music Information Retrieval (MIR)
   Definition
   Applications – Search
   Applications -- Recommendation
 Similarity Measure Methods
   Text-based Method
   Audio Feature-based Method
   Semantic Concept-based Method
   Multimodal Fusion Method
 Conclusion and Future Directions


                                      27
Similarity Measure Methods
 Semantic Concept-based Method
   Nature of user queries
     Far beyond of bibliographic text and audio search
     Semantically-rich
     Syntactically- undetermined
     e.g.: “Find me a classical and happy song”, or “Find me a song to relax”
    “Find me some songs for parties/ weddings/ in churches” …
   Collaborative(social) tagging is very popular on Web 2.0.
    Users annotate their feelings or opinions to the music. Tags,
    comments, etc.



                                                                                28
Similarity Measure Methods
 Semantic Concept-based Method
     Tags VS user queries (Last.fm)
Tag Type            Frequency                          Multi-tag search queries
Genre               68%                                51%
Locale              12%                                7%
Mood                5%                                 4%
Opinion             4%                                 2%
Instrumentation     4%                                 5%
Style               3%                                 26%



                    . Paul Lamere. Social tagging and music information
                    retrieval. Journal of New Music Research. 2008.
                    . Klaas Bosteels, Elias Pampalk, and Etienne E. Kerre. Music
                    retrieval based on social tags: a case study. ISMIR, 2008.     29
Similarity Measure Methods
    Vocabulary: classical, jazz, … piano, violin, …, female, male, …


                                         …

              Model         Model          …               Model


                                             …                         Probability vector
Song1                                                                     Similarity
                                             …
Song2



                                                                                            30
Outline
 Music Information Retrieval (MIR)
   Definition
   Applications – Search
   Applications -- Recommendation
 Similarity Measure Methods
   Text-based Method
   Audio Feature-based Method
   Semantic Concept-based Method
   Multimodal Fusion Method
 Conclusion and Future Directions


                                      31
Similarity Measure Methods
 Multimodal Method
  Information keeps growing.
  One of the most important ongoing trends:

             Metadata


        Audio       Semantic
       Content      Concept


    Users are important.



                                               32
Similarity Measure Methods
 Multimodal Method
                                                                             Document Vectors



 Customization




                                                                  Fuzzy Music Semantic
                                                                  Vector
                 B. Zhang, J. Shen, Q. Xiang, and Y. Wang. CompositMap: a novel framework for music similarity measure. ACM Multimedia,
                 2009.                                                                                                                    33
Outline
 Music Information Retrieval (MIR)
   Definition
   Applications – Search
   Applications -- Recommendation
 Similarity Measure Methods
   Text-based Method
   Audio Feature-based Method
   Semantic Concept-based Method
   Multimodal Fusion Method
 Conclusion and Future Directions


                                      34
Conclusion and Future Directions
 What makes MIR (and the similarity measure) so
 tricky?
  Music information is
   Multimodal: audio, metadata, social , …
   Multicultural: e.g., modern art, Indian ragas, …
   Multirepresentational: audio, MIDI, score, …
   Multifaceted: melody, tempo, beat, …
  …
 Similarity can be measured from different aspects.



                                                       35
Conclusions and Future Directions
 What do users really want?
                                                   Intention Gap
   User interactions with the system.
   Learn a good user preference modeling


 What kind of music features can really capture this need?
   Content –Tags                                  Semantic Gap
   Leverage more social data? Comments, ratings, groups, playlist, other
    user created information, …


 How to fuse multiple information effectively?
   Identify the relevant/discriminative information aspects
   Fusion Methods


                                                                            36
37
References
 [2] F[1] O. C. Herrada. Music recommendation and discovery in the long
    tail. PhD thsis. 2008.
   . Pachet. Knowledge management and musical metadata. Encyclopedia of
    Knowledge Management. Idea Group, 2005.
   [3] T. L. Blum, D. F. Keislar, J. A. Wheaton, and E. H. Wold. Method and article
    of manufacture for content-based analysis, storage, retrieval, and
    segmentation of audio information. U.S. Patent 5, 918, 223.
   [4] J. T. Foote. Content-based retrieval of music and audio. SPIE, 1997.
   [5] G. Tzanetakis. Manipulation, analysis, and retrieval system for audio
    signals. PhD thsis, 2002.
   [6] J. J. Aucouturier and F. Pachet. Music similarity measure: What’s the use?
    International Symposium on Music information retrieval. 2002.
   [7] A. Berenzweig, D. P. W. Ellis and S. Lawrence. Anchor space for
    classification and similarity measurement for music. ICME 2003.




                                                                                       38
References
 [8] B. Zhang, J. Shen, Q. Xiang and Y. Wang. CompositeMap: a
    novel framework for music similarity measure. SIGIR, 2009.
   [9] B. Whiteman and S. Lawrence. Inferring descriptions and
    similarity for music from community metadata. International
    computer music conference. 2002.
   [10] M. Schedl, T. Pohle, P. Knees and G. Widmer. Assigning
    and visualizing music genre by web-based co-occurrence
    analysis. ISMIR 2006.
   [11] B. Whitman and Paris Smaragdis. Combining musical and
    cultural features for intelligent style detection. ISMIR 2002.
   [12] L. Chen, P. Wright, and W. Nejdl. Improving music genre
    classification using collaborative tagging data. WSDM, 2009.
   [13] Benedikt Raes. Automatic generation of music metadata.
    ISMIR, 2009.


                                                                     39

More Related Content

Similar to CSTalks - Music Information Retrieval - 23 Feb

Mehfil : Song Recommendation System Using Sentiment Detected
Mehfil : Song Recommendation System Using Sentiment DetectedMehfil : Song Recommendation System Using Sentiment Detected
Mehfil : Song Recommendation System Using Sentiment Detected
IRJET Journal
 
Music recognition
Music recognition Music recognition
Music recognition
aaronloklok
 
Knn a machine learning approach to recognize a musical instrument
Knn  a machine learning approach to recognize a musical instrumentKnn  a machine learning approach to recognize a musical instrument
Knn a machine learning approach to recognize a musical instrument
IJARIIT
 
Music data mining
Music  data miningMusic  data mining
Music data mining
Mihir Rathod
 
Music Personalization At Spotify
Music Personalization At SpotifyMusic Personalization At Spotify
Music Personalization At Spotify
Vidhya Murali
 
survey on Hybrid recommendation mechanism to get effective ranking results fo...
survey on Hybrid recommendation mechanism to get effective ranking results fo...survey on Hybrid recommendation mechanism to get effective ranking results fo...
survey on Hybrid recommendation mechanism to get effective ranking results fo...
Suraj Ligade
 
J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...
J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...
J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...
MusicNet
 
Music Recommendation System using Euclidean, Cosine Similarity, Correlation D...
Music Recommendation System using Euclidean, Cosine Similarity, Correlation D...Music Recommendation System using Euclidean, Cosine Similarity, Correlation D...
Music Recommendation System using Euclidean, Cosine Similarity, Correlation D...
IRJET Journal
 
1. paul dwyer internet radio emma
1. paul dwyer internet radio emma1. paul dwyer internet radio emma
1. paul dwyer internet radio emma
Paul Dwyer
 
Music discovery on the net
Music discovery on the netMusic discovery on the net
Music discovery on the net
guestbf080
 
Emofy
Emofy Emofy
Multi modal music mood classification
Multi modal music mood classificationMulti modal music mood classification
Multi modal music mood classification
xiaohusmile
 
Echoes, Whispers, and Footsteps from the Conflux of Sonic Interaction Design ...
Echoes, Whispers, and Footsteps from the Conflux of Sonic Interaction Design ...Echoes, Whispers, and Footsteps from the Conflux of Sonic Interaction Design ...
Echoes, Whispers, and Footsteps from the Conflux of Sonic Interaction Design ...
Eoin Brazil
 
Stop Looking and Start Listening
Stop Looking and Start ListeningStop Looking and Start Listening
Stop Looking and Start Listening
Becky Stewart
 
01-00-ACA-Introduction-2-MIR.pdf
01-00-ACA-Introduction-2-MIR.pdf01-00-ACA-Introduction-2-MIR.pdf
01-00-ACA-Introduction-2-MIR.pdf
AlexanderLerch4
 
Colloque IMT -04/04/2019- L'IA au cœur des mutations industrielles - "Machine...
Colloque IMT -04/04/2019- L'IA au cœur des mutations industrielles - "Machine...Colloque IMT -04/04/2019- L'IA au cœur des mutations industrielles - "Machine...
Colloque IMT -04/04/2019- L'IA au cœur des mutations industrielles - "Machine...
I MT
 
Applications of AI and NLP to advance Music Recommendations on Voice Assistants
Applications of AI and NLP to advance Music Recommendations on Voice AssistantsApplications of AI and NLP to advance Music Recommendations on Voice Assistants
Applications of AI and NLP to advance Music Recommendations on Voice Assistants
AI Publications
 
web based music genre classification.pptx
web based music genre classification.pptxweb based music genre classification.pptx
web based music genre classification.pptx
UmaMahesh786960
 
Case-based Sequential Ordering of Songs for Playlist Recommendation
Case-based Sequential Ordering of Songs for Playlist RecommendationCase-based Sequential Ordering of Songs for Playlist Recommendation
Case-based Sequential Ordering of Songs for Playlist Recommendation
Uri Levanon
 
Poster vega north
Poster vega northPoster vega north
Poster vega north
AcxelVega
 

Similar to CSTalks - Music Information Retrieval - 23 Feb (20)

Mehfil : Song Recommendation System Using Sentiment Detected
Mehfil : Song Recommendation System Using Sentiment DetectedMehfil : Song Recommendation System Using Sentiment Detected
Mehfil : Song Recommendation System Using Sentiment Detected
 
Music recognition
Music recognition Music recognition
Music recognition
 
Knn a machine learning approach to recognize a musical instrument
Knn  a machine learning approach to recognize a musical instrumentKnn  a machine learning approach to recognize a musical instrument
Knn a machine learning approach to recognize a musical instrument
 
Music data mining
Music  data miningMusic  data mining
Music data mining
 
Music Personalization At Spotify
Music Personalization At SpotifyMusic Personalization At Spotify
Music Personalization At Spotify
 
survey on Hybrid recommendation mechanism to get effective ranking results fo...
survey on Hybrid recommendation mechanism to get effective ranking results fo...survey on Hybrid recommendation mechanism to get effective ranking results fo...
survey on Hybrid recommendation mechanism to get effective ranking results fo...
 
J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...
J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...
J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...
 
Music Recommendation System using Euclidean, Cosine Similarity, Correlation D...
Music Recommendation System using Euclidean, Cosine Similarity, Correlation D...Music Recommendation System using Euclidean, Cosine Similarity, Correlation D...
Music Recommendation System using Euclidean, Cosine Similarity, Correlation D...
 
1. paul dwyer internet radio emma
1. paul dwyer internet radio emma1. paul dwyer internet radio emma
1. paul dwyer internet radio emma
 
Music discovery on the net
Music discovery on the netMusic discovery on the net
Music discovery on the net
 
Emofy
Emofy Emofy
Emofy
 
Multi modal music mood classification
Multi modal music mood classificationMulti modal music mood classification
Multi modal music mood classification
 
Echoes, Whispers, and Footsteps from the Conflux of Sonic Interaction Design ...
Echoes, Whispers, and Footsteps from the Conflux of Sonic Interaction Design ...Echoes, Whispers, and Footsteps from the Conflux of Sonic Interaction Design ...
Echoes, Whispers, and Footsteps from the Conflux of Sonic Interaction Design ...
 
Stop Looking and Start Listening
Stop Looking and Start ListeningStop Looking and Start Listening
Stop Looking and Start Listening
 
01-00-ACA-Introduction-2-MIR.pdf
01-00-ACA-Introduction-2-MIR.pdf01-00-ACA-Introduction-2-MIR.pdf
01-00-ACA-Introduction-2-MIR.pdf
 
Colloque IMT -04/04/2019- L'IA au cœur des mutations industrielles - "Machine...
Colloque IMT -04/04/2019- L'IA au cœur des mutations industrielles - "Machine...Colloque IMT -04/04/2019- L'IA au cœur des mutations industrielles - "Machine...
Colloque IMT -04/04/2019- L'IA au cœur des mutations industrielles - "Machine...
 
Applications of AI and NLP to advance Music Recommendations on Voice Assistants
Applications of AI and NLP to advance Music Recommendations on Voice AssistantsApplications of AI and NLP to advance Music Recommendations on Voice Assistants
Applications of AI and NLP to advance Music Recommendations on Voice Assistants
 
web based music genre classification.pptx
web based music genre classification.pptxweb based music genre classification.pptx
web based music genre classification.pptx
 
Case-based Sequential Ordering of Songs for Playlist Recommendation
Case-based Sequential Ordering of Songs for Playlist RecommendationCase-based Sequential Ordering of Songs for Playlist Recommendation
Case-based Sequential Ordering of Songs for Playlist Recommendation
 
Poster vega north
Poster vega northPoster vega north
Poster vega north
 

More from cstalks

CSTalks-Natural Language Processing-2 Nov
CSTalks-Natural Language Processing-2 NovCSTalks-Natural Language Processing-2 Nov
CSTalks-Natural Language Processing-2 Nov
cstalks
 
CSTalks-Natural Language Processing-17Aug
CSTalks-Natural Language Processing-17AugCSTalks-Natural Language Processing-17Aug
CSTalks-Natural Language Processing-17Aug
cstalks
 
CSTalks-Sensor-Rich Mobile Video Indexing and Search-17Aug
CSTalks-Sensor-Rich Mobile Video Indexing and Search-17AugCSTalks-Sensor-Rich Mobile Video Indexing and Search-17Aug
CSTalks-Sensor-Rich Mobile Video Indexing and Search-17Aug
cstalks
 
CSTalks-Visualizing Software Behavior-14Sep
CSTalks-Visualizing Software Behavior-14SepCSTalks-Visualizing Software Behavior-14Sep
CSTalks-Visualizing Software Behavior-14Sep
cstalks
 
CSTalks-Polymorphic heterogeneous multicore systems-17Aug
CSTalks-Polymorphic heterogeneous multicore systems-17AugCSTalks-Polymorphic heterogeneous multicore systems-17Aug
CSTalks-Polymorphic heterogeneous multicore systems-17Aug
cstalks
 
CSTalks-Quaternary Semantics Recomandation System-24 Aug
CSTalks-Quaternary Semantics Recomandation System-24 AugCSTalks-Quaternary Semantics Recomandation System-24 Aug
CSTalks-Quaternary Semantics Recomandation System-24 Aug
cstalks
 
CSTalks - Object detection and tracking - 25th May
CSTalks - Object detection and tracking - 25th MayCSTalks - Object detection and tracking - 25th May
CSTalks - Object detection and tracking - 25th May
cstalks
 
CSTalks - The Multicore Midlife Crisis - 30 Mar
CSTalks - The Multicore Midlife Crisis - 30 MarCSTalks - The Multicore Midlife Crisis - 30 Mar
CSTalks - The Multicore Midlife Crisis - 30 Mar
cstalks
 
CSTalks - On machine learning - 2 Mar
CSTalks - On machine learning - 2 MarCSTalks - On machine learning - 2 Mar
CSTalks - On machine learning - 2 Mar
cstalks
 
CSTalks - Real movie recommendation - 9 Mar
CSTalks - Real movie recommendation - 9 MarCSTalks - Real movie recommendation - 9 Mar
CSTalks - Real movie recommendation - 9 Mar
cstalks
 
CSTalks-LifeBeyondPhD-16Mar
CSTalks-LifeBeyondPhD-16MarCSTalks-LifeBeyondPhD-16Mar
CSTalks-LifeBeyondPhD-16Mar
cstalks
 
CSTalks - Peer-to-peer - 16 Feb
CSTalks - Peer-to-peer - 16 FebCSTalks - Peer-to-peer - 16 Feb
CSTalks - Peer-to-peer - 16 Feb
cstalks
 
CSTalks - Named Data Networks - 9 Feb
CSTalks - Named Data Networks - 9 FebCSTalks - Named Data Networks - 9 Feb
CSTalks - Named Data Networks - 9 Feb
cstalks
 
CSTalks - Model Checking - 26 Jan
CSTalks - Model Checking - 26 JanCSTalks - Model Checking - 26 Jan
CSTalks - Model Checking - 26 Jan
cstalks
 
CSTalks - GPGPU - 19 Jan
CSTalks  -  GPGPU - 19 JanCSTalks  -  GPGPU - 19 Jan
CSTalks - GPGPU - 19 Jan
cstalks
 

More from cstalks (15)

CSTalks-Natural Language Processing-2 Nov
CSTalks-Natural Language Processing-2 NovCSTalks-Natural Language Processing-2 Nov
CSTalks-Natural Language Processing-2 Nov
 
CSTalks-Natural Language Processing-17Aug
CSTalks-Natural Language Processing-17AugCSTalks-Natural Language Processing-17Aug
CSTalks-Natural Language Processing-17Aug
 
CSTalks-Sensor-Rich Mobile Video Indexing and Search-17Aug
CSTalks-Sensor-Rich Mobile Video Indexing and Search-17AugCSTalks-Sensor-Rich Mobile Video Indexing and Search-17Aug
CSTalks-Sensor-Rich Mobile Video Indexing and Search-17Aug
 
CSTalks-Visualizing Software Behavior-14Sep
CSTalks-Visualizing Software Behavior-14SepCSTalks-Visualizing Software Behavior-14Sep
CSTalks-Visualizing Software Behavior-14Sep
 
CSTalks-Polymorphic heterogeneous multicore systems-17Aug
CSTalks-Polymorphic heterogeneous multicore systems-17AugCSTalks-Polymorphic heterogeneous multicore systems-17Aug
CSTalks-Polymorphic heterogeneous multicore systems-17Aug
 
CSTalks-Quaternary Semantics Recomandation System-24 Aug
CSTalks-Quaternary Semantics Recomandation System-24 AugCSTalks-Quaternary Semantics Recomandation System-24 Aug
CSTalks-Quaternary Semantics Recomandation System-24 Aug
 
CSTalks - Object detection and tracking - 25th May
CSTalks - Object detection and tracking - 25th MayCSTalks - Object detection and tracking - 25th May
CSTalks - Object detection and tracking - 25th May
 
CSTalks - The Multicore Midlife Crisis - 30 Mar
CSTalks - The Multicore Midlife Crisis - 30 MarCSTalks - The Multicore Midlife Crisis - 30 Mar
CSTalks - The Multicore Midlife Crisis - 30 Mar
 
CSTalks - On machine learning - 2 Mar
CSTalks - On machine learning - 2 MarCSTalks - On machine learning - 2 Mar
CSTalks - On machine learning - 2 Mar
 
CSTalks - Real movie recommendation - 9 Mar
CSTalks - Real movie recommendation - 9 MarCSTalks - Real movie recommendation - 9 Mar
CSTalks - Real movie recommendation - 9 Mar
 
CSTalks-LifeBeyondPhD-16Mar
CSTalks-LifeBeyondPhD-16MarCSTalks-LifeBeyondPhD-16Mar
CSTalks-LifeBeyondPhD-16Mar
 
CSTalks - Peer-to-peer - 16 Feb
CSTalks - Peer-to-peer - 16 FebCSTalks - Peer-to-peer - 16 Feb
CSTalks - Peer-to-peer - 16 Feb
 
CSTalks - Named Data Networks - 9 Feb
CSTalks - Named Data Networks - 9 FebCSTalks - Named Data Networks - 9 Feb
CSTalks - Named Data Networks - 9 Feb
 
CSTalks - Model Checking - 26 Jan
CSTalks - Model Checking - 26 JanCSTalks - Model Checking - 26 Jan
CSTalks - Model Checking - 26 Jan
 
CSTalks - GPGPU - 19 Jan
CSTalks  -  GPGPU - 19 JanCSTalks  -  GPGPU - 19 Jan
CSTalks - GPGPU - 19 Jan
 

Recently uploaded

HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
Edge AI and Vision Alliance
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
Mariano Tinti
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
Daiki Mogmet Ito
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
DianaGray10
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 

Recently uploaded (20)

HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 

CSTalks - Music Information Retrieval - 23 Feb

  • 1. CSTalks Similarity Measures in Music Information Retrieval Systems Speaker: Zhonghua Li Supervisor: Ye Wang lizhongh@comp.nus.edu.sg 1
  • 2. Where do you search for music?  Music is an important part of our way of life.  Music stores  Online music Mufin 2
  • 3. 3
  • 4. Outline  Music Information Retrieval (MIR)  Definition  Applications – Search  Applications -- Recommendation  Similarity Measure Methods  Text-based Method  Audio Feature-based Method  Semantic Concept-based Method  Multimodal Fusion Method  Conclusion and Future Directions 4
  • 5. Definition  Music Information Retrieval  Is the process of searching for, and finding, music objects, or part of objects, via a query framed musically and/or in musical terms.  Music objects: Recordings (wav, mp3, etc.), scores, parts, etc.  Musically framed query: Singing, humming, keyboad, notation-based, MIDI files, sound files, etc.  Music terms: Genre, style, tempo, bibliography, etc.  Applications  Music search, recommendation, identification ,etc. 5
  • 6. Outline  Music Information Retrieval (MIR)  Definition  Applications – Search  Applications -- Recommendation  Music Similarity Measure  Text-based Method  Audio Feature-based Method  Semantic Concept-based Method  Multimodal Fusion Method  Conclusion and Future Directions 6
  • 7. Applications -- Search  Text-based Music Search  Compare a textual query with the metadata  Is adopted by most existing systems.  Examples: Last.fm, Musicovery, … 7
  • 8. Applications -- Search  Content-based Music Search  Compare audio query with audio content  Query-by-humming/singing/recording: midomi 8
  • 9. Applications -- Search  Content-based Music Search  Compare rhythm tapped with audio content  Query-by-tapping: SongTapper 9
  • 10. Applications -- Search  User’s information need Online Offline (intention) : Query Explicit Query: text, audio, etc. Intention Gap Query Formation  Similarity measure: Descriptors Documents Query  Music documents in the Match Descriptor Extraction Database Indexing Semantic Gap Index Descriptors  Ranking: relevant documents by domain specific criterions (no. of Ranking hits ). Ranked List Presentation Results 10
  • 11. Outline  Music Information Retrieval (MIR)  Definition  Applications – Search  Applications -- Recommendation  Similarity Measure Methods  Text-based Method  Audio Feature-based Method  Semantic Concept-based Method  Multimodal Fusion Method  Conclusion and Future Directions 11
  • 12. Applications – Recommendation  Collaborative-Filtering-based Recommendation  Last.fm: what you (and others ) listen to and like,  Amazon: customers who shopped for … also shopped for … 12
  • 13. Applications – Recommendation  Collaborative-Filtering-based Recommendation  Last.fm: what you (and others ) listen to and like,  Amazon: customers who shopped for … also shopped for …  Example:  Users: A, B, and C  Music: 1, 2, …, 8 Small similarity Large similarity C A B Similarity 4 Similarity 1 Measure 1 Measure 2 2 6 3 3 8 4 5 Recommend to user A 13
  • 14. Applications -- Recommendation  Audio Content-based Recommendation  Recommend songs which have similar audio content to the songs that you like.  Pandora: Music database Music Experts User Listen Instrument: Instrument: Similarity Instrument: Vocal: Vocal: Vocal: Structure: Measure Structure: …Structure: … … 400 Attributes/song Recommendations 14
  • 15. Applications -- Recommendation  User’s information need Online / Offline Offline (intention): User Profile Implicit user profiles: ratings, Intention Gap Profile Capture listening history, etc. Descriptors Documents  Similarity measure: Descriptor User profiles Music Match Extraction documents/other user profiles Semantic Gap Index Indexing Descriptors  Ranking: relevant documents Ranking by some domain specific Ranked List criterions (no. of hits). Presentation Results 15
  • 16. Similarity Measure  one of the most fundamental concepts in MIR Online / Offline Offline  Closely related to User Profile /query  What information music Profile/query Intention Gap Capture contains.  How this information is Descriptors Documents represented. Match Descriptor Extraction  How to match between themSemantic Gap Indexing Index Descriptors Ranking Ranked List Presentation Results 16
  • 17. Music Information Plane Similarity can be measure from different aspects. Song1: New favorite - Alison Krauss Song2: She is Beautiful - Andrew W.K. Song1 Song2 Female Male Dissimilar Gentle Aggressive Slow fast Guitar Similar Tempo: ~162 BPM (Beat Per Minute) * O. C. Herrada. Music recommendation and discovery in Music the long tail. PhD thsis. 2008. 17
  • 18. Outline  Music Information Retrieval (MIR)  Definition  Applications – Search  Applications -- Recommendation  Similarity Measure Methods  Text-based Method  Audio Feature-based Method  Semantic Concept-based Method  Multimodal Fusion Method  Conclusion and Future Directions 18
  • 19. Similarity Measure Methods  Text-based Method: Okapi BM-25 Ranking  Given: queryQ, containing keywords q1, …, qn, music documents: bag of words.  BM25 ranking function can be formulated as:  f(qi, D) is qi’s term frequency (tf) in document D.  |D| is the length of document D in words.  avgdl is the average document length in the collection.  k1 and b are free parameters, usually set as k1=2.0 and b=0.75.  IDF(qi) is the inverse document frequency (idf), calculated as: . The query term appears in this document frequently. f (qi, D) . And it doesn’t appear in other document. IDF 19
  • 20. Similarity Measure Methods  Text-based Method:  Pros  Simple & efficient  Cons  Affected by noisy/wrong texts  Songs with no text cannot be retrieved  Require high-level domain knowledge to create good metadata  “Text retrieval on audio metadata” not pure music retrieval 20
  • 21. Outline  Music Information Retrieval (MIR)  Definition  Applications – Search  Applications -- Recommendation  Similarity Measure Methods  Text-based Method  Audio Feature-based Method  Semantic Concept-based Method  Multimodal Fusion Method  Conclusion and Future Directions 21
  • 22. Similarity Measure Methods  Audio Feature-based Method Audio feature Distribution Model Music extraction modeling comparison 22
  • 23. Similarity Measure Methods  Audio Feature-based Method  Audio Feature extraction -> Distribution modeling -> Model Comparison frame Feature Vector … 23
  • 24. Existing Works  Audio Feature-based Method  Audio Feature extraction  Distribution modeling  Model Comparison  Use low-level feature directly  Pitch, loudness, MFCC (Blum et al.[3], 1999)  Histogram of MFCC (Foote[4], 1997)  Spectrum, rhythm, chord changesingleVector (Tzanetakis [5], 2002)  Low-level features  higher-level features.  Cluster MFCC=>model comparison (Aucouturier[6], 2002)  MFCC => Gaussian Mixture Models => model comparison  MFCC =>“anchor space”, compare probability models (Berenzweig et al. [7], 2003) 24
  • 25. Similarity Measure Methods  Audio Feature-based Method  Audio Feature extraction  Distribution modeling  Model Comparison  Euclidean /Cosine distance (uniform-length feature vectors)  Distance between two probability distributions  Kullback-Leibler divergence (KL Distance) / relative entropy  No closed form for Gaussian Distributions  Centroid distance: Euclidean distance between the overall means;  Sampling based method: compute the likelihood of one model given points sampled from another; very computationally expensive;  Earth-Mover’s distance Berenzweig, A., Logan, B., Ellis, D. P., and Whitman, B. P., A Large-Scale Evaluation of Acoustic and Subjective Music-Similarity Measures. Computer Music Journal. 28, 2004, 25
  • 26. Similarity Measure Methods  Audio Feature-based Method  Pros  Can deal with new songs with no or few texts.  Save human labors from annotating each song manually  Cons  Time complexity is relatively high.  Features ≠ audio piece: Two songs with very similar features may sounds very different.  The average performance is reaching the glass ceiling of around 65% in accuracy. 26
  • 27. Outline  Music Information Retrieval (MIR)  Definition  Applications – Search  Applications -- Recommendation  Similarity Measure Methods  Text-based Method  Audio Feature-based Method  Semantic Concept-based Method  Multimodal Fusion Method  Conclusion and Future Directions 27
  • 28. Similarity Measure Methods  Semantic Concept-based Method  Nature of user queries  Far beyond of bibliographic text and audio search  Semantically-rich  Syntactically- undetermined e.g.: “Find me a classical and happy song”, or “Find me a song to relax” “Find me some songs for parties/ weddings/ in churches” …  Collaborative(social) tagging is very popular on Web 2.0. Users annotate their feelings or opinions to the music. Tags, comments, etc. 28
  • 29. Similarity Measure Methods  Semantic Concept-based Method  Tags VS user queries (Last.fm) Tag Type Frequency Multi-tag search queries Genre 68% 51% Locale 12% 7% Mood 5% 4% Opinion 4% 2% Instrumentation 4% 5% Style 3% 26% . Paul Lamere. Social tagging and music information retrieval. Journal of New Music Research. 2008. . Klaas Bosteels, Elias Pampalk, and Etienne E. Kerre. Music retrieval based on social tags: a case study. ISMIR, 2008. 29
  • 30. Similarity Measure Methods Vocabulary: classical, jazz, … piano, violin, …, female, male, … … Model Model … Model … Probability vector Song1 Similarity … Song2 30
  • 31. Outline  Music Information Retrieval (MIR)  Definition  Applications – Search  Applications -- Recommendation  Similarity Measure Methods  Text-based Method  Audio Feature-based Method  Semantic Concept-based Method  Multimodal Fusion Method  Conclusion and Future Directions 31
  • 32. Similarity Measure Methods  Multimodal Method  Information keeps growing.  One of the most important ongoing trends: Metadata Audio Semantic Content Concept  Users are important. 32
  • 33. Similarity Measure Methods  Multimodal Method Document Vectors Customization Fuzzy Music Semantic Vector B. Zhang, J. Shen, Q. Xiang, and Y. Wang. CompositMap: a novel framework for music similarity measure. ACM Multimedia, 2009. 33
  • 34. Outline  Music Information Retrieval (MIR)  Definition  Applications – Search  Applications -- Recommendation  Similarity Measure Methods  Text-based Method  Audio Feature-based Method  Semantic Concept-based Method  Multimodal Fusion Method  Conclusion and Future Directions 34
  • 35. Conclusion and Future Directions  What makes MIR (and the similarity measure) so tricky? Music information is  Multimodal: audio, metadata, social , …  Multicultural: e.g., modern art, Indian ragas, …  Multirepresentational: audio, MIDI, score, …  Multifaceted: melody, tempo, beat, … …  Similarity can be measured from different aspects. 35
  • 36. Conclusions and Future Directions  What do users really want? Intention Gap  User interactions with the system.  Learn a good user preference modeling  What kind of music features can really capture this need?  Content –Tags Semantic Gap  Leverage more social data? Comments, ratings, groups, playlist, other user created information, …  How to fuse multiple information effectively?  Identify the relevant/discriminative information aspects  Fusion Methods 36
  • 37. 37
  • 38. References  [2] F[1] O. C. Herrada. Music recommendation and discovery in the long tail. PhD thsis. 2008.  . Pachet. Knowledge management and musical metadata. Encyclopedia of Knowledge Management. Idea Group, 2005.  [3] T. L. Blum, D. F. Keislar, J. A. Wheaton, and E. H. Wold. Method and article of manufacture for content-based analysis, storage, retrieval, and segmentation of audio information. U.S. Patent 5, 918, 223.  [4] J. T. Foote. Content-based retrieval of music and audio. SPIE, 1997.  [5] G. Tzanetakis. Manipulation, analysis, and retrieval system for audio signals. PhD thsis, 2002.  [6] J. J. Aucouturier and F. Pachet. Music similarity measure: What’s the use? International Symposium on Music information retrieval. 2002.  [7] A. Berenzweig, D. P. W. Ellis and S. Lawrence. Anchor space for classification and similarity measurement for music. ICME 2003. 38
  • 39. References  [8] B. Zhang, J. Shen, Q. Xiang and Y. Wang. CompositeMap: a novel framework for music similarity measure. SIGIR, 2009.  [9] B. Whiteman and S. Lawrence. Inferring descriptions and similarity for music from community metadata. International computer music conference. 2002.  [10] M. Schedl, T. Pohle, P. Knees and G. Widmer. Assigning and visualizing music genre by web-based co-occurrence analysis. ISMIR 2006.  [11] B. Whitman and Paris Smaragdis. Combining musical and cultural features for intelligent style detection. ISMIR 2002.  [12] L. Chen, P. Wright, and W. Nejdl. Improving music genre classification using collaborative tagging data. WSDM, 2009.  [13] Benedikt Raes. Automatic generation of music metadata. ISMIR, 2009. 39