SlideShare a Scribd company logo
1 of 18
Multimodal Music Tagging Task


Nicola Orio – University of Padova
Cynthia C. S. Liem – Delft University of Technology
Geoffroy Peeters – UMR STMS IRCAM-CNRS, Paris
Markus Schedl – Johannes Kepler University, Linz

MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   1
Multimodal music tagging
• Definition
    • Songs of a commercial music library need to be categorized
      according to their usage in TV and radio broadcasts (e.g.
      soundtracks, jingles)

• Practical motivation
    • The search for suitable music for video productions is a
      major activity for professionals and lay users alike
         • Collaborative filtering systems are taking their role
             • Notwithstanding their known limitations: long-tail, cold start…
    • Annotating professional music libraries is another important
      professional activity

MediaEval, Pisa 05/10/2012    MusiClef: Multimodal Music Tagging Task            2
Human assessment




 Different sources of information are routinely exploited
 by professionals to overcome limitations of individual media
MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   3
Goals of MusiClef
• To focus evaluation on professional application scenarios
    • Textual description of music items

• To grant replication of experiments and results
    • Feature extraction phase is crucial – released features
      computed with public, open-source library (MIRToolbox)

• To promote the exploitation of multimodal sources of
  information
    • Content (audio) + Context (tags & webpages)

• To disseminate music related initiatives
    • Outside the music information retrieval community
MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   4
Evaluation initiatives – 1
• MIREX (since 2004)
    • Community-based selection of tasks
         • Many tasks address audio feature extraction algorithms
    • Participants submit algorithms that are run by organizers
         • Music files are not shared with participants

• Million Song Dataset (since 2011)
    • Task on music recommendation proposed by organizers
    • Audio features are computed using proprietary algorithms
         • Only features are shared with participants


MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   5
Evaluation initiatives – 2

• Quaero-Eval (since 2012)
         • Tasks agreed with participants
             • Strategies to grant public access to evaluation results
    • Participants run training experiments on a shared repository
         • Runs on test set made by the organizers




MediaEval, Pisa 05/10/2012     MusiClef: Multimodal Music Tagging Task   6
Test collection – 1
• Individual songs of pop and rock music
    • 1355 songs (from 218 artists)
    • train (975) and test (380) split

• Social tags
    • Gathered from Last.fm API

• Multilingual sets of Web pages related to artists+albums
    • Mined querying Google

• Acoustic features: MFCC (using MIRToolbox) with a
  window length of 200ms and 50% overlap

MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   7
Test collection – 2
• Test collection created starting from the “500 Greatest
  Songs of All Time” (Rolling Stone)
    • Expected high number of social tags and web pages

• Ground truth created by experts in the domain
    • 355 tags selected (167 genre, 288 usage)
         • Tags associated to less than 20 songs were discarded

• Reference implementation in Matlab
    • Participants has an example to run a complete experiment
    • Code for the evaluation made already available

MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   8
Evaluation measures

• Standard IR measures

•   Accuracy
•   Precision
•   Recall
•   Specificity
•   F-measure



MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   9
Examining tags more closely
• Some tags are more equal than others…


                   hard rock                                                      ballroom
                                              melancholic
       travel
                               countryside
                                                                         bright

• Thus, we propose to also analyze results employing a
  higher-level tag categorization

MediaEval, Pisa 05/10/2012     MusiClef: Multimodal Music Tagging Task                       10
Tag categorization – 1
• Affective, mood-related aspects:
    • activity: the amount of perceived music
      activity, without implying strong positive or
      negative affective qualities (e.g.
      'fast', 'mellow', 'lazy')
    • affective state: affective qualities that can only be
      connected and attributed to living beings (e.g.
      'aggressive', 'hopeful')
    • atmosphere: affective qualities that can be
      connected to environments (e.g.
      'chaotic', 'intimate').

MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   11
Tag categorization – 2
• Situation, time and space aspects of the music:
    • Physical situation: concrete physical environments
      (e.g. 'city', 'night').
    • Occasion: implications of time and space, typically
      connected to social events (e.g. 'holiday', 'glamour').
• Sociocultural genre (e.g. 'new wave', 'r&b', 'punk')
• Sound qualities:
    • timbral aspects (e.g. 'acoustic', 'bright')
    • temporal aspects (e.g. 'beat', 'groove').
• Other (e.g. 'catchy', 'evocative').
MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   12
Reference implementation
• Made in MATLAB and released publicly
• Simple and straightforward approaches:
    • Individual GMMs for audio, user tags, web pages
    • Tagging process: 1-NN qualification using symmetrized KL


• Scenarios tested:
    • Audio, user tags, web pages individually
    • Majority vote
    • Union


MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   13
Baseline results – 1
• Evaluation of the submitted runs and of the reference
  implementation
    • Results with different modalities over the full dataset

strategy          accuracy           recall           precision         specificity    f-measure
audio                        0.894            0.148           0.127            0.939        0.126
tags                         0.898            0.061           0.039            0.942        0.037
web pages                    0.897            0.050           0.007            0.954        0.011
majority                     0.880            0.123           0.086            0.922        0.086
union                        0.824            0.240           0.115            0.845        0.134


MediaEval, Pisa 05/10/2012           MusiClef: Multimodal Music Tagging Task                  14
Baseline results – 2
                                                                       1. activity, energy
                                                                       2. affective state
                                                                       3. atmosphere
                                                                       4. other
                                                                       5. situation: occasion
                                                                       6. situation: physical
                                                                       7. sociocultural genre
                                                                       8. sound: temporal
                                                                       9: sound: timbral




MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task                    15
Participation

• Initially a lot of interest - about 8 explicitly interested
  parties
• But ultimately just one participant (LUTIN UserLab)
    • Aggregation of estimators
• Currently investigating what happened to the 7 others
    • So far, it appears ISMIR 2012 was inconveniently close
    • The 3 other MusiClef co-organizers will discuss this there



MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   16
Conclusions

• We established a multimodal music tagging benchmark task
• Special effort in facilitating deeper tag analysis
• We would like a 2013 multimodal music benchmark task
    • Depending on survey input
    • Depending on your input




MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   17
Thank you for your attention!


For contact and more information: musiclef@dei.unipd.it




MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   18

More Related Content

Viewers also liked

The MediaEval 2012 Affect Task: Violent Scenes Detectio
The MediaEval 2012 Affect Task: Violent Scenes DetectioThe MediaEval 2012 Affect Task: Violent Scenes Detectio
The MediaEval 2012 Affect Task: Violent Scenes DetectioMediaEval2012
 
LIG at MediaEval 2012 affect task: use of a generic method
LIG at MediaEval 2012 affect task: use of a generic methodLIG at MediaEval 2012 affect task: use of a generic method
LIG at MediaEval 2012 affect task: use of a generic methodMediaEval2012
 
Mentor Strategy Session: Business Plan and Video
Mentor Strategy Session: Business Plan and VideoMentor Strategy Session: Business Plan and Video
Mentor Strategy Session: Business Plan and VideoGrow America
 
Event Detection via LDA for the MediaEval2012 SED Task
Event Detection via LDA for the MediaEval2012 SED TaskEvent Detection via LDA for the MediaEval2012 SED Task
Event Detection via LDA for the MediaEval2012 SED TaskMediaEval2012
 
Week 2 discussion 2
Week 2 discussion 2Week 2 discussion 2
Week 2 discussion 2LILBIT2012
 
תחרות אלוף הידע
תחרות אלוף הידעתחרות אלוף הידע
תחרות אלוף הידעsabal1
 
John Richards: My Life Lessons As An Entrepreneur
John Richards: My Life Lessons As An EntrepreneurJohn Richards: My Life Lessons As An Entrepreneur
John Richards: My Life Lessons As An EntrepreneurGrow America
 
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...MediaEval2012
 
DCU Search Runs at MediaEval 2012: Search and Hyperlinking Task
DCU Search Runs at MediaEval 2012: Search and Hyperlinking TaskDCU Search Runs at MediaEval 2012: Search and Hyperlinking Task
DCU Search Runs at MediaEval 2012: Search and Hyperlinking TaskMediaEval2012
 
MediaEval 2012 Opening
MediaEval 2012 OpeningMediaEval 2012 Opening
MediaEval 2012 OpeningMediaEval2012
 
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...MediaEval2012
 
Secrets of Storytelling by Candace Klein
Secrets of Storytelling by Candace KleinSecrets of Storytelling by Candace Klein
Secrets of Storytelling by Candace KleinGrow America
 
Overview of MediaEval 2012 Visual Privacy Task
Overview of MediaEval 2012 Visual Privacy TaskOverview of MediaEval 2012 Visual Privacy Task
Overview of MediaEval 2012 Visual Privacy TaskMediaEval2012
 
Simha_23_REFFIT_Biochar_ICT_Published Version
Simha_23_REFFIT_Biochar_ICT_Published VersionSimha_23_REFFIT_Biochar_ICT_Published Version
Simha_23_REFFIT_Biochar_ICT_Published VersionPrithvi Simha
 
When Ideas and Opportunities Collide
When Ideas and Opportunities CollideWhen Ideas and Opportunities Collide
When Ideas and Opportunities CollideGrow America
 
Search and Hyperlinking Task at MediaEval 2012
Search and Hyperlinking Task at MediaEval 2012Search and Hyperlinking Task at MediaEval 2012
Search and Hyperlinking Task at MediaEval 2012MediaEval2012
 

Viewers also liked (18)

The MediaEval 2012 Affect Task: Violent Scenes Detectio
The MediaEval 2012 Affect Task: Violent Scenes DetectioThe MediaEval 2012 Affect Task: Violent Scenes Detectio
The MediaEval 2012 Affect Task: Violent Scenes Detectio
 
Simha_RP
Simha_RPSimha_RP
Simha_RP
 
LIG at MediaEval 2012 affect task: use of a generic method
LIG at MediaEval 2012 affect task: use of a generic methodLIG at MediaEval 2012 affect task: use of a generic method
LIG at MediaEval 2012 affect task: use of a generic method
 
Mentor Strategy Session: Business Plan and Video
Mentor Strategy Session: Business Plan and VideoMentor Strategy Session: Business Plan and Video
Mentor Strategy Session: Business Plan and Video
 
Event Detection via LDA for the MediaEval2012 SED Task
Event Detection via LDA for the MediaEval2012 SED TaskEvent Detection via LDA for the MediaEval2012 SED Task
Event Detection via LDA for the MediaEval2012 SED Task
 
Week 2 discussion 2
Week 2 discussion 2Week 2 discussion 2
Week 2 discussion 2
 
תחרות אלוף הידע
תחרות אלוף הידעתחרות אלוף הידע
תחרות אלוף הידע
 
John Richards: My Life Lessons As An Entrepreneur
John Richards: My Life Lessons As An EntrepreneurJohn Richards: My Life Lessons As An Entrepreneur
John Richards: My Life Lessons As An Entrepreneur
 
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...
 
DCU Search Runs at MediaEval 2012: Search and Hyperlinking Task
DCU Search Runs at MediaEval 2012: Search and Hyperlinking TaskDCU Search Runs at MediaEval 2012: Search and Hyperlinking Task
DCU Search Runs at MediaEval 2012: Search and Hyperlinking Task
 
MediaEval 2012 Opening
MediaEval 2012 OpeningMediaEval 2012 Opening
MediaEval 2012 Opening
 
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...
 
Thotcon2013
Thotcon2013Thotcon2013
Thotcon2013
 
Secrets of Storytelling by Candace Klein
Secrets of Storytelling by Candace KleinSecrets of Storytelling by Candace Klein
Secrets of Storytelling by Candace Klein
 
Overview of MediaEval 2012 Visual Privacy Task
Overview of MediaEval 2012 Visual Privacy TaskOverview of MediaEval 2012 Visual Privacy Task
Overview of MediaEval 2012 Visual Privacy Task
 
Simha_23_REFFIT_Biochar_ICT_Published Version
Simha_23_REFFIT_Biochar_ICT_Published VersionSimha_23_REFFIT_Biochar_ICT_Published Version
Simha_23_REFFIT_Biochar_ICT_Published Version
 
When Ideas and Opportunities Collide
When Ideas and Opportunities CollideWhen Ideas and Opportunities Collide
When Ideas and Opportunities Collide
 
Search and Hyperlinking Task at MediaEval 2012
Search and Hyperlinking Task at MediaEval 2012Search and Hyperlinking Task at MediaEval 2012
Search and Hyperlinking Task at MediaEval 2012
 

Similar to Multimodal Music Tagging Task Overview

Music Recommendation 2018
Music Recommendation 2018Music Recommendation 2018
Music Recommendation 2018Fabien Gouyon
 
Trends in Music Recommendations 2018
Trends in Music Recommendations 2018Trends in Music Recommendations 2018
Trends in Music Recommendations 2018Karthik Murugesan
 
Learning how to learn, with Software Carpentry
Learning how to learn, with Software CarpentryLearning how to learn, with Software Carpentry
Learning how to learn, with Software CarpentrySoundSoftware ac.uk
 
BEING LUCKY. TRANSMEDIA AND CO-CREATION PRACTICES IN MUSIC VIDEO-CLIPS
BEING LUCKY. TRANSMEDIA AND CO-CREATION PRACTICES IN MUSIC VIDEO-CLIPS BEING LUCKY. TRANSMEDIA AND CO-CREATION PRACTICES IN MUSIC VIDEO-CLIPS
BEING LUCKY. TRANSMEDIA AND CO-CREATION PRACTICES IN MUSIC VIDEO-CLIPS Gemma San Cornelio Esquerdo
 
David L Page DCI KKP622 mid Project 1 report.20160610
David L Page DCI KKP622 mid Project 1 report.20160610David L Page DCI KKP622 mid Project 1 report.20160610
David L Page DCI KKP622 mid Project 1 report.20160610David L Page
 
David L Page DCI Project 1 Seminar Presentation.20180816.pdf
David L Page DCI Project 1 Seminar Presentation.20180816.pdfDavid L Page DCI Project 1 Seminar Presentation.20180816.pdf
David L Page DCI Project 1 Seminar Presentation.20180816.pdfDavid L Page
 
Introduction to Fast by Professor Mark Sandler
Introduction to Fast by  Professor Mark SandlerIntroduction to Fast by  Professor Mark Sandler
Introduction to Fast by Professor Mark SandlerFASTIMPACT
 
Mlapv
MlapvMlapv
Mlapvhhs
 
Copy of 3INFOR~1.powerpoint presentation
Copy of 3INFOR~1.powerpoint presentationCopy of 3INFOR~1.powerpoint presentation
Copy of 3INFOR~1.powerpoint presentationRouAnnAdobasNavarroz1
 
Authentic materials 2018 copy ppt
Authentic materials 2018 copy pptAuthentic materials 2018 copy ppt
Authentic materials 2018 copy ppthhs
 
Music and Media Communications
Music and Media CommunicationsMusic and Media Communications
Music and Media CommunicationsCHTCT
 
Mobile Phone Instruments, the Possibilities of Networks, and OSC
Mobile Phone Instruments, the Possibilities of Networks, and OSCMobile Phone Instruments, the Possibilities of Networks, and OSC
Mobile Phone Instruments, the Possibilities of Networks, and OSCNathanBowen8
 
3. Information Literacy - Information Literacy and Performance Task- Project....
3. Information Literacy - Information Literacy and Performance Task- Project....3. Information Literacy - Information Literacy and Performance Task- Project....
3. Information Literacy - Information Literacy and Performance Task- Project....JemeloSipsip1
 
3. Information Literacy - Information Literacy and Performance Task- Project....
3. Information Literacy - Information Literacy and Performance Task- Project....3. Information Literacy - Information Literacy and Performance Task- Project....
3. Information Literacy - Information Literacy and Performance Task- Project....ronald775612
 
3. Information Literacy - Information Literacy and Performance Task- Project....
3. Information Literacy - Information Literacy and Performance Task- Project....3. Information Literacy - Information Literacy and Performance Task- Project....
3. Information Literacy - Information Literacy and Performance Task- Project....EdwardEpan1
 
Badges for Nature (HASTAC/DML proposal)
Badges for Nature (HASTAC/DML proposal)Badges for Nature (HASTAC/DML proposal)
Badges for Nature (HASTAC/DML proposal)Jon Rosewell
 
Sound Matters: a framework for the creative use and re-use of sound: field re...
Sound Matters: a framework for the creative use and re-use of sound: field re...Sound Matters: a framework for the creative use and re-use of sound: field re...
Sound Matters: a framework for the creative use and re-use of sound: field re...Jisc
 

Similar to Multimodal Music Tagging Task Overview (20)

Music Recommendation 2018
Music Recommendation 2018Music Recommendation 2018
Music Recommendation 2018
 
Trends in Music Recommendations 2018
Trends in Music Recommendations 2018Trends in Music Recommendations 2018
Trends in Music Recommendations 2018
 
Learning how to learn, with Software Carpentry
Learning how to learn, with Software CarpentryLearning how to learn, with Software Carpentry
Learning how to learn, with Software Carpentry
 
BEING LUCKY. TRANSMEDIA AND CO-CREATION PRACTICES IN MUSIC VIDEO-CLIPS
BEING LUCKY. TRANSMEDIA AND CO-CREATION PRACTICES IN MUSIC VIDEO-CLIPS BEING LUCKY. TRANSMEDIA AND CO-CREATION PRACTICES IN MUSIC VIDEO-CLIPS
BEING LUCKY. TRANSMEDIA AND CO-CREATION PRACTICES IN MUSIC VIDEO-CLIPS
 
David L Page DCI KKP622 mid Project 1 report.20160610
David L Page DCI KKP622 mid Project 1 report.20160610David L Page DCI KKP622 mid Project 1 report.20160610
David L Page DCI KKP622 mid Project 1 report.20160610
 
M13517
M13517M13517
M13517
 
David L Page DCI Project 1 Seminar Presentation.20180816.pdf
David L Page DCI Project 1 Seminar Presentation.20180816.pdfDavid L Page DCI Project 1 Seminar Presentation.20180816.pdf
David L Page DCI Project 1 Seminar Presentation.20180816.pdf
 
Introduction to Fast by Professor Mark Sandler
Introduction to Fast by  Professor Mark SandlerIntroduction to Fast by  Professor Mark Sandler
Introduction to Fast by Professor Mark Sandler
 
MULHER@AVI2012
MULHER@AVI2012MULHER@AVI2012
MULHER@AVI2012
 
Mlapv
MlapvMlapv
Mlapv
 
Copy of 3INFOR~1.powerpoint presentation
Copy of 3INFOR~1.powerpoint presentationCopy of 3INFOR~1.powerpoint presentation
Copy of 3INFOR~1.powerpoint presentation
 
Authentic materials 2018 copy ppt
Authentic materials 2018 copy pptAuthentic materials 2018 copy ppt
Authentic materials 2018 copy ppt
 
Music and Media Communications
Music and Media CommunicationsMusic and Media Communications
Music and Media Communications
 
Mobile Phone Instruments, the Possibilities of Networks, and OSC
Mobile Phone Instruments, the Possibilities of Networks, and OSCMobile Phone Instruments, the Possibilities of Networks, and OSC
Mobile Phone Instruments, the Possibilities of Networks, and OSC
 
08b final event_experimente
08b final event_experimente08b final event_experimente
08b final event_experimente
 
3. Information Literacy - Information Literacy and Performance Task- Project....
3. Information Literacy - Information Literacy and Performance Task- Project....3. Information Literacy - Information Literacy and Performance Task- Project....
3. Information Literacy - Information Literacy and Performance Task- Project....
 
3. Information Literacy - Information Literacy and Performance Task- Project....
3. Information Literacy - Information Literacy and Performance Task- Project....3. Information Literacy - Information Literacy and Performance Task- Project....
3. Information Literacy - Information Literacy and Performance Task- Project....
 
3. Information Literacy - Information Literacy and Performance Task- Project....
3. Information Literacy - Information Literacy and Performance Task- Project....3. Information Literacy - Information Literacy and Performance Task- Project....
3. Information Literacy - Information Literacy and Performance Task- Project....
 
Badges for Nature (HASTAC/DML proposal)
Badges for Nature (HASTAC/DML proposal)Badges for Nature (HASTAC/DML proposal)
Badges for Nature (HASTAC/DML proposal)
 
Sound Matters: a framework for the creative use and re-use of sound: field re...
Sound Matters: a framework for the creative use and re-use of sound: field re...Sound Matters: a framework for the creative use and re-use of sound: field re...
Sound Matters: a framework for the creative use and re-use of sound: field re...
 

More from MediaEval2012

A Multimodal Approach for Video Geocoding
A Multimodal Approach for   Video Geocoding A Multimodal Approach for   Video Geocoding
A Multimodal Approach for Video Geocoding MediaEval2012
 
CUNI at MediaEval 2012: Search and Hyperlinking Task
CUNI at MediaEval 2012: Search and Hyperlinking TaskCUNI at MediaEval 2012: Search and Hyperlinking Task
CUNI at MediaEval 2012: Search and Hyperlinking TaskMediaEval2012
 
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...MediaEval2012
 
Brave New Task: User Account Matching
Brave New Task: User Account MatchingBrave New Task: User Account Matching
Brave New Task: User Account MatchingMediaEval2012
 
The CLEF Initiative From 2010 to 2012 and Onwards
The CLEF Initiative From 2010 to 2012 and OnwardsThe CLEF Initiative From 2010 to 2012 and Onwards
The CLEF Initiative From 2010 to 2012 and OnwardsMediaEval2012
 
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...MediaEval2012
 
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...MediaEval2012
 
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect TaskNII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect TaskMediaEval2012
 
ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...
ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...
ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...MediaEval2012
 
The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Tr...
The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Tr...The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Tr...
The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Tr...MediaEval2012
 
UNICAMP-UFMG at MediaEval 2012: Genre Tagging Task
UNICAMP-UFMG at MediaEval 2012: Genre Tagging TaskUNICAMP-UFMG at MediaEval 2012: Genre Tagging Task
UNICAMP-UFMG at MediaEval 2012: Genre Tagging TaskMediaEval2012
 
TUD at MediaEval 2012 genre tagging task: Multi-modality video categorization...
TUD at MediaEval 2012 genre tagging task: Multi-modality video categorization...TUD at MediaEval 2012 genre tagging task: Multi-modality video categorization...
TUD at MediaEval 2012 genre tagging task: Multi-modality video categorization...MediaEval2012
 
ARF @ MediaEval 2012: Multimodal Video Classification
ARF @ MediaEval 2012: Multimodal Video ClassificationARF @ MediaEval 2012: Multimodal Video Classification
ARF @ MediaEval 2012: Multimodal Video ClassificationMediaEval2012
 
TUB @ MediaEval 2012 Tagging Task: Feature Selection Methods for Bag-of-(visu...
TUB @ MediaEval 2012 Tagging Task: Feature Selection Methods for Bag-of-(visu...TUB @ MediaEval 2012 Tagging Task: Feature Selection Methods for Bag-of-(visu...
TUB @ MediaEval 2012 Tagging Task: Feature Selection Methods for Bag-of-(visu...MediaEval2012
 
KIT at MediaEval 2012 – Content–based Genre Classification with Visual Cues
KIT at MediaEval 2012 – Content–based Genre Classification with Visual CuesKIT at MediaEval 2012 – Content–based Genre Classification with Visual Cues
KIT at MediaEval 2012 – Content–based Genre Classification with Visual CuesMediaEval2012
 
Overview of the MediaEval 2012 Tagging Task
Overview of the MediaEval 2012 Tagging TaskOverview of the MediaEval 2012 Tagging Task
Overview of the MediaEval 2012 Tagging TaskMediaEval2012
 
CUHK System for the Spoken Web Search task at Mediaeval 2012
CUHK System for the Spoken Web Search task at Mediaeval 2012CUHK System for the Spoken Web Search task at Mediaeval 2012
CUHK System for the Spoken Web Search task at Mediaeval 2012MediaEval2012
 
The TUM Cumulative DTW Approach for the Mediaeval 2012 Spoken Web Search Task
The TUM Cumulative DTW Approach for the Mediaeval 2012 Spoken Web Search TaskThe TUM Cumulative DTW Approach for the Mediaeval 2012 Spoken Web Search Task
The TUM Cumulative DTW Approach for the Mediaeval 2012 Spoken Web Search TaskMediaEval2012
 

More from MediaEval2012 (19)

A Multimodal Approach for Video Geocoding
A Multimodal Approach for   Video Geocoding A Multimodal Approach for   Video Geocoding
A Multimodal Approach for Video Geocoding
 
CUNI at MediaEval 2012: Search and Hyperlinking Task
CUNI at MediaEval 2012: Search and Hyperlinking TaskCUNI at MediaEval 2012: Search and Hyperlinking Task
CUNI at MediaEval 2012: Search and Hyperlinking Task
 
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
 
Brave New Task: User Account Matching
Brave New Task: User Account MatchingBrave New Task: User Account Matching
Brave New Task: User Account Matching
 
The CLEF Initiative From 2010 to 2012 and Onwards
The CLEF Initiative From 2010 to 2012 and OnwardsThe CLEF Initiative From 2010 to 2012 and Onwards
The CLEF Initiative From 2010 to 2012 and Onwards
 
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...
 
mevd2012 esra_
 mevd2012 esra_ mevd2012 esra_
mevd2012 esra_
 
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...
 
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect TaskNII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task
 
ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...
ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...
ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...
 
The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Tr...
The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Tr...The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Tr...
The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Tr...
 
UNICAMP-UFMG at MediaEval 2012: Genre Tagging Task
UNICAMP-UFMG at MediaEval 2012: Genre Tagging TaskUNICAMP-UFMG at MediaEval 2012: Genre Tagging Task
UNICAMP-UFMG at MediaEval 2012: Genre Tagging Task
 
TUD at MediaEval 2012 genre tagging task: Multi-modality video categorization...
TUD at MediaEval 2012 genre tagging task: Multi-modality video categorization...TUD at MediaEval 2012 genre tagging task: Multi-modality video categorization...
TUD at MediaEval 2012 genre tagging task: Multi-modality video categorization...
 
ARF @ MediaEval 2012: Multimodal Video Classification
ARF @ MediaEval 2012: Multimodal Video ClassificationARF @ MediaEval 2012: Multimodal Video Classification
ARF @ MediaEval 2012: Multimodal Video Classification
 
TUB @ MediaEval 2012 Tagging Task: Feature Selection Methods for Bag-of-(visu...
TUB @ MediaEval 2012 Tagging Task: Feature Selection Methods for Bag-of-(visu...TUB @ MediaEval 2012 Tagging Task: Feature Selection Methods for Bag-of-(visu...
TUB @ MediaEval 2012 Tagging Task: Feature Selection Methods for Bag-of-(visu...
 
KIT at MediaEval 2012 – Content–based Genre Classification with Visual Cues
KIT at MediaEval 2012 – Content–based Genre Classification with Visual CuesKIT at MediaEval 2012 – Content–based Genre Classification with Visual Cues
KIT at MediaEval 2012 – Content–based Genre Classification with Visual Cues
 
Overview of the MediaEval 2012 Tagging Task
Overview of the MediaEval 2012 Tagging TaskOverview of the MediaEval 2012 Tagging Task
Overview of the MediaEval 2012 Tagging Task
 
CUHK System for the Spoken Web Search task at Mediaeval 2012
CUHK System for the Spoken Web Search task at Mediaeval 2012CUHK System for the Spoken Web Search task at Mediaeval 2012
CUHK System for the Spoken Web Search task at Mediaeval 2012
 
The TUM Cumulative DTW Approach for the Mediaeval 2012 Spoken Web Search Task
The TUM Cumulative DTW Approach for the Mediaeval 2012 Spoken Web Search TaskThe TUM Cumulative DTW Approach for the Mediaeval 2012 Spoken Web Search Task
The TUM Cumulative DTW Approach for the Mediaeval 2012 Spoken Web Search Task
 

Recently uploaded

Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 

Recently uploaded (20)

Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 

Multimodal Music Tagging Task Overview

  • 1. Multimodal Music Tagging Task Nicola Orio – University of Padova Cynthia C. S. Liem – Delft University of Technology Geoffroy Peeters – UMR STMS IRCAM-CNRS, Paris Markus Schedl – Johannes Kepler University, Linz MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 1
  • 2. Multimodal music tagging • Definition • Songs of a commercial music library need to be categorized according to their usage in TV and radio broadcasts (e.g. soundtracks, jingles) • Practical motivation • The search for suitable music for video productions is a major activity for professionals and lay users alike • Collaborative filtering systems are taking their role • Notwithstanding their known limitations: long-tail, cold start… • Annotating professional music libraries is another important professional activity MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 2
  • 3. Human assessment Different sources of information are routinely exploited by professionals to overcome limitations of individual media MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 3
  • 4. Goals of MusiClef • To focus evaluation on professional application scenarios • Textual description of music items • To grant replication of experiments and results • Feature extraction phase is crucial – released features computed with public, open-source library (MIRToolbox) • To promote the exploitation of multimodal sources of information • Content (audio) + Context (tags & webpages) • To disseminate music related initiatives • Outside the music information retrieval community MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 4
  • 5. Evaluation initiatives – 1 • MIREX (since 2004) • Community-based selection of tasks • Many tasks address audio feature extraction algorithms • Participants submit algorithms that are run by organizers • Music files are not shared with participants • Million Song Dataset (since 2011) • Task on music recommendation proposed by organizers • Audio features are computed using proprietary algorithms • Only features are shared with participants MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 5
  • 6. Evaluation initiatives – 2 • Quaero-Eval (since 2012) • Tasks agreed with participants • Strategies to grant public access to evaluation results • Participants run training experiments on a shared repository • Runs on test set made by the organizers MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 6
  • 7. Test collection – 1 • Individual songs of pop and rock music • 1355 songs (from 218 artists) • train (975) and test (380) split • Social tags • Gathered from Last.fm API • Multilingual sets of Web pages related to artists+albums • Mined querying Google • Acoustic features: MFCC (using MIRToolbox) with a window length of 200ms and 50% overlap MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 7
  • 8. Test collection – 2 • Test collection created starting from the “500 Greatest Songs of All Time” (Rolling Stone) • Expected high number of social tags and web pages • Ground truth created by experts in the domain • 355 tags selected (167 genre, 288 usage) • Tags associated to less than 20 songs were discarded • Reference implementation in Matlab • Participants has an example to run a complete experiment • Code for the evaluation made already available MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 8
  • 9. Evaluation measures • Standard IR measures • Accuracy • Precision • Recall • Specificity • F-measure MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 9
  • 10. Examining tags more closely • Some tags are more equal than others… hard rock ballroom melancholic travel countryside bright • Thus, we propose to also analyze results employing a higher-level tag categorization MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 10
  • 11. Tag categorization – 1 • Affective, mood-related aspects: • activity: the amount of perceived music activity, without implying strong positive or negative affective qualities (e.g. 'fast', 'mellow', 'lazy') • affective state: affective qualities that can only be connected and attributed to living beings (e.g. 'aggressive', 'hopeful') • atmosphere: affective qualities that can be connected to environments (e.g. 'chaotic', 'intimate'). MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 11
  • 12. Tag categorization – 2 • Situation, time and space aspects of the music: • Physical situation: concrete physical environments (e.g. 'city', 'night'). • Occasion: implications of time and space, typically connected to social events (e.g. 'holiday', 'glamour'). • Sociocultural genre (e.g. 'new wave', 'r&b', 'punk') • Sound qualities: • timbral aspects (e.g. 'acoustic', 'bright') • temporal aspects (e.g. 'beat', 'groove'). • Other (e.g. 'catchy', 'evocative'). MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 12
  • 13. Reference implementation • Made in MATLAB and released publicly • Simple and straightforward approaches: • Individual GMMs for audio, user tags, web pages • Tagging process: 1-NN qualification using symmetrized KL • Scenarios tested: • Audio, user tags, web pages individually • Majority vote • Union MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 13
  • 14. Baseline results – 1 • Evaluation of the submitted runs and of the reference implementation • Results with different modalities over the full dataset strategy accuracy recall precision specificity f-measure audio 0.894 0.148 0.127 0.939 0.126 tags 0.898 0.061 0.039 0.942 0.037 web pages 0.897 0.050 0.007 0.954 0.011 majority 0.880 0.123 0.086 0.922 0.086 union 0.824 0.240 0.115 0.845 0.134 MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 14
  • 15. Baseline results – 2 1. activity, energy 2. affective state 3. atmosphere 4. other 5. situation: occasion 6. situation: physical 7. sociocultural genre 8. sound: temporal 9: sound: timbral MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 15
  • 16. Participation • Initially a lot of interest - about 8 explicitly interested parties • But ultimately just one participant (LUTIN UserLab) • Aggregation of estimators • Currently investigating what happened to the 7 others • So far, it appears ISMIR 2012 was inconveniently close • The 3 other MusiClef co-organizers will discuss this there MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 16
  • 17. Conclusions • We established a multimodal music tagging benchmark task • Special effort in facilitating deeper tag analysis • We would like a 2013 multimodal music benchmark task • Depending on survey input • Depending on your input MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 17
  • 18. Thank you for your attention! For contact and more information: musiclef@dei.unipd.it MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 18