SlideShare a Scribd company logo
Music Information Retrieval
DNA like classification of publicly accessible broadcast stations
Author:
Ganesh K Harugeri
Examiner:
apl. Prof. Dr. habil. Marcus Liwicki
What is Information Retrieval?
● It is a discipline that deals with retrieval of unstructured data
● Mainy textual documents
● This is in response to a query or topic statement
● Output may be unstructured or structured
● Example:
○ A sentence or even another document (Unstructured),
○ A boolean expression (Structured document) [1]
What is Music Information Retrieval?
● Music Information retrieval is an interdisciplinary research, involving
musicology, psychology, signal processing, machine learning or combination
among these disciplines.
● Music information retrieval includes
○ Discovery
○ Analysis and
○ Collection
○ of large amount of music or music related data.
Now, The Objective of the Study..
● Germany has over 200 publicly accessible radio stations
● Evaluate to systematically classify these radio stations
● Additionally, interpret these stations by a set of empirical factors
● The study helps in assigning DNA like identification after evaluation
● Few empirical factors to mention here are:
○ Proportion of German speaking artists (%)
○ Proportion of Newcomer artists (%)
○ Proportion of Distinct titles (%) etc.
The Study Workflow
Part-1
The study is divided into three tasks:
● Data collection
● Data validation and
● Data analysis
Part-2
As research experiment
● Audio fingerprint use in broadcast monitoring
Data collection
● Multiple sources to address to collect all required data
○ By commercial music monitoring service
○ By web scrapers (for most of the classical radio stations)
● Commercial sources
○ CSV
● Web scrapers
○ API used - jsoup
● Requester programs (collective work)
○ These collect music metadata from music information services
○ Two types - programs taking artist as input, another artist-title combination as
input
○ Those are Deezer, Discogs, MusicBrainz, SoundCloud and Spotify
Data Flow and Data CollectionProcess
Input for Requesters and their Output Usage
Experiment on Audio Fingerprint
● What is audio fingerprinting?
○ It is a summary of an audio object using a limited number of bits [2].
● Audio fingerprint need
○ It is crucial to think resourceful ways to accelerate the process
○ To analyse how effectively audio fingerprinting can be used in broadcast monitoring
○ To reduce the number of repeated requests to music information services.
Audio Fingerprint Realisation
● Various recent APIs were analysed
● Finally, musicg API was used which suited best to system requirements
● The system is prepared to take “.wav” file as input and generate fingerprints
● Use of the system:
○ Audio stream of 24 hrs is splitted to songs (.mp3)
○ “.mp3” are converted to “.wav” (ffmpeg)
○ This wave input generates fingerprint in byte stream
○ It give an opportunity to save it is as filetype or as byte[] in database
○ It also has an function to find similarity with other songs
Audio Fingerprinting Framework
Fingerprint Time Evaluation Analysis
Conclusions
1 2 6 7 8 10
Stations Deutsch
Sprachig
E-/gehob
Music
Anteil
Regional
Anteil
Nische
Anteil
Nach-
wuchs
Anteil
Eindeutige
Titel
Station1 16% 0% 1% 17% 26% 3%
Station2 12% 0% 0% 31% 10% 6%
Station3 17% 0% 0% 29% 25% 15%
Station4 18% 0% 0% 18% 21% 7%
Station5 3% 1% 0% 8% 10% 4%
Conclusions
● Station1 air played about 16% of songs of German artists (factor 1).
● Station3 air played 25% of the songs were originated by so called newcomers
identified by having not more than 2 albums published (Factor 8)
Other Benefits
● It is easy to conduct an investigation over a defined period, such as a month
● Documentation about empirical factors helps in calculation of royalties or
identification of copyright infringements
● Content categorisation helps in assigning a unique description to stations by
contents variety.
References
● Ed Greengrass. “Information retrieval: A survey”. In: (2000).
● S. Shum. An Introduction to Audio Fingerprinting," SLS Group Meeting,
October 2011. Oct. 2011. URL: http : / / people . csail . mit . edu / sshum
/talks/audio_fingerprinting_sls_24Oct2011.pdf

More Related Content

Similar to Thesis presentation on Music Information Retrieval

Multimodal Features for Linking Television Content
Multimodal Features for Linking Television ContentMultimodal Features for Linking Television Content
Multimodal Features for Linking Television ContentPetra Galuscakova
 
Automatic Music Transcription
Automatic Music TranscriptionAutomatic Music Transcription
Automatic Music TranscriptionKhyati Ganatra
 
Audio Information for Hyperlinking of TV Content
Audio Information for Hyperlinking of TV ContentAudio Information for Hyperlinking of TV Content
Audio Information for Hyperlinking of TV ContentPetra Galuscakova
 
ICT-GroupProject-Report2-NguyenDangHoa_2
ICT-GroupProject-Report2-NguyenDangHoa_2ICT-GroupProject-Report2-NguyenDangHoa_2
ICT-GroupProject-Report2-NguyenDangHoa_2Minh Tuan Nguyen
 
Introduction of my research histroy: From instrument recognition to support o...
Introduction of my research histroy: From instrument recognition to support o...Introduction of my research histroy: From instrument recognition to support o...
Introduction of my research histroy: From instrument recognition to support o...kthrlab
 
Aichroth audio forensics and automation
Aichroth audio forensics and automationAichroth audio forensics and automation
Aichroth audio forensics and automationFIAT/IFTA
 
Big data and machine learning @ Spotify
Big data and machine learning @ SpotifyBig data and machine learning @ Spotify
Big data and machine learning @ SpotifyOscar Carlsson
 
Data Science Game 2017 - Machine Learning Meetup Presentation
Data Science Game 2017 - Machine Learning Meetup PresentationData Science Game 2017 - Machine Learning Meetup Presentation
Data Science Game 2017 - Machine Learning Meetup PresentationLuis Moneda
 
05f 1805 felixdidier_a2b
05f 1805 felixdidier_a2b05f 1805 felixdidier_a2b
05f 1805 felixdidier_a2bguestd90cb0
 
Arcomem training entities-and-events_advanced
Arcomem training entities-and-events_advancedArcomem training entities-and-events_advanced
Arcomem training entities-and-events_advancedarcomem
 
Luis Aguiar: Platforms, Promotion, and Product Discovery: Evidence from Spoti...
Luis Aguiar: Platforms, Promotion, and Product Discovery: Evidence from Spoti...Luis Aguiar: Platforms, Promotion, and Product Discovery: Evidence from Spoti...
Luis Aguiar: Platforms, Promotion, and Product Discovery: Evidence from Spoti...Marius Miron
 
Nithin Xavier research_proposal
Nithin Xavier research_proposalNithin Xavier research_proposal
Nithin Xavier research_proposalNithin Xavier
 
Fraunhofer iais audio mining - automatic metadata gereration of audio streams...
Fraunhofer iais audio mining - automatic metadata gereration of audio streams...Fraunhofer iais audio mining - automatic metadata gereration of audio streams...
Fraunhofer iais audio mining - automatic metadata gereration of audio streams...FIAT/IFTA
 
AI THROUGH THE EYES OF ORGANISE SOUND
AI THROUGH THE EYES OF ORGANISE SOUNDAI THROUGH THE EYES OF ORGANISE SOUND
AI THROUGH THE EYES OF ORGANISE SOUNDJaideep Ghosh
 
MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...
MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...
MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...multimediaeval
 
AV Digitization Projects: Tools and Strategies for Enhancing Impact and Engag...
AV Digitization Projects: Tools and Strategies for Enhancing Impact and Engag...AV Digitization Projects: Tools and Strategies for Enhancing Impact and Engag...
AV Digitization Projects: Tools and Strategies for Enhancing Impact and Engag...WGBH Media Library and Archives
 
Mood classification of songs based on lyrics
Mood classification of songs based on lyricsMood classification of songs based on lyrics
Mood classification of songs based on lyricsFrancesco Cucari
 
Mimo musical instrument museums online project fp7 final review master
Mimo musical instrument museums online project fp7  final review masterMimo musical instrument museums online project fp7  final review master
Mimo musical instrument museums online project fp7 final review masterJohn Scally
 

Similar to Thesis presentation on Music Information Retrieval (20)

Multimodal Features for Linking Television Content
Multimodal Features for Linking Television ContentMultimodal Features for Linking Television Content
Multimodal Features for Linking Television Content
 
Automatic Music Transcription
Automatic Music TranscriptionAutomatic Music Transcription
Automatic Music Transcription
 
Audio Information for Hyperlinking of TV Content
Audio Information for Hyperlinking of TV ContentAudio Information for Hyperlinking of TV Content
Audio Information for Hyperlinking of TV Content
 
ICT-GroupProject-Report2-NguyenDangHoa_2
ICT-GroupProject-Report2-NguyenDangHoa_2ICT-GroupProject-Report2-NguyenDangHoa_2
ICT-GroupProject-Report2-NguyenDangHoa_2
 
Introduction of my research histroy: From instrument recognition to support o...
Introduction of my research histroy: From instrument recognition to support o...Introduction of my research histroy: From instrument recognition to support o...
Introduction of my research histroy: From instrument recognition to support o...
 
Aichroth audio forensics and automation
Aichroth audio forensics and automationAichroth audio forensics and automation
Aichroth audio forensics and automation
 
Big data and machine learning @ Spotify
Big data and machine learning @ SpotifyBig data and machine learning @ Spotify
Big data and machine learning @ Spotify
 
Data Science Game 2017 - Machine Learning Meetup Presentation
Data Science Game 2017 - Machine Learning Meetup PresentationData Science Game 2017 - Machine Learning Meetup Presentation
Data Science Game 2017 - Machine Learning Meetup Presentation
 
CHAPTER – 5 Audio
CHAPTER – 5     AudioCHAPTER – 5     Audio
CHAPTER – 5 Audio
 
05f 1805 felixdidier_a2b
05f 1805 felixdidier_a2b05f 1805 felixdidier_a2b
05f 1805 felixdidier_a2b
 
Arcomem training entities-and-events_advanced
Arcomem training entities-and-events_advancedArcomem training entities-and-events_advanced
Arcomem training entities-and-events_advanced
 
Big Data in the Music Industries, Dagfinn Bach, Bach Technology
Big Data in the Music Industries, Dagfinn Bach, Bach TechnologyBig Data in the Music Industries, Dagfinn Bach, Bach Technology
Big Data in the Music Industries, Dagfinn Bach, Bach Technology
 
Luis Aguiar: Platforms, Promotion, and Product Discovery: Evidence from Spoti...
Luis Aguiar: Platforms, Promotion, and Product Discovery: Evidence from Spoti...Luis Aguiar: Platforms, Promotion, and Product Discovery: Evidence from Spoti...
Luis Aguiar: Platforms, Promotion, and Product Discovery: Evidence from Spoti...
 
Nithin Xavier research_proposal
Nithin Xavier research_proposalNithin Xavier research_proposal
Nithin Xavier research_proposal
 
Fraunhofer iais audio mining - automatic metadata gereration of audio streams...
Fraunhofer iais audio mining - automatic metadata gereration of audio streams...Fraunhofer iais audio mining - automatic metadata gereration of audio streams...
Fraunhofer iais audio mining - automatic metadata gereration of audio streams...
 
AI THROUGH THE EYES OF ORGANISE SOUND
AI THROUGH THE EYES OF ORGANISE SOUNDAI THROUGH THE EYES OF ORGANISE SOUND
AI THROUGH THE EYES OF ORGANISE SOUND
 
MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...
MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...
MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...
 
AV Digitization Projects: Tools and Strategies for Enhancing Impact and Engag...
AV Digitization Projects: Tools and Strategies for Enhancing Impact and Engag...AV Digitization Projects: Tools and Strategies for Enhancing Impact and Engag...
AV Digitization Projects: Tools and Strategies for Enhancing Impact and Engag...
 
Mood classification of songs based on lyrics
Mood classification of songs based on lyricsMood classification of songs based on lyrics
Mood classification of songs based on lyrics
 
Mimo musical instrument museums online project fp7 final review master
Mimo musical instrument museums online project fp7  final review masterMimo musical instrument museums online project fp7  final review master
Mimo musical instrument museums online project fp7 final review master
 

Recently uploaded

Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单enxupq
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单ewymefz
 
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Domenico Conte
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationBoston Institute of Analytics
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...elinavihriala
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatheahmadsaood
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundOppotus
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单nscud
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单ocavb
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单nscud
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单ewymefz
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单ewymefz
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单ewymefz
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单ewymefz
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJames Polillo
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单enxupq
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sMAQIB18
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesStarCompliance.io
 

Recently uploaded (20)

Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage s
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
 

Thesis presentation on Music Information Retrieval

  • 1. Music Information Retrieval DNA like classification of publicly accessible broadcast stations Author: Ganesh K Harugeri Examiner: apl. Prof. Dr. habil. Marcus Liwicki
  • 2. What is Information Retrieval? ● It is a discipline that deals with retrieval of unstructured data ● Mainy textual documents ● This is in response to a query or topic statement ● Output may be unstructured or structured ● Example: ○ A sentence or even another document (Unstructured), ○ A boolean expression (Structured document) [1]
  • 3. What is Music Information Retrieval? ● Music Information retrieval is an interdisciplinary research, involving musicology, psychology, signal processing, machine learning or combination among these disciplines. ● Music information retrieval includes ○ Discovery ○ Analysis and ○ Collection ○ of large amount of music or music related data.
  • 4. Now, The Objective of the Study.. ● Germany has over 200 publicly accessible radio stations ● Evaluate to systematically classify these radio stations ● Additionally, interpret these stations by a set of empirical factors ● The study helps in assigning DNA like identification after evaluation ● Few empirical factors to mention here are: ○ Proportion of German speaking artists (%) ○ Proportion of Newcomer artists (%) ○ Proportion of Distinct titles (%) etc.
  • 5. The Study Workflow Part-1 The study is divided into three tasks: ● Data collection ● Data validation and ● Data analysis Part-2 As research experiment ● Audio fingerprint use in broadcast monitoring
  • 6. Data collection ● Multiple sources to address to collect all required data ○ By commercial music monitoring service ○ By web scrapers (for most of the classical radio stations) ● Commercial sources ○ CSV ● Web scrapers ○ API used - jsoup ● Requester programs (collective work) ○ These collect music metadata from music information services ○ Two types - programs taking artist as input, another artist-title combination as input ○ Those are Deezer, Discogs, MusicBrainz, SoundCloud and Spotify
  • 7. Data Flow and Data CollectionProcess
  • 8. Input for Requesters and their Output Usage
  • 9. Experiment on Audio Fingerprint ● What is audio fingerprinting? ○ It is a summary of an audio object using a limited number of bits [2]. ● Audio fingerprint need ○ It is crucial to think resourceful ways to accelerate the process ○ To analyse how effectively audio fingerprinting can be used in broadcast monitoring ○ To reduce the number of repeated requests to music information services.
  • 10. Audio Fingerprint Realisation ● Various recent APIs were analysed ● Finally, musicg API was used which suited best to system requirements ● The system is prepared to take “.wav” file as input and generate fingerprints ● Use of the system: ○ Audio stream of 24 hrs is splitted to songs (.mp3) ○ “.mp3” are converted to “.wav” (ffmpeg) ○ This wave input generates fingerprint in byte stream ○ It give an opportunity to save it is as filetype or as byte[] in database ○ It also has an function to find similarity with other songs
  • 13. Conclusions 1 2 6 7 8 10 Stations Deutsch Sprachig E-/gehob Music Anteil Regional Anteil Nische Anteil Nach- wuchs Anteil Eindeutige Titel Station1 16% 0% 1% 17% 26% 3% Station2 12% 0% 0% 31% 10% 6% Station3 17% 0% 0% 29% 25% 15% Station4 18% 0% 0% 18% 21% 7% Station5 3% 1% 0% 8% 10% 4%
  • 14. Conclusions ● Station1 air played about 16% of songs of German artists (factor 1). ● Station3 air played 25% of the songs were originated by so called newcomers identified by having not more than 2 albums published (Factor 8)
  • 15. Other Benefits ● It is easy to conduct an investigation over a defined period, such as a month ● Documentation about empirical factors helps in calculation of royalties or identification of copyright infringements ● Content categorisation helps in assigning a unique description to stations by contents variety.
  • 16. References ● Ed Greengrass. “Information retrieval: A survey”. In: (2000). ● S. Shum. An Introduction to Audio Fingerprinting," SLS Group Meeting, October 2011. Oct. 2011. URL: http : / / people . csail . mit . edu / sshum /talks/audio_fingerprinting_sls_24Oct2011.pdf