SlideShare a Scribd company logo
sankalp.gulati@upf.edu
Discovery and Characterization of
Melodic Motives in Large Audio Music
Collections
PhD Proposal Defense
Music Technology Group, Universitat Pompeu Fabra, Barcelona,
Spain
Sankalp Gulati
Supervisor: Prof. Xavier Serra
sankalp.gulati@upf.edu
Patterns
Images at right half taken from- (Mueen & Keogh, 2009) and (Mueen & Keogh, 2010)
sankalp.gulati@upf.edu
Melodic Patterns
Top right image taken from - (Mueen & Keogh, 2009) and (Mueen & Keogh, 2010)
sankalp.gulati@upf.edu
Melodic Motives
(Patterns)
Melodic Motives
Top right image taken from - (Mueen & Keogh, 2009) and (Mueen & Keogh, 2010)
sankalp.gulati@upf.edu
Melodic Motives
Discovery
Induction
Extraction
Matching
Retrieval
Discovery Melodic Motives
+
Image taken from - (Mueen & Keogh, 2009)
sankalp.gulati@upf.edu
Large Audio Music Collections
Discovery Melodic Motives
Large Audio Music Collections
> 500,000
> 550 hours
sankalp.gulati@upf.edu
Characterizatio
n
Discovery Characterization Melodic Motives
Large Audio Music Collections
Transform
N
dimensions
sankalp.gulati@upf.edu
Discovery and Characterization of
Melodic Motives in Large Audio Music
Collections
PhD Proposal Defense
Music Technology Group, Universitat Pompeu
Fabra, Barcelona, Spain
Sankalp Gulati
Supervisor: Prof. Xavier Serra
sankalp.gulati@upf.edu
 Music Information Research (MIR)
Introduction
sankalp.gulati@upf.edu
Introduction
 Music->Melody (pitch, loudness, timbre)
 “It is melody that enables us to distinguish one work from
another. It is melody that human beings are innately able to
reproduce by singing, humming, and whistling. It is melody that
makes music memorable: we are likely to recall a tune long after
we have forgotten its text” -(Selfridge-Field, 1998)
 Audio example:
sankalp.gulati@upf.edu
Introduction
 Melodic Analysis : Melodic Motives
 Computational Melodic Motivic Analysis
Hungarian, Slovak, French, Sicilian, Bulgaria
n and Appalachian Folk Melodies -
(Juhász, 2006)
Cretan, Nova scotia and Essen Folk Melodies
– (Conklin and Anagnostopoulou, 2010, 2006)
Tunisian modal music
-(Lartillot & Ayari, 2006).
sankalp.gulati@upf.edu
Introduction
 Melodic Motivic Discovery in Audio Music
Signals?
 Is it needed?
 Why so little work?
 Solution?
sankalp.gulati@upf.edu
Introduction
 Indian Art Music: Opportunities
 Heterophonic Music
 Melodic framework (Rāg)
 Importance of melodic phrases (Pakads, Chalans)
 Available audio music repertoire
sankalp.gulati@upf.edu
Introduction: Broad Research
Goals
 Broad Research Goals:
 Computational methodology for melodic motivic
discovery in large audio music collection utilizing
domain specific knowledge.
 Melodic motivic analysis methodology
 Similarity measures based on melodic motives
 Compilation of sizeable audio music collection of
Indian art music
 Summarize and compile existing literature
sankalp.gulati@upf.edu
Introduction: Goals and Motivation
 Motivation:
 Lack of approaches for melodic motif extraction in
audio signal
 Lack of utilization of domain specific knowledge in
computational methodologies
 Further state of the art in pattern processing in MIR
sankalp.gulati@upf.edu
Proposed Methodology
sankalp.gulati@upf.edu
Proposed Methodology: Overview
Block Diagram for proposed methodology
sankalp.gulati@upf.edu
Proposed Methodology: Data Collection
 Audio
 Metadata
 Annotations > 550 hours
sankalp.gulati@upf.edu
Proposed Methodology: Melodic Feature
Extraction
 Pitch, loudness and timbre features
 Pitch: F0 frequency contour of predominant melodic source. Use -
(Salamon & Gómez, 2012)
 Loudness: Perceptual loudness computed using only predominant
melodic source. Use - (Zwicker, 1977)
 Timbre: Centroid of the spectral envelope of the predominant melodic
source. Use - (Röbel & Rodet, 2005).
Predominant F0
Frequency
estimation
Synthesize
predominant melodic
source
Loudness
feature
extraction
Timbre feature
extraction
Audio
sankalp.gulati@upf.edu
 Evaluation: predominant F0 frequency
estimation
 6 Hindustani music pieces ~45 mins
Proposed Methodology: Melodic Feature
Extraction
sankalp.gulati@upf.edu
 Compact + Abstract/reduced
 Challenges:
 Heavy meandering around notes (Gamakas)
 Svar intonation
 Aroh-Avroh dependent svar intonation
 F0 frequency contour musical pitch perception
2.215 2.22 2.225 2.23 2.235 2.24
x 10
4
1300
1400
1500
1600
1700
1800
1900
2000
Time (1 sample = 10 ms)
PredominantF0frequency(Cents)
Proposed Methodology: Melodic
Representation
sankalp.gulati@upf.edu
 Continuous time varying values of
pitch, loudness and timbral features
 Possibilities
 Melody transcription
 SAX based symbolic representation
 Parametric representation (no studies!!)
 Saddle point based representation
 Domain knowledge
 Svar intonation profiles
Proposed Methodology: Melodic
Representation
sankalp.gulati@upf.edu
Proposed Methodology: Melodic Similarity
 Challenges
 Melodic representation
 Large timing variations
 Pitch variations (ornamentations)
 Differentiating a characteristic phrase from a melodic
sequence using same svars
 Fixing similarity threshold
 Audio example:
 Dynamic Time Warping (Initial experiments)
 DTW > (SAX + Euclidean distances) (Ross, Vinutha, and
Rao,2012)
sankalp.gulati@upf.edu
 Possibilities
 Euclidian and Mahalanobis distance measures
 HMM based distance measures
 Dynamic time warping based distances
 Step and boundary conditions
 Constraints
 Context dependent DTW
 Domain Knowledge
 DTW constraint parameters
 Pattern dependent similarity threshold
 Weighted distance measures
Proposed Methodology: Melodic Similarity
sankalp.gulati@upf.edu
Proposed Methodology: Pattern Extraction
 Challenges:
 Melodic segmentation
 Different motif lengths
 Large volume of audio data
 Exact melodic similarity ~ parametric melodic
representation
1000 1200 1400 1600 1800 2000 2200
160
180
200
220
240
260
280
300
320
Time (1 sample = 10 ms)
PredominantF0frequency(Hz)
Match Matrix
sankalp.gulati@upf.edu
 Ongoing work
 Music parallelismMelodic segmentation
 Motif discovery in time series analysis domain
 Fast brute force exhaustive pattern search
 Pruning strategies
1000 1200 1400 1600 1800 2000 2200
160
180
200
220
240
260
280
300
320
Time (1 sample = 10 ms)
PredominantF0frequency(Hz)
Proposed Methodology: Pattern Extraction
sankalp.gulati@upf.edu
 Possibilities
 Sparse similarity matrices
 Lower bounds on distance measures
 Phase space embedding/recurrent plots
 Suffix trees (~parametric representation)
 Domain knowledge
 Probable phrase boundaries
 Pruning rules
 Motif characteristics
Proposed Methodology: Pattern Extraction
sankalp.gulati@upf.edu
Proposed Methodology: Melodic Motivic
Analysis
 Challenges
 Non uniform length of motives
 Directions
 Clustering
 K-mean clustering
 Self organizing maps
 Fractal Analysis
 Application
 Rāg characterization
 Rāg specific motives
 Shared motives
Transform
N dimensions
sankalp.gulati@upf.edu
Proposed Methodology: Evaluation
 Challenges
 No annotated corpus
 Human subjectivity in similarity related tasks
 Listening tests
 Feedback through Dunya users
sankalp.gulati@upf.edu
References
 Selfridge-Field, E. (1998). Conceptual and representational issues in melodic comparison.
Computing in musicology: a directory of research(11), 3–64.
 Juhász, Z. (2006, June). A systematic comparison of different European folk music
traditions using self-organizing maps. Journal of New Music Research, 35(2), 95–112.
 Conklin, D., & Anagnostopoulou, C. (2006). Segmental pattern discovery in music.
INFORMS Journal on Computing, 18(3), 285–293.
 Lartillot, O., & Ayari, M. (2006). Motivic pattern extraction in music, and application to the
study of Tunisian modal music. South African Computer Journal, 36, 16–28.
 Salamon, J., & Gómez, E. (2012, August). Melody Extraction From Polyphonic Music
Signals Using Pitch Contour Characteristics. IEEE Transactions on Audio, Speech, and
Language Processing, 20(6), 1759–1770.
 Zwicker, E. (1977). Procedure for calculating loudness of temporally variable sounds. The
Journal of the Acoustical Society of America, 62(3), 675–682.
 Röbel, A., & Rodet, X. (2005). Efficient spectral envelope estimation and its application to
pitch shifting and envelope preservation. In Proc. dafx.
 Ross, J. C., Vinutha, T., & Rao, P. (2012). Detecting melodic motifs from audio for
hindustani classical music. In Proceedings of the 13th international society for music
information retrieval conference, porto, portugal.
 Mueen, A., Keogh, E. J., Zhu, Q., Cash, S., & Westover, M. B. (2009, April). Exact
Discovery of Time Series Motifs. In SDM (pp. 473-484).
 Mueen, A., & Keogh, E. (2010, July). Online discovery and maintenance of time series
motifs. In Proceedings of the 16th ACM SIGKDD international conference on Knowledge
discovery and data mining (pp. 1089-1098). ACM.
sankalp.gulati@upf.edu
Work Plan

More Related Content

Similar to Discovery and Characterization of Melodic Motives in Large Audio Music Collections

Nithin Xavier research_proposal
Nithin Xavier research_proposalNithin Xavier research_proposal
Nithin Xavier research_proposal
Nithin Xavier
 
The past, present and future of singing synthesis
The past, present and future of singing synthesisThe past, present and future of singing synthesis
The past, present and future of singing synthesis
Eji Warp
 
Performance Comparison of Musical Instrument Family Classification Using Soft...
Performance Comparison of Musical Instrument Family Classification Using Soft...Performance Comparison of Musical Instrument Family Classification Using Soft...
Performance Comparison of Musical Instrument Family Classification Using Soft...
Waqas Tariq
 
Towards Web-Scale Analysis of Musical Structure
Towards Web-Scale Analysis of Musical Structure Towards Web-Scale Analysis of Musical Structure
Towards Web-Scale Analysis of Musical Structure
David De Roure
 
J. S. Downie, D. De Roure, K. Page.Towards Web-Scale Analysis of Musical Stru...
J. S. Downie, D. De Roure, K. Page.Towards Web-Scale Analysis of Musical Stru...J. S. Downie, D. De Roure, K. Page.Towards Web-Scale Analysis of Musical Stru...
J. S. Downie, D. De Roure, K. Page.Towards Web-Scale Analysis of Musical Stru...
MusicNet
 
DETECTING AND LOCATING PLAGIARISM OF MUSIC MELODIES BY PATH EXPLORATION OVER ...
DETECTING AND LOCATING PLAGIARISM OF MUSIC MELODIES BY PATH EXPLORATION OVER ...DETECTING AND LOCATING PLAGIARISM OF MUSIC MELODIES BY PATH EXPLORATION OVER ...
DETECTING AND LOCATING PLAGIARISM OF MUSIC MELODIES BY PATH EXPLORATION OVER ...
csandit
 
The convergence of "hard" and "soft"in music technology, Rolf Inge Godøy, UiO
The convergence of "hard" and "soft"in music technology, Rolf Inge Godøy, UiOThe convergence of "hard" and "soft"in music technology, Rolf Inge Godøy, UiO
The convergence of "hard" and "soft"in music technology, Rolf Inge Godøy, UiO
The Research Council of Norway, IKTPLUSS
 
Transferring Singing Expressions from One Voice to Another for a Given Song
Transferring Singing Expressions from One Voice to Another for a Given SongTransferring Singing Expressions from One Voice to Another for a Given Song
Transferring Singing Expressions from One Voice to Another for a Given Song
NAVER Engineering
 
Genre Classification and Analysis
Genre Classification and AnalysisGenre Classification and Analysis
Genre Classification and Analysis
Anat Gilboa
 
Wilkie
WilkieWilkie
Wilkie
anesah
 
Human Perception and Recognition of Musical Instruments: A Review
Human Perception and Recognition of Musical Instruments: A ReviewHuman Perception and Recognition of Musical Instruments: A Review
Human Perception and Recognition of Musical Instruments: A Review
Editor IJCATR
 
Hsc2017 music
Hsc2017 musicHsc2017 music
Hsc2017 music
PublicLibraryServices
 
Computational Approaches for Melodic Description in Indian Art Music Corpora
Computational Approaches for Melodic Description in Indian Art Music CorporaComputational Approaches for Melodic Description in Indian Art Music Corpora
Computational Approaches for Melodic Description in Indian Art Music Corpora
Sankalp Gulati
 
Emofy
Emofy Emofy
Management of Music collections
Management of Music collectionsManagement of Music collections
Management of Music collections
Agnes Namaganda
 
Ism2011
Ism2011Ism2011
Ism2011
Shlomo Dubnov
 
Audio Source Separation Based on Low-Rank Structure and Statistical Independence
Audio Source Separation Based on Low-Rank Structure and Statistical IndependenceAudio Source Separation Based on Low-Rank Structure and Statistical Independence
Audio Source Separation Based on Low-Rank Structure and Statistical Independence
Daichi Kitamura
 
A Computational Framework for Sound Segregation in Music Signals using Marsyas
A Computational Framework for Sound Segregation in Music Signals using MarsyasA Computational Framework for Sound Segregation in Music Signals using Marsyas
A Computational Framework for Sound Segregation in Music Signals using Marsyas
Luís Gustavo Martins
 
Dancing Machine to Doesn't Really Matter: Analysis of pop music
Dancing Machine to Doesn't Really Matter: Analysis of pop musicDancing Machine to Doesn't Really Matter: Analysis of pop music
Dancing Machine to Doesn't Really Matter: Analysis of pop music
JAYALAKSHMI NAIR
 

Similar to Discovery and Characterization of Melodic Motives in Large Audio Music Collections (19)

Nithin Xavier research_proposal
Nithin Xavier research_proposalNithin Xavier research_proposal
Nithin Xavier research_proposal
 
The past, present and future of singing synthesis
The past, present and future of singing synthesisThe past, present and future of singing synthesis
The past, present and future of singing synthesis
 
Performance Comparison of Musical Instrument Family Classification Using Soft...
Performance Comparison of Musical Instrument Family Classification Using Soft...Performance Comparison of Musical Instrument Family Classification Using Soft...
Performance Comparison of Musical Instrument Family Classification Using Soft...
 
Towards Web-Scale Analysis of Musical Structure
Towards Web-Scale Analysis of Musical Structure Towards Web-Scale Analysis of Musical Structure
Towards Web-Scale Analysis of Musical Structure
 
J. S. Downie, D. De Roure, K. Page.Towards Web-Scale Analysis of Musical Stru...
J. S. Downie, D. De Roure, K. Page.Towards Web-Scale Analysis of Musical Stru...J. S. Downie, D. De Roure, K. Page.Towards Web-Scale Analysis of Musical Stru...
J. S. Downie, D. De Roure, K. Page.Towards Web-Scale Analysis of Musical Stru...
 
DETECTING AND LOCATING PLAGIARISM OF MUSIC MELODIES BY PATH EXPLORATION OVER ...
DETECTING AND LOCATING PLAGIARISM OF MUSIC MELODIES BY PATH EXPLORATION OVER ...DETECTING AND LOCATING PLAGIARISM OF MUSIC MELODIES BY PATH EXPLORATION OVER ...
DETECTING AND LOCATING PLAGIARISM OF MUSIC MELODIES BY PATH EXPLORATION OVER ...
 
The convergence of "hard" and "soft"in music technology, Rolf Inge Godøy, UiO
The convergence of "hard" and "soft"in music technology, Rolf Inge Godøy, UiOThe convergence of "hard" and "soft"in music technology, Rolf Inge Godøy, UiO
The convergence of "hard" and "soft"in music technology, Rolf Inge Godøy, UiO
 
Transferring Singing Expressions from One Voice to Another for a Given Song
Transferring Singing Expressions from One Voice to Another for a Given SongTransferring Singing Expressions from One Voice to Another for a Given Song
Transferring Singing Expressions from One Voice to Another for a Given Song
 
Genre Classification and Analysis
Genre Classification and AnalysisGenre Classification and Analysis
Genre Classification and Analysis
 
Wilkie
WilkieWilkie
Wilkie
 
Human Perception and Recognition of Musical Instruments: A Review
Human Perception and Recognition of Musical Instruments: A ReviewHuman Perception and Recognition of Musical Instruments: A Review
Human Perception and Recognition of Musical Instruments: A Review
 
Hsc2017 music
Hsc2017 musicHsc2017 music
Hsc2017 music
 
Computational Approaches for Melodic Description in Indian Art Music Corpora
Computational Approaches for Melodic Description in Indian Art Music CorporaComputational Approaches for Melodic Description in Indian Art Music Corpora
Computational Approaches for Melodic Description in Indian Art Music Corpora
 
Emofy
Emofy Emofy
Emofy
 
Management of Music collections
Management of Music collectionsManagement of Music collections
Management of Music collections
 
Ism2011
Ism2011Ism2011
Ism2011
 
Audio Source Separation Based on Low-Rank Structure and Statistical Independence
Audio Source Separation Based on Low-Rank Structure and Statistical IndependenceAudio Source Separation Based on Low-Rank Structure and Statistical Independence
Audio Source Separation Based on Low-Rank Structure and Statistical Independence
 
A Computational Framework for Sound Segregation in Music Signals using Marsyas
A Computational Framework for Sound Segregation in Music Signals using MarsyasA Computational Framework for Sound Segregation in Music Signals using Marsyas
A Computational Framework for Sound Segregation in Music Signals using Marsyas
 
Dancing Machine to Doesn't Really Matter: Analysis of pop music
Dancing Machine to Doesn't Really Matter: Analysis of pop musicDancing Machine to Doesn't Really Matter: Analysis of pop music
Dancing Machine to Doesn't Really Matter: Analysis of pop music
 

More from Sankalp Gulati

Landmark Detection in Hindustani Music Melodies
Landmark Detection in Hindustani Music MelodiesLandmark Detection in Hindustani Music Melodies
Landmark Detection in Hindustani Music Melodies
Sankalp Gulati
 
Phrase-based Rāga Recognition Using Vector Space Modeling
Phrase-based Rāga Recognition Using Vector Space ModelingPhrase-based Rāga Recognition Using Vector Space Modeling
Phrase-based Rāga Recognition Using Vector Space Modeling
Sankalp Gulati
 
[Tutorial] Computational Approaches to Melodic Analysis of Indian Art Music
[Tutorial] Computational Approaches to Melodic Analysis of Indian Art Music[Tutorial] Computational Approaches to Melodic Analysis of Indian Art Music
[Tutorial] Computational Approaches to Melodic Analysis of Indian Art Music
Sankalp Gulati
 
Computational Melodic Analysis of Indian Art Music
Computational Melodic Analysis of Indian Art MusicComputational Melodic Analysis of Indian Art Music
Computational Melodic Analysis of Indian Art Music
Sankalp Gulati
 
Hindify
HindifyHindify
Tonic Identification System for Hindustani and Carnatic Music
Tonic Identification System for Hindustani and Carnatic MusicTonic Identification System for Hindustani and Carnatic Music
Tonic Identification System for Hindustani and Carnatic Music
Sankalp Gulati
 
Tonic Identification System for Indian Art Music
Tonic Identification System for Indian Art MusicTonic Identification System for Indian Art Music
Tonic Identification System for Indian Art Music
Sankalp Gulati
 

More from Sankalp Gulati (7)

Landmark Detection in Hindustani Music Melodies
Landmark Detection in Hindustani Music MelodiesLandmark Detection in Hindustani Music Melodies
Landmark Detection in Hindustani Music Melodies
 
Phrase-based Rāga Recognition Using Vector Space Modeling
Phrase-based Rāga Recognition Using Vector Space ModelingPhrase-based Rāga Recognition Using Vector Space Modeling
Phrase-based Rāga Recognition Using Vector Space Modeling
 
[Tutorial] Computational Approaches to Melodic Analysis of Indian Art Music
[Tutorial] Computational Approaches to Melodic Analysis of Indian Art Music[Tutorial] Computational Approaches to Melodic Analysis of Indian Art Music
[Tutorial] Computational Approaches to Melodic Analysis of Indian Art Music
 
Computational Melodic Analysis of Indian Art Music
Computational Melodic Analysis of Indian Art MusicComputational Melodic Analysis of Indian Art Music
Computational Melodic Analysis of Indian Art Music
 
Hindify
HindifyHindify
Hindify
 
Tonic Identification System for Hindustani and Carnatic Music
Tonic Identification System for Hindustani and Carnatic MusicTonic Identification System for Hindustani and Carnatic Music
Tonic Identification System for Hindustani and Carnatic Music
 
Tonic Identification System for Indian Art Music
Tonic Identification System for Indian Art MusicTonic Identification System for Indian Art Music
Tonic Identification System for Indian Art Music
 

Recently uploaded

Leveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit InnovationLeveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit Innovation
TechSoup
 
Cognitive Development Adolescence Psychology
Cognitive Development Adolescence PsychologyCognitive Development Adolescence Psychology
Cognitive Development Adolescence Psychology
paigestewart1632
 
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
Nguyen Thanh Tu Collection
 
PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.
Dr. Shivangi Singh Parihar
 
คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1
คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1
คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1
สมใจ จันสุกสี
 
Chapter wise All Notes of First year Basic Civil Engineering.pptx
Chapter wise All Notes of First year Basic Civil Engineering.pptxChapter wise All Notes of First year Basic Civil Engineering.pptx
Chapter wise All Notes of First year Basic Civil Engineering.pptx
Denish Jangid
 
PIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf IslamabadPIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf Islamabad
AyyanKhan40
 
South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)
Academy of Science of South Africa
 
Hindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdfHindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdf
Dr. Mulla Adam Ali
 
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
GeorgeMilliken2
 
A Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdfA Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdf
Jean Carlos Nunes Paixão
 
The Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collectionThe Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collection
Israel Genealogy Research Association
 
How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17
Celine George
 
How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17
Celine George
 
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
PECB
 
Walmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdfWalmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdf
TechSoup
 
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Dr. Vinod Kumar Kanvaria
 
Film vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movieFilm vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movie
Nicholas Montgomery
 
How to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP ModuleHow to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP Module
Celine George
 
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama UniversityNatural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
Akanksha trivedi rama nursing college kanpur.
 

Recently uploaded (20)

Leveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit InnovationLeveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit Innovation
 
Cognitive Development Adolescence Psychology
Cognitive Development Adolescence PsychologyCognitive Development Adolescence Psychology
Cognitive Development Adolescence Psychology
 
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
 
PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.
 
คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1
คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1
คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1
 
Chapter wise All Notes of First year Basic Civil Engineering.pptx
Chapter wise All Notes of First year Basic Civil Engineering.pptxChapter wise All Notes of First year Basic Civil Engineering.pptx
Chapter wise All Notes of First year Basic Civil Engineering.pptx
 
PIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf IslamabadPIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf Islamabad
 
South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)
 
Hindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdfHindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdf
 
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
 
A Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdfA Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdf
 
The Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collectionThe Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collection
 
How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17
 
How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17
 
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
 
Walmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdfWalmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdf
 
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
 
Film vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movieFilm vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movie
 
How to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP ModuleHow to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP Module
 
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama UniversityNatural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
 

Discovery and Characterization of Melodic Motives in Large Audio Music Collections

  • 1. sankalp.gulati@upf.edu Discovery and Characterization of Melodic Motives in Large Audio Music Collections PhD Proposal Defense Music Technology Group, Universitat Pompeu Fabra, Barcelona, Spain Sankalp Gulati Supervisor: Prof. Xavier Serra
  • 2. sankalp.gulati@upf.edu Patterns Images at right half taken from- (Mueen & Keogh, 2009) and (Mueen & Keogh, 2010)
  • 3. sankalp.gulati@upf.edu Melodic Patterns Top right image taken from - (Mueen & Keogh, 2009) and (Mueen & Keogh, 2010)
  • 4. sankalp.gulati@upf.edu Melodic Motives (Patterns) Melodic Motives Top right image taken from - (Mueen & Keogh, 2009) and (Mueen & Keogh, 2010)
  • 6. sankalp.gulati@upf.edu Large Audio Music Collections Discovery Melodic Motives Large Audio Music Collections > 500,000 > 550 hours
  • 7. sankalp.gulati@upf.edu Characterizatio n Discovery Characterization Melodic Motives Large Audio Music Collections Transform N dimensions
  • 8. sankalp.gulati@upf.edu Discovery and Characterization of Melodic Motives in Large Audio Music Collections PhD Proposal Defense Music Technology Group, Universitat Pompeu Fabra, Barcelona, Spain Sankalp Gulati Supervisor: Prof. Xavier Serra
  • 9. sankalp.gulati@upf.edu  Music Information Research (MIR) Introduction
  • 10. sankalp.gulati@upf.edu Introduction  Music->Melody (pitch, loudness, timbre)  “It is melody that enables us to distinguish one work from another. It is melody that human beings are innately able to reproduce by singing, humming, and whistling. It is melody that makes music memorable: we are likely to recall a tune long after we have forgotten its text” -(Selfridge-Field, 1998)  Audio example:
  • 11. sankalp.gulati@upf.edu Introduction  Melodic Analysis : Melodic Motives  Computational Melodic Motivic Analysis Hungarian, Slovak, French, Sicilian, Bulgaria n and Appalachian Folk Melodies - (Juhász, 2006) Cretan, Nova scotia and Essen Folk Melodies – (Conklin and Anagnostopoulou, 2010, 2006) Tunisian modal music -(Lartillot & Ayari, 2006).
  • 12. sankalp.gulati@upf.edu Introduction  Melodic Motivic Discovery in Audio Music Signals?  Is it needed?  Why so little work?  Solution?
  • 13. sankalp.gulati@upf.edu Introduction  Indian Art Music: Opportunities  Heterophonic Music  Melodic framework (Rāg)  Importance of melodic phrases (Pakads, Chalans)  Available audio music repertoire
  • 14. sankalp.gulati@upf.edu Introduction: Broad Research Goals  Broad Research Goals:  Computational methodology for melodic motivic discovery in large audio music collection utilizing domain specific knowledge.  Melodic motivic analysis methodology  Similarity measures based on melodic motives  Compilation of sizeable audio music collection of Indian art music  Summarize and compile existing literature
  • 15. sankalp.gulati@upf.edu Introduction: Goals and Motivation  Motivation:  Lack of approaches for melodic motif extraction in audio signal  Lack of utilization of domain specific knowledge in computational methodologies  Further state of the art in pattern processing in MIR
  • 18. sankalp.gulati@upf.edu Proposed Methodology: Data Collection  Audio  Metadata  Annotations > 550 hours
  • 19. sankalp.gulati@upf.edu Proposed Methodology: Melodic Feature Extraction  Pitch, loudness and timbre features  Pitch: F0 frequency contour of predominant melodic source. Use - (Salamon & Gómez, 2012)  Loudness: Perceptual loudness computed using only predominant melodic source. Use - (Zwicker, 1977)  Timbre: Centroid of the spectral envelope of the predominant melodic source. Use - (Röbel & Rodet, 2005). Predominant F0 Frequency estimation Synthesize predominant melodic source Loudness feature extraction Timbre feature extraction Audio
  • 20. sankalp.gulati@upf.edu  Evaluation: predominant F0 frequency estimation  6 Hindustani music pieces ~45 mins Proposed Methodology: Melodic Feature Extraction
  • 21. sankalp.gulati@upf.edu  Compact + Abstract/reduced  Challenges:  Heavy meandering around notes (Gamakas)  Svar intonation  Aroh-Avroh dependent svar intonation  F0 frequency contour musical pitch perception 2.215 2.22 2.225 2.23 2.235 2.24 x 10 4 1300 1400 1500 1600 1700 1800 1900 2000 Time (1 sample = 10 ms) PredominantF0frequency(Cents) Proposed Methodology: Melodic Representation
  • 22. sankalp.gulati@upf.edu  Continuous time varying values of pitch, loudness and timbral features  Possibilities  Melody transcription  SAX based symbolic representation  Parametric representation (no studies!!)  Saddle point based representation  Domain knowledge  Svar intonation profiles Proposed Methodology: Melodic Representation
  • 23. sankalp.gulati@upf.edu Proposed Methodology: Melodic Similarity  Challenges  Melodic representation  Large timing variations  Pitch variations (ornamentations)  Differentiating a characteristic phrase from a melodic sequence using same svars  Fixing similarity threshold  Audio example:  Dynamic Time Warping (Initial experiments)  DTW > (SAX + Euclidean distances) (Ross, Vinutha, and Rao,2012)
  • 24. sankalp.gulati@upf.edu  Possibilities  Euclidian and Mahalanobis distance measures  HMM based distance measures  Dynamic time warping based distances  Step and boundary conditions  Constraints  Context dependent DTW  Domain Knowledge  DTW constraint parameters  Pattern dependent similarity threshold  Weighted distance measures Proposed Methodology: Melodic Similarity
  • 25. sankalp.gulati@upf.edu Proposed Methodology: Pattern Extraction  Challenges:  Melodic segmentation  Different motif lengths  Large volume of audio data  Exact melodic similarity ~ parametric melodic representation 1000 1200 1400 1600 1800 2000 2200 160 180 200 220 240 260 280 300 320 Time (1 sample = 10 ms) PredominantF0frequency(Hz) Match Matrix
  • 26. sankalp.gulati@upf.edu  Ongoing work  Music parallelismMelodic segmentation  Motif discovery in time series analysis domain  Fast brute force exhaustive pattern search  Pruning strategies 1000 1200 1400 1600 1800 2000 2200 160 180 200 220 240 260 280 300 320 Time (1 sample = 10 ms) PredominantF0frequency(Hz) Proposed Methodology: Pattern Extraction
  • 27. sankalp.gulati@upf.edu  Possibilities  Sparse similarity matrices  Lower bounds on distance measures  Phase space embedding/recurrent plots  Suffix trees (~parametric representation)  Domain knowledge  Probable phrase boundaries  Pruning rules  Motif characteristics Proposed Methodology: Pattern Extraction
  • 28. sankalp.gulati@upf.edu Proposed Methodology: Melodic Motivic Analysis  Challenges  Non uniform length of motives  Directions  Clustering  K-mean clustering  Self organizing maps  Fractal Analysis  Application  Rāg characterization  Rāg specific motives  Shared motives Transform N dimensions
  • 29. sankalp.gulati@upf.edu Proposed Methodology: Evaluation  Challenges  No annotated corpus  Human subjectivity in similarity related tasks  Listening tests  Feedback through Dunya users
  • 30. sankalp.gulati@upf.edu References  Selfridge-Field, E. (1998). Conceptual and representational issues in melodic comparison. Computing in musicology: a directory of research(11), 3–64.  Juhász, Z. (2006, June). A systematic comparison of different European folk music traditions using self-organizing maps. Journal of New Music Research, 35(2), 95–112.  Conklin, D., & Anagnostopoulou, C. (2006). Segmental pattern discovery in music. INFORMS Journal on Computing, 18(3), 285–293.  Lartillot, O., & Ayari, M. (2006). Motivic pattern extraction in music, and application to the study of Tunisian modal music. South African Computer Journal, 36, 16–28.  Salamon, J., & Gómez, E. (2012, August). Melody Extraction From Polyphonic Music Signals Using Pitch Contour Characteristics. IEEE Transactions on Audio, Speech, and Language Processing, 20(6), 1759–1770.  Zwicker, E. (1977). Procedure for calculating loudness of temporally variable sounds. The Journal of the Acoustical Society of America, 62(3), 675–682.  Röbel, A., & Rodet, X. (2005). Efficient spectral envelope estimation and its application to pitch shifting and envelope preservation. In Proc. dafx.  Ross, J. C., Vinutha, T., & Rao, P. (2012). Detecting melodic motifs from audio for hindustani classical music. In Proceedings of the 13th international society for music information retrieval conference, porto, portugal.  Mueen, A., Keogh, E. J., Zhu, Q., Cash, S., & Westover, M. B. (2009, April). Exact Discovery of Time Series Motifs. In SDM (pp. 473-484).  Mueen, A., & Keogh, E. (2010, July). Online discovery and maintenance of time series motifs. In Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 1089-1098). ACM.

Editor's Notes

  1. Logo
  2. Concert ad