SlideShare a Scribd company logo
1 of 20
Download to read offline
Introduction to Audio Content Analysis
module 9.7: music structure detection
alexander lerch
overview intro ssm novelty homogeneity repetition eval summary
introduction
overview
corresponding textbook section
section 9.7
lecture content
• structure in music
• self similarity and self distance matrices
• structure detection approaches
learning objectives
• summarize basic difficulties in ground truth annotations of musical structure
• explain and interpret self similarity and self distance matrices
• summarize three domains for approaching music structure detection
module 9.7: music structure detection 1 / 14
overview intro ssm novelty homogeneity repetition eval summary
introduction
overview
corresponding textbook section
section 9.7
lecture content
• structure in music
• self similarity and self distance matrices
• structure detection approaches
learning objectives
• summarize basic difficulties in ground truth annotations of musical structure
• explain and interpret self similarity and self distance matrices
• summarize three domains for approaching music structure detection
module 9.7: music structure detection 1 / 14
overview intro ssm novelty homogeneity repetition eval summary
music structure
introduction
music is inherently formal/organized/structural
various hierarchical structural levels
• groups of notes build rhythmic/melodic/harmonic patterns
• measures group multiple events
• phrases group several measures
• sections contain several phrases
• several sections can comprise piece/movement
• . . .
grouping of musical elements/patterns is influenced by
1 contrasts & novelty
▶ rhythmic, harmonic, melodic patterns
2 similarity and repetitions
▶ rhythmic, harmonic, melodic patterns
3 homogeneity within a section
▶ instrumentation, tempo, harmony
module 9.7: music structure detection 2 / 14
overview intro ssm novelty homogeneity repetition eval summary
music structure
introduction
music is inherently formal/organized/structural
various hierarchical structural levels
• groups of notes build rhythmic/melodic/harmonic patterns
• measures group multiple events
• phrases group several measures
• sections contain several phrases
• several sections can comprise piece/movement
• . . .
grouping of musical elements/patterns is influenced by
1 contrasts & novelty
▶ rhythmic, harmonic, melodic patterns
2 similarity and repetitions
▶ rhythmic, harmonic, melodic patterns
3 homogeneity within a section
▶ instrumentation, tempo, harmony
module 9.7: music structure detection 2 / 14
overview intro ssm novelty homogeneity repetition eval summary
music structure analysis
introduction
objective
• reveal structural properties and relationships
• generate a list of parts and repetitions
typical processing steps
1 feature extraction
2 Self Distance Matrix (SDM) or Self Similarity Matrix (SSM) computation
3 segment detection based on
▶ novelty
▶ homogeneity
▶ repetition
module 9.7: music structure detection 3 / 14
overview intro ssm novelty homogeneity repetition eval summary
music structure analysis
example
module 9.7: music structure detection 4 / 14
matlab
source:
plotStructure.m
overview intro ssm novelty homogeneity repetition eval summary
music structure analysis
features 1/2
features from all categories can have impact on structure
• timbre
▶ instrumentation, playing technique, effects, . . .
• tonal content
▶ melodic and harmonic patterns, range, . . .
• rhythm content
▶ tempo, rhythmic patterns, . . .
• dynamics
▶ loudness, range, . . .
feature aggregation
• use texture window, or
• aggregate features per beat or downbeat
module 9.7: music structure detection 5 / 14
overview intro ssm novelty homogeneity repetition eval summary
music structure analysis
self similarity matrix
S(nA, nB) = s v(nA), v(nB)

module 9.7: music structure detection 6 / 14
matlab
source:
plotSsm.m
overview intro ssm novelty homogeneity repetition eval summary
music structure analysis
feature dependency of similarity
Features (left to right): Pitch Chroma, RMS, MFCCs, Mel Spectrogram
module 9.7: music structure detection 7 / 14
matlab
source:
plotSsmFeatures.m
overview intro ssm novelty homogeneity repetition eval summary
music structure analysis
novelty analysis
module 9.7: music structure detection 8 / 14
matlab
source:
plotCheckerBoard.m
matlab
source:
plotSsmNovelty.m
overview intro ssm novelty homogeneity repetition eval summary
music structure analysis
homogeneity analysis 1/2
module 9.7: music structure detection 9 / 14
matlab
source:
plotSsmLowPass.m
overview intro ssm novelty homogeneity repetition eval summary
music structure analysis
homogeneity analysis 2/2
can also be used as post-processing step after novelty-based approach, e.g.
1 describe each segment with features
2 cluster and see which segments are grouped together
module 9.7: music structure detection 10 / 14
overview intro ssm novelty homogeneity repetition eval summary
music structure analysis
repetition analysis 1/2
module 9.7: music structure detection 11 / 14
matlab
source:
plotSsmRotated.m
overview intro ssm novelty homogeneity repetition eval summary
music structure analysis
repetition analysis 2/2
while in many cases it ’looks’ easy, automatic extraction is error-prone
⇒ typical approaches for enhancing the distance/similarity/lag matrix
• filtering (low pass smoothing, high pass edge detection)
• use matrices with different time resolutions
• image processing methods (e.g., erosion  dilation)
• thresholding
• “path search” through probability matrix
module 9.7: music structure detection 12 / 14
overview intro ssm novelty homogeneity repetition eval summary
music structure analysis
repetition analysis 2/2
while in many cases it ’looks’ easy, automatic extraction is error-prone
⇒ typical approaches for enhancing the distance/similarity/lag matrix
• filtering (low pass smoothing, high pass edge detection)
• use matrices with different time resolutions
• image processing methods (e.g., erosion  dilation)
• thresholding
• “path search” through probability matrix
module 9.7: music structure detection 12 / 14
overview intro ssm novelty homogeneity repetition eval summary
music structure analysis
evaluation
evaluation of structure detection challenging
• ground truth
▶ structure itself may be ambiguous
▶ depending on annotator, varying hierarchical level of labels, e.g.
ann 1 intro A A outro
ann 2 intro verse chorus verse chorus outro
ann 3 intro V1 V2 C1 C2 V1 V2 C1 C2 outro
method and metric
• boundary matching
• frame level, e.g., pairwise match
typical range of results
• F = 50 . . . 70%
module 9.7: music structure detection 13 / 14
overview intro ssm novelty homogeneity repetition eval summary
music structure analysis
evaluation
evaluation of structure detection challenging
• ground truth
▶ structure itself may be ambiguous
▶ depending on annotator, varying hierarchical level of labels, e.g.
ann 1 intro A A outro
ann 2 intro verse chorus verse chorus outro
ann 3 intro V1 V2 C1 C2 V1 V2 C1 C2 outro
method and metric
• boundary matching
• frame level, e.g., pairwise match
typical range of results
• F = 50 . . . 70%
module 9.7: music structure detection 13 / 14
overview intro ssm novelty homogeneity repetition eval summary
music structure analysis
evaluation
evaluation of structure detection challenging
• ground truth
▶ structure itself may be ambiguous
▶ depending on annotator, varying hierarchical level of labels, e.g.
ann 1 intro A A outro
ann 2 intro verse chorus verse chorus outro
ann 3 intro V1 V2 C1 C2 V1 V2 C1 C2 outro
method and metric
• boundary matching
• frame level, e.g., pairwise match
typical range of results
• F = 50 . . . 70%
module 9.7: music structure detection 13 / 14
overview intro ssm novelty homogeneity repetition eval summary
summary
lecture content
self similarity/distance matrices
• shows pairwise similarities/distances
• depends on input features
structure detection
1 novelty
2 homogeneity
3 repetitions
module 9.7: music structure detection 14 / 14

More Related Content

More from AlexanderLerch4

0A-02-ACA-Fundamentals-Convolution.pdf
0A-02-ACA-Fundamentals-Convolution.pdf0A-02-ACA-Fundamentals-Convolution.pdf
0A-02-ACA-Fundamentals-Convolution.pdfAlexanderLerch4
 
09-03-ACA-Temporal-Onsets.pdf
09-03-ACA-Temporal-Onsets.pdf09-03-ACA-Temporal-Onsets.pdf
09-03-ACA-Temporal-Onsets.pdfAlexanderLerch4
 
09-01-ACA-Temporal-Intro.pdf
09-01-ACA-Temporal-Intro.pdf09-01-ACA-Temporal-Intro.pdf
09-01-ACA-Temporal-Intro.pdfAlexanderLerch4
 
07-06-ACA-Tonal-Chord.pdf
07-06-ACA-Tonal-Chord.pdf07-06-ACA-Tonal-Chord.pdf
07-06-ACA-Tonal-Chord.pdfAlexanderLerch4
 
09-04-ACA-Temporal-BeatHisto.pdf
09-04-ACA-Temporal-BeatHisto.pdf09-04-ACA-Temporal-BeatHisto.pdf
09-04-ACA-Temporal-BeatHisto.pdfAlexanderLerch4
 
09-05-ACA-Temporal-Tempo.pdf
09-05-ACA-Temporal-Tempo.pdf09-05-ACA-Temporal-Tempo.pdf
09-05-ACA-Temporal-Tempo.pdfAlexanderLerch4
 
11-00-ACA-Fingerprinting.pdf
11-00-ACA-Fingerprinting.pdf11-00-ACA-Fingerprinting.pdf
11-00-ACA-Fingerprinting.pdfAlexanderLerch4
 
03-05-ACA-Input-Features.pdf
03-05-ACA-Input-Features.pdf03-05-ACA-Input-Features.pdf
03-05-ACA-Input-Features.pdfAlexanderLerch4
 
07-03-03-ACA-Tonal-Monof0.pdf
07-03-03-ACA-Tonal-Monof0.pdf07-03-03-ACA-Tonal-Monof0.pdf
07-03-03-ACA-Tonal-Monof0.pdfAlexanderLerch4
 
05-00-ACA-Data-Intro.pdf
05-00-ACA-Data-Intro.pdf05-00-ACA-Data-Intro.pdf
05-00-ACA-Data-Intro.pdfAlexanderLerch4
 
07-03-05-ACA-Tonal-F0Eval.pdf
07-03-05-ACA-Tonal-F0Eval.pdf07-03-05-ACA-Tonal-F0Eval.pdf
07-03-05-ACA-Tonal-F0Eval.pdfAlexanderLerch4
 
03-03-01-ACA-Input-TF-Fourier.pdf
03-03-01-ACA-Input-TF-Fourier.pdf03-03-01-ACA-Input-TF-Fourier.pdf
03-03-01-ACA-Input-TF-Fourier.pdfAlexanderLerch4
 
03-06-ACA-Input-FeatureLearning.pdf
03-06-ACA-Input-FeatureLearning.pdf03-06-ACA-Input-FeatureLearning.pdf
03-06-ACA-Input-FeatureLearning.pdfAlexanderLerch4
 
07-03-04-ACA-Tonal-Polyf0.pdf
07-03-04-ACA-Tonal-Polyf0.pdf07-03-04-ACA-Tonal-Polyf0.pdf
07-03-04-ACA-Tonal-Polyf0.pdfAlexanderLerch4
 
03-07-01-ACA-Input-Post-Processing.pdf
03-07-01-ACA-Input-Post-Processing.pdf03-07-01-ACA-Input-Post-Processing.pdf
03-07-01-ACA-Input-Post-Processing.pdfAlexanderLerch4
 
07-02-ACA-Tonal-Music.pdf
07-02-ACA-Tonal-Music.pdf07-02-ACA-Tonal-Music.pdf
07-02-ACA-Tonal-Music.pdfAlexanderLerch4
 

More from AlexanderLerch4 (20)

0A-02-ACA-Fundamentals-Convolution.pdf
0A-02-ACA-Fundamentals-Convolution.pdf0A-02-ACA-Fundamentals-Convolution.pdf
0A-02-ACA-Fundamentals-Convolution.pdf
 
12-02-ACA-Genre.pdf
12-02-ACA-Genre.pdf12-02-ACA-Genre.pdf
12-02-ACA-Genre.pdf
 
10-02-ACA-Alignment.pdf
10-02-ACA-Alignment.pdf10-02-ACA-Alignment.pdf
10-02-ACA-Alignment.pdf
 
09-03-ACA-Temporal-Onsets.pdf
09-03-ACA-Temporal-Onsets.pdf09-03-ACA-Temporal-Onsets.pdf
09-03-ACA-Temporal-Onsets.pdf
 
13-00-ACA-Mood.pdf
13-00-ACA-Mood.pdf13-00-ACA-Mood.pdf
13-00-ACA-Mood.pdf
 
09-01-ACA-Temporal-Intro.pdf
09-01-ACA-Temporal-Intro.pdf09-01-ACA-Temporal-Intro.pdf
09-01-ACA-Temporal-Intro.pdf
 
07-06-ACA-Tonal-Chord.pdf
07-06-ACA-Tonal-Chord.pdf07-06-ACA-Tonal-Chord.pdf
07-06-ACA-Tonal-Chord.pdf
 
09-04-ACA-Temporal-BeatHisto.pdf
09-04-ACA-Temporal-BeatHisto.pdf09-04-ACA-Temporal-BeatHisto.pdf
09-04-ACA-Temporal-BeatHisto.pdf
 
09-05-ACA-Temporal-Tempo.pdf
09-05-ACA-Temporal-Tempo.pdf09-05-ACA-Temporal-Tempo.pdf
09-05-ACA-Temporal-Tempo.pdf
 
11-00-ACA-Fingerprinting.pdf
11-00-ACA-Fingerprinting.pdf11-00-ACA-Fingerprinting.pdf
11-00-ACA-Fingerprinting.pdf
 
08-00-ACA-Intensity.pdf
08-00-ACA-Intensity.pdf08-00-ACA-Intensity.pdf
08-00-ACA-Intensity.pdf
 
03-05-ACA-Input-Features.pdf
03-05-ACA-Input-Features.pdf03-05-ACA-Input-Features.pdf
03-05-ACA-Input-Features.pdf
 
07-03-03-ACA-Tonal-Monof0.pdf
07-03-03-ACA-Tonal-Monof0.pdf07-03-03-ACA-Tonal-Monof0.pdf
07-03-03-ACA-Tonal-Monof0.pdf
 
05-00-ACA-Data-Intro.pdf
05-00-ACA-Data-Intro.pdf05-00-ACA-Data-Intro.pdf
05-00-ACA-Data-Intro.pdf
 
07-03-05-ACA-Tonal-F0Eval.pdf
07-03-05-ACA-Tonal-F0Eval.pdf07-03-05-ACA-Tonal-F0Eval.pdf
07-03-05-ACA-Tonal-F0Eval.pdf
 
03-03-01-ACA-Input-TF-Fourier.pdf
03-03-01-ACA-Input-TF-Fourier.pdf03-03-01-ACA-Input-TF-Fourier.pdf
03-03-01-ACA-Input-TF-Fourier.pdf
 
03-06-ACA-Input-FeatureLearning.pdf
03-06-ACA-Input-FeatureLearning.pdf03-06-ACA-Input-FeatureLearning.pdf
03-06-ACA-Input-FeatureLearning.pdf
 
07-03-04-ACA-Tonal-Polyf0.pdf
07-03-04-ACA-Tonal-Polyf0.pdf07-03-04-ACA-Tonal-Polyf0.pdf
07-03-04-ACA-Tonal-Polyf0.pdf
 
03-07-01-ACA-Input-Post-Processing.pdf
03-07-01-ACA-Input-Post-Processing.pdf03-07-01-ACA-Input-Post-Processing.pdf
03-07-01-ACA-Input-Post-Processing.pdf
 
07-02-ACA-Tonal-Music.pdf
07-02-ACA-Tonal-Music.pdf07-02-ACA-Tonal-Music.pdf
07-02-ACA-Tonal-Music.pdf
 

Recently uploaded

Top 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development CompaniesTop 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development CompaniesTopCSSGallery
 
WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024Lorenzo Miniero
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityVictorSzoltysek
 
Intro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptxIntro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptxFIDO Alliance
 
Design Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptxDesign Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptxFIDO Alliance
 
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...ScyllaDB
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data SciencePaolo Missier
 
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...panagenda
 
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...marcuskenyatta275
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightSafe Software
 
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard37
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)Samir Dash
 
State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!Memoori
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfdanishmna97
 
ADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptxADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptxFIDO Alliance
 
How we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfHow we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfSrushith Repakula
 
Oauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoftOauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoftshyamraj55
 
Microsoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - QuestionnaireMicrosoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - QuestionnaireExakis Nelite
 

Recently uploaded (20)

Top 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development CompaniesTop 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development Companies
 
WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps Productivity
 
Intro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptxIntro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptx
 
Design Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptxDesign Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptx
 
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
 
Overview of Hyperledger Foundation
Overview of Hyperledger FoundationOverview of Hyperledger Foundation
Overview of Hyperledger Foundation
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
 
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and Insight
 
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cf
 
ADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptxADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptx
 
How we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfHow we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdf
 
Oauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoftOauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoft
 
Microsoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - QuestionnaireMicrosoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - Questionnaire
 

09-07-ACA-Temporal-Structure.pdf

  • 1. Introduction to Audio Content Analysis module 9.7: music structure detection alexander lerch
  • 2. overview intro ssm novelty homogeneity repetition eval summary introduction overview corresponding textbook section section 9.7 lecture content • structure in music • self similarity and self distance matrices • structure detection approaches learning objectives • summarize basic difficulties in ground truth annotations of musical structure • explain and interpret self similarity and self distance matrices • summarize three domains for approaching music structure detection module 9.7: music structure detection 1 / 14
  • 3. overview intro ssm novelty homogeneity repetition eval summary introduction overview corresponding textbook section section 9.7 lecture content • structure in music • self similarity and self distance matrices • structure detection approaches learning objectives • summarize basic difficulties in ground truth annotations of musical structure • explain and interpret self similarity and self distance matrices • summarize three domains for approaching music structure detection module 9.7: music structure detection 1 / 14
  • 4. overview intro ssm novelty homogeneity repetition eval summary music structure introduction music is inherently formal/organized/structural various hierarchical structural levels • groups of notes build rhythmic/melodic/harmonic patterns • measures group multiple events • phrases group several measures • sections contain several phrases • several sections can comprise piece/movement • . . . grouping of musical elements/patterns is influenced by 1 contrasts & novelty ▶ rhythmic, harmonic, melodic patterns 2 similarity and repetitions ▶ rhythmic, harmonic, melodic patterns 3 homogeneity within a section ▶ instrumentation, tempo, harmony module 9.7: music structure detection 2 / 14
  • 5. overview intro ssm novelty homogeneity repetition eval summary music structure introduction music is inherently formal/organized/structural various hierarchical structural levels • groups of notes build rhythmic/melodic/harmonic patterns • measures group multiple events • phrases group several measures • sections contain several phrases • several sections can comprise piece/movement • . . . grouping of musical elements/patterns is influenced by 1 contrasts & novelty ▶ rhythmic, harmonic, melodic patterns 2 similarity and repetitions ▶ rhythmic, harmonic, melodic patterns 3 homogeneity within a section ▶ instrumentation, tempo, harmony module 9.7: music structure detection 2 / 14
  • 6. overview intro ssm novelty homogeneity repetition eval summary music structure analysis introduction objective • reveal structural properties and relationships • generate a list of parts and repetitions typical processing steps 1 feature extraction 2 Self Distance Matrix (SDM) or Self Similarity Matrix (SSM) computation 3 segment detection based on ▶ novelty ▶ homogeneity ▶ repetition module 9.7: music structure detection 3 / 14
  • 7. overview intro ssm novelty homogeneity repetition eval summary music structure analysis example module 9.7: music structure detection 4 / 14 matlab source: plotStructure.m
  • 8. overview intro ssm novelty homogeneity repetition eval summary music structure analysis features 1/2 features from all categories can have impact on structure • timbre ▶ instrumentation, playing technique, effects, . . . • tonal content ▶ melodic and harmonic patterns, range, . . . • rhythm content ▶ tempo, rhythmic patterns, . . . • dynamics ▶ loudness, range, . . . feature aggregation • use texture window, or • aggregate features per beat or downbeat module 9.7: music structure detection 5 / 14
  • 9. overview intro ssm novelty homogeneity repetition eval summary music structure analysis self similarity matrix S(nA, nB) = s v(nA), v(nB) module 9.7: music structure detection 6 / 14 matlab source: plotSsm.m
  • 10. overview intro ssm novelty homogeneity repetition eval summary music structure analysis feature dependency of similarity Features (left to right): Pitch Chroma, RMS, MFCCs, Mel Spectrogram module 9.7: music structure detection 7 / 14 matlab source: plotSsmFeatures.m
  • 11. overview intro ssm novelty homogeneity repetition eval summary music structure analysis novelty analysis module 9.7: music structure detection 8 / 14 matlab source: plotCheckerBoard.m matlab source: plotSsmNovelty.m
  • 12. overview intro ssm novelty homogeneity repetition eval summary music structure analysis homogeneity analysis 1/2 module 9.7: music structure detection 9 / 14 matlab source: plotSsmLowPass.m
  • 13. overview intro ssm novelty homogeneity repetition eval summary music structure analysis homogeneity analysis 2/2 can also be used as post-processing step after novelty-based approach, e.g. 1 describe each segment with features 2 cluster and see which segments are grouped together module 9.7: music structure detection 10 / 14
  • 14. overview intro ssm novelty homogeneity repetition eval summary music structure analysis repetition analysis 1/2 module 9.7: music structure detection 11 / 14 matlab source: plotSsmRotated.m
  • 15. overview intro ssm novelty homogeneity repetition eval summary music structure analysis repetition analysis 2/2 while in many cases it ’looks’ easy, automatic extraction is error-prone ⇒ typical approaches for enhancing the distance/similarity/lag matrix • filtering (low pass smoothing, high pass edge detection) • use matrices with different time resolutions • image processing methods (e.g., erosion dilation) • thresholding • “path search” through probability matrix module 9.7: music structure detection 12 / 14
  • 16. overview intro ssm novelty homogeneity repetition eval summary music structure analysis repetition analysis 2/2 while in many cases it ’looks’ easy, automatic extraction is error-prone ⇒ typical approaches for enhancing the distance/similarity/lag matrix • filtering (low pass smoothing, high pass edge detection) • use matrices with different time resolutions • image processing methods (e.g., erosion dilation) • thresholding • “path search” through probability matrix module 9.7: music structure detection 12 / 14
  • 17. overview intro ssm novelty homogeneity repetition eval summary music structure analysis evaluation evaluation of structure detection challenging • ground truth ▶ structure itself may be ambiguous ▶ depending on annotator, varying hierarchical level of labels, e.g. ann 1 intro A A outro ann 2 intro verse chorus verse chorus outro ann 3 intro V1 V2 C1 C2 V1 V2 C1 C2 outro method and metric • boundary matching • frame level, e.g., pairwise match typical range of results • F = 50 . . . 70% module 9.7: music structure detection 13 / 14
  • 18. overview intro ssm novelty homogeneity repetition eval summary music structure analysis evaluation evaluation of structure detection challenging • ground truth ▶ structure itself may be ambiguous ▶ depending on annotator, varying hierarchical level of labels, e.g. ann 1 intro A A outro ann 2 intro verse chorus verse chorus outro ann 3 intro V1 V2 C1 C2 V1 V2 C1 C2 outro method and metric • boundary matching • frame level, e.g., pairwise match typical range of results • F = 50 . . . 70% module 9.7: music structure detection 13 / 14
  • 19. overview intro ssm novelty homogeneity repetition eval summary music structure analysis evaluation evaluation of structure detection challenging • ground truth ▶ structure itself may be ambiguous ▶ depending on annotator, varying hierarchical level of labels, e.g. ann 1 intro A A outro ann 2 intro verse chorus verse chorus outro ann 3 intro V1 V2 C1 C2 V1 V2 C1 C2 outro method and metric • boundary matching • frame level, e.g., pairwise match typical range of results • F = 50 . . . 70% module 9.7: music structure detection 13 / 14
  • 20. overview intro ssm novelty homogeneity repetition eval summary summary lecture content self similarity/distance matrices • shows pairwise similarities/distances • depends on input features structure detection 1 novelty 2 homogeneity 3 repetitions module 9.7: music structure detection 14 / 14