SlideShare a Scribd company logo
1 of 17
Download to read offline
Introduction to Audio Content Analysis
module 2.0: audio content analysis process
alexander lerch
overview content ACA summary
introduction
overview
corresponding textbook section
chapter 2
lecture content
• audio content
• processing steps in a typical ACA system
learning objectives
• discuss typical forms of content in an audio signal
• describe the typical signal flow in an ACA system
module 2.0: audio content analysis process 1 / 5
overview content ACA summary
introduction
overview
corresponding textbook section
chapter 2
lecture content
• audio content
• processing steps in a typical ACA system
learning objectives
• discuss typical forms of content in an audio signal
• describe the typical signal flow in an ACA system
module 2.0: audio content analysis process 1 / 5
overview content ACA summary
audio content
sources
what are the sources of (musical) audio content?
module 2.0: audio content analysis process 2 / 5
overview content ACA summary
audio content
sources
what are the sources of (musical) audio content?
1 score/composition:
• definition of musical ideas
• “blue-print” of the music
• examples: melody, key, harmony, rhythmic patterns, . . .
module 2.0: audio content analysis process 2 / 5
overview content ACA summary
audio content
sources
what are the sources of (musical) audio content?
1 score/composition:
• definition of musical ideas
• “blue-print” of the music
• examples: melody, key, harmony, rhythmic patterns, . . .
2 performance:
• unique acoustic rendition
• information in the score is interpreted, modified, added to
• examples: (micro-)tempo, dynamics, intonation, . . .
module 2.0: audio content analysis process 2 / 5
overview content ACA summary
audio content
sources
what are the sources of (musical) audio content?
1 score/composition:
• definition of musical ideas
• “blue-print” of the music
• examples: melody, key, harmony, rhythmic patterns, . . .
2 performance:
• unique acoustic rendition
• information in the score is interpreted, modified, added to
• examples: (micro-)tempo, dynamics, intonation, . . .
3 production:
• aesthetic choices
• editing & processing
• examples: sound quality (EQ, microphone positioning), changes in timing and pitch
module 2.0: audio content analysis process 2 / 5
overview content ACA summary
audio content
categories
audio content can be structured into 4 basic categories:
1 tonal: related to pitch
• examples: melody, chords, intonation, vibrato, . . .
2 timbral: related to sound quality
• examples: instrument(ation), playing technique, venue, audio processing, . . .
3 intensity-related: related to musical dynamics
• examples: accents, loudness, . . .
4 temporal: related to rhythm and tempo
• examples: timing, meter, rhythmic patterns, . . .
module 2.0: audio content analysis process 3 / 5
overview content ACA summary
audio content
categories
audio content can be structured into 4 basic categories:
1 tonal: related to pitch
• examples: melody, chords, intonation, vibrato, . . .
2 timbral: related to sound quality
• examples: instrument(ation), playing technique, venue, audio processing, . . .
3 intensity-related: related to musical dynamics
• examples: accents, loudness, . . .
4 temporal: related to rhythm and tempo
• examples: timing, meter, rhythmic patterns, . . .
module 2.0: audio content analysis process 3 / 5
overview content ACA summary
audio content
categories
audio content can be structured into 4 basic categories:
1 tonal: related to pitch
• examples: melody, chords, intonation, vibrato, . . .
2 timbral: related to sound quality
• examples: instrument(ation), playing technique, venue, audio processing, . . .
3 intensity-related: related to musical dynamics
• examples: accents, loudness, . . .
4 temporal: related to rhythm and tempo
• examples: timing, meter, rhythmic patterns, . . .
module 2.0: audio content analysis process 3 / 5
overview content ACA summary
audio content
categories
audio content can be structured into 4 basic categories:
1 tonal: related to pitch
• examples: melody, chords, intonation, vibrato, . . .
2 timbral: related to sound quality
• examples: instrument(ation), playing technique, venue, audio processing, . . .
3 intensity-related: related to musical dynamics
• examples: accents, loudness, . . .
4 temporal: related to rhythm and tempo
• examples: timing, meter, rhythmic patterns, . . .
module 2.0: audio content analysis process 3 / 5
overview content ACA summary
audio content
categories
audio content can be structured into 4 basic categories:
1 tonal: related to pitch
• examples: melody, chords, intonation, vibrato, . . .
2 timbral: related to sound quality
• examples: instrument(ation), playing technique, venue, audio processing, . . .
3 intensity-related: related to musical dynamics
• examples: accents, loudness, . . .
4 temporal: related to rhythm and tempo
• examples: timing, meter, rhythmic patterns, . . .
module 2.0: audio content analysis process 3 / 5
overview content ACA summary
audio content
categories
audio content can be structured into 4 basic categories:
1 tonal: related to pitch
• examples: melody, chords, intonation, vibrato, . . .
2 timbral: related to sound quality
• examples: instrument(ation), playing technique, venue, audio processing, . . .
3 intensity-related: related to musical dynamics
• examples: accents, loudness, . . .
4 temporal: related to rhythm and tempo
• examples: timing, meter, rhythmic patterns, . . .
other non-musical content descriptions: e.g., statistical, technical
module 2.0: audio content analysis process 3 / 5
overview content ACA summary
audio content analysis
system overview
audio
signal
feature extraction
input representation
decision,
interpretation,
classification,
inference
meta
data
feature representation
• compact and non-redundant
• task-relevant
• easy to analyze
classification/inference
• map or convert feature to
comprehensible domain
module 2.0: audio content analysis process 4 / 5
overview content ACA summary
audio content analysis
system overview
audio
signal
feature
extraction
decision,
interpretation,
classification,
inference
meta
data
feature representation
• compact and non-redundant
• task-relevant
• easy to analyze
classification/inference
• map or convert feature to
comprehensible domain
module 2.0: audio content analysis process 4 / 5
overview content ACA summary
audio content analysis
system overview
audio
signal
feature
extraction
decision,
interpretation,
classification,
inference
meta
data
feature representation
• compact and non-redundant
• task-relevant
• easy to analyze
classification/inference
• map or convert feature to
comprehensible domain
module 2.0: audio content analysis process 4 / 5
overview content ACA summary
summary
lecture content
audio content
• is shaped by the musical ideas (score), the music performance, and the (studio)
production
• can relate to timbre, pitch, intensity, tempo and rhythm (but there is both lower
level and higher level content)
the flow chart of an ACA system at its most fundamental level shows
• a feature extraction step to extract meaningful descriptors
• a classification or inference step to produce a “human” result
module 2.0: audio content analysis process 5 / 5

More Related Content

Similar to 02-00-ACA-System-Intro.pdf

Collins
CollinsCollins
Collins
anesah
 
The kusc classical music dataset for audio key finding
The kusc classical music dataset for audio key findingThe kusc classical music dataset for audio key finding
The kusc classical music dataset for audio key finding
ijma
 
Music Information Retrieval: Overview and Current Trends 2008
Music Information Retrieval: Overview and Current Trends 2008Music Information Retrieval: Overview and Current Trends 2008
Music Information Retrieval: Overview and Current Trends 2008
Rui Pedro Paiva
 

Similar to 02-00-ACA-System-Intro.pdf (18)

MULHER@AVI2012
MULHER@AVI2012MULHER@AVI2012
MULHER@AVI2012
 
Audio Source Separation Based on Low-Rank Structure and Statistical Independence
Audio Source Separation Based on Low-Rank Structure and Statistical IndependenceAudio Source Separation Based on Low-Rank Structure and Statistical Independence
Audio Source Separation Based on Low-Rank Structure and Statistical Independence
 
AMT overview
AMT overviewAMT overview
AMT overview
 
EDM for Europeana Sounds
EDM for Europeana SoundsEDM for Europeana Sounds
EDM for Europeana Sounds
 
10-02-ACA-Alignment.pdf
10-02-ACA-Alignment.pdf10-02-ACA-Alignment.pdf
10-02-ACA-Alignment.pdf
 
Collins
CollinsCollins
Collins
 
EDM for sounds
EDM for soundsEDM for sounds
EDM for sounds
 
Archiving and disseminating sound archives – 3. Analysis and treatment of sou...
Archiving and disseminating sound archives – 3. Analysis and treatment of sou...Archiving and disseminating sound archives – 3. Analysis and treatment of sou...
Archiving and disseminating sound archives – 3. Analysis and treatment of sou...
 
Critiquing music
Critiquing musicCritiquing music
Critiquing music
 
Lesson 1.1 Why Music Theory is Important
Lesson 1.1 Why Music Theory is Important Lesson 1.1 Why Music Theory is Important
Lesson 1.1 Why Music Theory is Important
 
Musicology Presentation
Musicology PresentationMusicology Presentation
Musicology Presentation
 
Indexing and Retrieval of Audio
Indexing and Retrieval of AudioIndexing and Retrieval of Audio
Indexing and Retrieval of Audio
 
09-01-ACA-Temporal-Intro.pdf
09-01-ACA-Temporal-Intro.pdf09-01-ACA-Temporal-Intro.pdf
09-01-ACA-Temporal-Intro.pdf
 
The kusc classical music dataset for audio key finding
The kusc classical music dataset for audio key findingThe kusc classical music dataset for audio key finding
The kusc classical music dataset for audio key finding
 
RDA for music cataloguers
RDA for music cataloguersRDA for music cataloguers
RDA for music cataloguers
 
15-00-ACA-Music-Performance-Analysis.pdf
15-00-ACA-Music-Performance-Analysis.pdf15-00-ACA-Music-Performance-Analysis.pdf
15-00-ACA-Music-Performance-Analysis.pdf
 
Music Information Retrieval: Overview and Current Trends 2008
Music Information Retrieval: Overview and Current Trends 2008Music Information Retrieval: Overview and Current Trends 2008
Music Information Retrieval: Overview and Current Trends 2008
 
Music Objects to Social Machines
Music Objects to Social MachinesMusic Objects to Social Machines
Music Objects to Social Machines
 

More from AlexanderLerch4

More from AlexanderLerch4 (20)

0A-02-ACA-Fundamentals-Convolution.pdf
0A-02-ACA-Fundamentals-Convolution.pdf0A-02-ACA-Fundamentals-Convolution.pdf
0A-02-ACA-Fundamentals-Convolution.pdf
 
09-03-ACA-Temporal-Onsets.pdf
09-03-ACA-Temporal-Onsets.pdf09-03-ACA-Temporal-Onsets.pdf
09-03-ACA-Temporal-Onsets.pdf
 
13-00-ACA-Mood.pdf
13-00-ACA-Mood.pdf13-00-ACA-Mood.pdf
13-00-ACA-Mood.pdf
 
09-07-ACA-Temporal-Structure.pdf
09-07-ACA-Temporal-Structure.pdf09-07-ACA-Temporal-Structure.pdf
09-07-ACA-Temporal-Structure.pdf
 
07-06-ACA-Tonal-Chord.pdf
07-06-ACA-Tonal-Chord.pdf07-06-ACA-Tonal-Chord.pdf
07-06-ACA-Tonal-Chord.pdf
 
12-01-ACA-Similarity.pdf
12-01-ACA-Similarity.pdf12-01-ACA-Similarity.pdf
12-01-ACA-Similarity.pdf
 
09-04-ACA-Temporal-BeatHisto.pdf
09-04-ACA-Temporal-BeatHisto.pdf09-04-ACA-Temporal-BeatHisto.pdf
09-04-ACA-Temporal-BeatHisto.pdf
 
09-05-ACA-Temporal-Tempo.pdf
09-05-ACA-Temporal-Tempo.pdf09-05-ACA-Temporal-Tempo.pdf
09-05-ACA-Temporal-Tempo.pdf
 
11-00-ACA-Fingerprinting.pdf
11-00-ACA-Fingerprinting.pdf11-00-ACA-Fingerprinting.pdf
11-00-ACA-Fingerprinting.pdf
 
08-00-ACA-Intensity.pdf
08-00-ACA-Intensity.pdf08-00-ACA-Intensity.pdf
08-00-ACA-Intensity.pdf
 
03-05-ACA-Input-Features.pdf
03-05-ACA-Input-Features.pdf03-05-ACA-Input-Features.pdf
03-05-ACA-Input-Features.pdf
 
07-03-03-ACA-Tonal-Monof0.pdf
07-03-03-ACA-Tonal-Monof0.pdf07-03-03-ACA-Tonal-Monof0.pdf
07-03-03-ACA-Tonal-Monof0.pdf
 
05-00-ACA-Data-Intro.pdf
05-00-ACA-Data-Intro.pdf05-00-ACA-Data-Intro.pdf
05-00-ACA-Data-Intro.pdf
 
07-03-05-ACA-Tonal-F0Eval.pdf
07-03-05-ACA-Tonal-F0Eval.pdf07-03-05-ACA-Tonal-F0Eval.pdf
07-03-05-ACA-Tonal-F0Eval.pdf
 
03-03-01-ACA-Input-TF-Fourier.pdf
03-03-01-ACA-Input-TF-Fourier.pdf03-03-01-ACA-Input-TF-Fourier.pdf
03-03-01-ACA-Input-TF-Fourier.pdf
 
03-06-ACA-Input-FeatureLearning.pdf
03-06-ACA-Input-FeatureLearning.pdf03-06-ACA-Input-FeatureLearning.pdf
03-06-ACA-Input-FeatureLearning.pdf
 
07-03-04-ACA-Tonal-Polyf0.pdf
07-03-04-ACA-Tonal-Polyf0.pdf07-03-04-ACA-Tonal-Polyf0.pdf
07-03-04-ACA-Tonal-Polyf0.pdf
 
03-07-01-ACA-Input-Post-Processing.pdf
03-07-01-ACA-Input-Post-Processing.pdf03-07-01-ACA-Input-Post-Processing.pdf
03-07-01-ACA-Input-Post-Processing.pdf
 
07-02-ACA-Tonal-Music.pdf
07-02-ACA-Tonal-Music.pdf07-02-ACA-Tonal-Music.pdf
07-02-ACA-Tonal-Music.pdf
 
06-00-ACA-Evaluation.pdf
06-00-ACA-Evaluation.pdf06-00-ACA-Evaluation.pdf
06-00-ACA-Evaluation.pdf
 

Recently uploaded

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Simplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxSimplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptx
 
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 

02-00-ACA-System-Intro.pdf

  • 1. Introduction to Audio Content Analysis module 2.0: audio content analysis process alexander lerch
  • 2. overview content ACA summary introduction overview corresponding textbook section chapter 2 lecture content • audio content • processing steps in a typical ACA system learning objectives • discuss typical forms of content in an audio signal • describe the typical signal flow in an ACA system module 2.0: audio content analysis process 1 / 5
  • 3. overview content ACA summary introduction overview corresponding textbook section chapter 2 lecture content • audio content • processing steps in a typical ACA system learning objectives • discuss typical forms of content in an audio signal • describe the typical signal flow in an ACA system module 2.0: audio content analysis process 1 / 5
  • 4. overview content ACA summary audio content sources what are the sources of (musical) audio content? module 2.0: audio content analysis process 2 / 5
  • 5. overview content ACA summary audio content sources what are the sources of (musical) audio content? 1 score/composition: • definition of musical ideas • “blue-print” of the music • examples: melody, key, harmony, rhythmic patterns, . . . module 2.0: audio content analysis process 2 / 5
  • 6. overview content ACA summary audio content sources what are the sources of (musical) audio content? 1 score/composition: • definition of musical ideas • “blue-print” of the music • examples: melody, key, harmony, rhythmic patterns, . . . 2 performance: • unique acoustic rendition • information in the score is interpreted, modified, added to • examples: (micro-)tempo, dynamics, intonation, . . . module 2.0: audio content analysis process 2 / 5
  • 7. overview content ACA summary audio content sources what are the sources of (musical) audio content? 1 score/composition: • definition of musical ideas • “blue-print” of the music • examples: melody, key, harmony, rhythmic patterns, . . . 2 performance: • unique acoustic rendition • information in the score is interpreted, modified, added to • examples: (micro-)tempo, dynamics, intonation, . . . 3 production: • aesthetic choices • editing & processing • examples: sound quality (EQ, microphone positioning), changes in timing and pitch module 2.0: audio content analysis process 2 / 5
  • 8. overview content ACA summary audio content categories audio content can be structured into 4 basic categories: 1 tonal: related to pitch • examples: melody, chords, intonation, vibrato, . . . 2 timbral: related to sound quality • examples: instrument(ation), playing technique, venue, audio processing, . . . 3 intensity-related: related to musical dynamics • examples: accents, loudness, . . . 4 temporal: related to rhythm and tempo • examples: timing, meter, rhythmic patterns, . . . module 2.0: audio content analysis process 3 / 5
  • 9. overview content ACA summary audio content categories audio content can be structured into 4 basic categories: 1 tonal: related to pitch • examples: melody, chords, intonation, vibrato, . . . 2 timbral: related to sound quality • examples: instrument(ation), playing technique, venue, audio processing, . . . 3 intensity-related: related to musical dynamics • examples: accents, loudness, . . . 4 temporal: related to rhythm and tempo • examples: timing, meter, rhythmic patterns, . . . module 2.0: audio content analysis process 3 / 5
  • 10. overview content ACA summary audio content categories audio content can be structured into 4 basic categories: 1 tonal: related to pitch • examples: melody, chords, intonation, vibrato, . . . 2 timbral: related to sound quality • examples: instrument(ation), playing technique, venue, audio processing, . . . 3 intensity-related: related to musical dynamics • examples: accents, loudness, . . . 4 temporal: related to rhythm and tempo • examples: timing, meter, rhythmic patterns, . . . module 2.0: audio content analysis process 3 / 5
  • 11. overview content ACA summary audio content categories audio content can be structured into 4 basic categories: 1 tonal: related to pitch • examples: melody, chords, intonation, vibrato, . . . 2 timbral: related to sound quality • examples: instrument(ation), playing technique, venue, audio processing, . . . 3 intensity-related: related to musical dynamics • examples: accents, loudness, . . . 4 temporal: related to rhythm and tempo • examples: timing, meter, rhythmic patterns, . . . module 2.0: audio content analysis process 3 / 5
  • 12. overview content ACA summary audio content categories audio content can be structured into 4 basic categories: 1 tonal: related to pitch • examples: melody, chords, intonation, vibrato, . . . 2 timbral: related to sound quality • examples: instrument(ation), playing technique, venue, audio processing, . . . 3 intensity-related: related to musical dynamics • examples: accents, loudness, . . . 4 temporal: related to rhythm and tempo • examples: timing, meter, rhythmic patterns, . . . module 2.0: audio content analysis process 3 / 5
  • 13. overview content ACA summary audio content categories audio content can be structured into 4 basic categories: 1 tonal: related to pitch • examples: melody, chords, intonation, vibrato, . . . 2 timbral: related to sound quality • examples: instrument(ation), playing technique, venue, audio processing, . . . 3 intensity-related: related to musical dynamics • examples: accents, loudness, . . . 4 temporal: related to rhythm and tempo • examples: timing, meter, rhythmic patterns, . . . other non-musical content descriptions: e.g., statistical, technical module 2.0: audio content analysis process 3 / 5
  • 14. overview content ACA summary audio content analysis system overview audio signal feature extraction input representation decision, interpretation, classification, inference meta data feature representation • compact and non-redundant • task-relevant • easy to analyze classification/inference • map or convert feature to comprehensible domain module 2.0: audio content analysis process 4 / 5
  • 15. overview content ACA summary audio content analysis system overview audio signal feature extraction decision, interpretation, classification, inference meta data feature representation • compact and non-redundant • task-relevant • easy to analyze classification/inference • map or convert feature to comprehensible domain module 2.0: audio content analysis process 4 / 5
  • 16. overview content ACA summary audio content analysis system overview audio signal feature extraction decision, interpretation, classification, inference meta data feature representation • compact and non-redundant • task-relevant • easy to analyze classification/inference • map or convert feature to comprehensible domain module 2.0: audio content analysis process 4 / 5
  • 17. overview content ACA summary summary lecture content audio content • is shaped by the musical ideas (score), the music performance, and the (studio) production • can relate to timbre, pitch, intensity, tempo and rhythm (but there is both lower level and higher level content) the flow chart of an ACA system at its most fundamental level shows • a feature extraction step to extract meaningful descriptors • a classification or inference step to produce a “human” result module 2.0: audio content analysis process 5 / 5