SlideShare a Scribd company logo
1 of 9
Download to read offline
Introduction to Audio Content Analysis
module 3.3.4: time-frequency representations — filterbanks
alexander lerch
overview intro gammatone resonance summary
introduction
overview
corresponding textbook section
section 3.3.4
lecture content
• gammatone filterbank
• resonance filterbank
learning objectives
• explaining the principles of (auditory) filterbanks
module 3.3.4: time-frequency representations — filterbanks 1 / 5
overview intro gammatone resonance summary
introduction
overview
corresponding textbook section
section 3.3.4
lecture content
• gammatone filterbank
• resonance filterbank
learning objectives
• explaining the principles of (auditory) filterbanks
module 3.3.4: time-frequency representations — filterbanks 1 / 5
overview intro gammatone resonance summary
auditory filterbanks
introduction
FT and related transforms are bad models of physiological properties of the human ear:
frequency resolution (critical bands)
frequency scale (pitch resolution)
loudness & masking
event perception & time integration
⇒ auditory filterbanks
not as widely used as one might think because
1 computationally inefficient
2 analysis only: no invertibility (mostly)
3 not proven to be superior
module 3.3.4: time-frequency representations — filterbanks 2 / 5
overview intro gammatone resonance summary
auditory filterbanks
introduction
FT and related transforms are bad models of physiological properties of the human ear:
frequency resolution (critical bands)
frequency scale (pitch resolution)
loudness & masking
event perception & time integration
⇒ auditory filterbanks
not as widely used as one might think because
1 computationally inefficient
2 analysis only: no invertibility (mostly)
3 not proven to be superior
module 3.3.4: time-frequency representations — filterbanks 2 / 5
overview intro gammatone resonance summary
auditory filterbanks
introduction
FT and related transforms are bad models of physiological properties of the human ear:
frequency resolution (critical bands)
frequency scale (pitch resolution)
loudness & masking
event perception & time integration
⇒ auditory filterbanks
not as widely used as one might think because
1 computationally inefficient
2 analysis only: no invertibility (mostly)
3 not proven to be superior
module 3.3.4: time-frequency representations — filterbanks 2 / 5
overview intro gammatone resonance summary
auditory filterbanks
gammatone filterbank
h(i) =
a · (i/fS)O−1
· cos

2π · fc
i
fS

e2πi∆f/fS
module 3.3.4: time-frequency representations — filterbanks 3 / 5
matlab
source:
plotGammatone.m
overview intro gammatone resonance summary
other filterbanks
resonance filterbank
module 3.3.4: time-frequency representations — filterbanks 4 / 5
matlab
source:
plotResonanceFilterbank.m
overview intro gammatone resonance summary
summary
lecture content
filterbank-based frequency transforms
• possibly good model of human physiology
• high time resolution
• not invertible and inefficient
• not proven to be superior
module 3.3.4: time-frequency representations — filterbanks 5 / 5

More Related Content

Similar to 03-03-04-ACA-Input-TF-Filterbanks.pdf

A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
CSCJournals
 
Audio based Bird Species Recognition
Audio based Bird Species RecognitionAudio based Bird Species Recognition
Audio based Bird Species Recognition
Shriram Nandakumar
 
Filters two design_with_matlab
Filters two design_with_matlabFilters two design_with_matlab
Filters two design_with_matlab
researchwork
 

Similar to 03-03-04-ACA-Input-TF-Filterbanks.pdf (17)

Bz33462466
Bz33462466Bz33462466
Bz33462466
 
IRJET- Pitch Detection Algorithms in Time Domain
IRJET- Pitch Detection Algorithms in Time DomainIRJET- Pitch Detection Algorithms in Time Domain
IRJET- Pitch Detection Algorithms in Time Domain
 
Wavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker RecognitionWavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker Recognition
 
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
 
Audio/Speech Signal Analysis for Depression
Audio/Speech Signal Analysis for DepressionAudio/Speech Signal Analysis for Depression
Audio/Speech Signal Analysis for Depression
 
Ftir principle
Ftir principleFtir principle
Ftir principle
 
07-01-ACA-Tonal-Perception.pdf
07-01-ACA-Tonal-Perception.pdf07-01-ACA-Tonal-Perception.pdf
07-01-ACA-Tonal-Perception.pdf
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
Broad phoneme classification using signal based features
Broad phoneme classification using signal based featuresBroad phoneme classification using signal based features
Broad phoneme classification using signal based features
 
Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features  Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features
 
Paper id 28201448
Paper id 28201448Paper id 28201448
Paper id 28201448
 
Geldokumentations-Systeme von Syngene: ein Anwendungsleitfaden
Geldokumentations-Systeme von Syngene: ein AnwendungsleitfadenGeldokumentations-Systeme von Syngene: ein Anwendungsleitfaden
Geldokumentations-Systeme von Syngene: ein Anwendungsleitfaden
 
Audio based Bird Species Recognition
Audio based Bird Species RecognitionAudio based Bird Species Recognition
Audio based Bird Species Recognition
 
09-04-ACA-Temporal-BeatHisto.pdf
09-04-ACA-Temporal-BeatHisto.pdf09-04-ACA-Temporal-BeatHisto.pdf
09-04-ACA-Temporal-BeatHisto.pdf
 
Envelope analysis (tugas getaran mekanik kelas 1).pdf
Envelope analysis (tugas getaran mekanik kelas 1).pdfEnvelope analysis (tugas getaran mekanik kelas 1).pdf
Envelope analysis (tugas getaran mekanik kelas 1).pdf
 
Filters two design_with_matlab
Filters two design_with_matlabFilters two design_with_matlab
Filters two design_with_matlab
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
 

More from AlexanderLerch4

More from AlexanderLerch4 (20)

0A-02-ACA-Fundamentals-Convolution.pdf
0A-02-ACA-Fundamentals-Convolution.pdf0A-02-ACA-Fundamentals-Convolution.pdf
0A-02-ACA-Fundamentals-Convolution.pdf
 
12-02-ACA-Genre.pdf
12-02-ACA-Genre.pdf12-02-ACA-Genre.pdf
12-02-ACA-Genre.pdf
 
10-02-ACA-Alignment.pdf
10-02-ACA-Alignment.pdf10-02-ACA-Alignment.pdf
10-02-ACA-Alignment.pdf
 
09-03-ACA-Temporal-Onsets.pdf
09-03-ACA-Temporal-Onsets.pdf09-03-ACA-Temporal-Onsets.pdf
09-03-ACA-Temporal-Onsets.pdf
 
13-00-ACA-Mood.pdf
13-00-ACA-Mood.pdf13-00-ACA-Mood.pdf
13-00-ACA-Mood.pdf
 
09-07-ACA-Temporal-Structure.pdf
09-07-ACA-Temporal-Structure.pdf09-07-ACA-Temporal-Structure.pdf
09-07-ACA-Temporal-Structure.pdf
 
09-01-ACA-Temporal-Intro.pdf
09-01-ACA-Temporal-Intro.pdf09-01-ACA-Temporal-Intro.pdf
09-01-ACA-Temporal-Intro.pdf
 
07-06-ACA-Tonal-Chord.pdf
07-06-ACA-Tonal-Chord.pdf07-06-ACA-Tonal-Chord.pdf
07-06-ACA-Tonal-Chord.pdf
 
15-00-ACA-Music-Performance-Analysis.pdf
15-00-ACA-Music-Performance-Analysis.pdf15-00-ACA-Music-Performance-Analysis.pdf
15-00-ACA-Music-Performance-Analysis.pdf
 
12-01-ACA-Similarity.pdf
12-01-ACA-Similarity.pdf12-01-ACA-Similarity.pdf
12-01-ACA-Similarity.pdf
 
09-05-ACA-Temporal-Tempo.pdf
09-05-ACA-Temporal-Tempo.pdf09-05-ACA-Temporal-Tempo.pdf
09-05-ACA-Temporal-Tempo.pdf
 
11-00-ACA-Fingerprinting.pdf
11-00-ACA-Fingerprinting.pdf11-00-ACA-Fingerprinting.pdf
11-00-ACA-Fingerprinting.pdf
 
08-00-ACA-Intensity.pdf
08-00-ACA-Intensity.pdf08-00-ACA-Intensity.pdf
08-00-ACA-Intensity.pdf
 
05-00-ACA-Data-Intro.pdf
05-00-ACA-Data-Intro.pdf05-00-ACA-Data-Intro.pdf
05-00-ACA-Data-Intro.pdf
 
07-03-05-ACA-Tonal-F0Eval.pdf
07-03-05-ACA-Tonal-F0Eval.pdf07-03-05-ACA-Tonal-F0Eval.pdf
07-03-05-ACA-Tonal-F0Eval.pdf
 
03-03-01-ACA-Input-TF-Fourier.pdf
03-03-01-ACA-Input-TF-Fourier.pdf03-03-01-ACA-Input-TF-Fourier.pdf
03-03-01-ACA-Input-TF-Fourier.pdf
 
03-06-ACA-Input-FeatureLearning.pdf
03-06-ACA-Input-FeatureLearning.pdf03-06-ACA-Input-FeatureLearning.pdf
03-06-ACA-Input-FeatureLearning.pdf
 
03-07-01-ACA-Input-Post-Processing.pdf
03-07-01-ACA-Input-Post-Processing.pdf03-07-01-ACA-Input-Post-Processing.pdf
03-07-01-ACA-Input-Post-Processing.pdf
 
07-02-ACA-Tonal-Music.pdf
07-02-ACA-Tonal-Music.pdf07-02-ACA-Tonal-Music.pdf
07-02-ACA-Tonal-Music.pdf
 
06-00-ACA-Evaluation.pdf
06-00-ACA-Evaluation.pdf06-00-ACA-Evaluation.pdf
06-00-ACA-Evaluation.pdf
 

Recently uploaded

Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
FIDO Alliance
 

Recently uploaded (20)

Intro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptxIntro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptx
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 
Generative AI Use Cases and Applications.pdf
Generative AI Use Cases and Applications.pdfGenerative AI Use Cases and Applications.pdf
Generative AI Use Cases and Applications.pdf
 
Design Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptxDesign Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptx
 
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
 
2024 May Patch Tuesday
2024 May Patch Tuesday2024 May Patch Tuesday
2024 May Patch Tuesday
 
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsContinuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
 
Portal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russePortal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russe
 
State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!
 
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
 
Introduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptxIntroduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptx
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and Insight
 
Top 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development CompaniesTop 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development Companies
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
 
Overview of Hyperledger Foundation
Overview of Hyperledger FoundationOverview of Hyperledger Foundation
Overview of Hyperledger Foundation
 
WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024
 
UiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overviewUiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overview
 
Event-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream ProcessingEvent-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream Processing
 
Google I/O Extended 2024 Warsaw
Google I/O Extended 2024 WarsawGoogle I/O Extended 2024 Warsaw
Google I/O Extended 2024 Warsaw
 

03-03-04-ACA-Input-TF-Filterbanks.pdf

  • 1. Introduction to Audio Content Analysis module 3.3.4: time-frequency representations — filterbanks alexander lerch
  • 2. overview intro gammatone resonance summary introduction overview corresponding textbook section section 3.3.4 lecture content • gammatone filterbank • resonance filterbank learning objectives • explaining the principles of (auditory) filterbanks module 3.3.4: time-frequency representations — filterbanks 1 / 5
  • 3. overview intro gammatone resonance summary introduction overview corresponding textbook section section 3.3.4 lecture content • gammatone filterbank • resonance filterbank learning objectives • explaining the principles of (auditory) filterbanks module 3.3.4: time-frequency representations — filterbanks 1 / 5
  • 4. overview intro gammatone resonance summary auditory filterbanks introduction FT and related transforms are bad models of physiological properties of the human ear: frequency resolution (critical bands) frequency scale (pitch resolution) loudness & masking event perception & time integration ⇒ auditory filterbanks not as widely used as one might think because 1 computationally inefficient 2 analysis only: no invertibility (mostly) 3 not proven to be superior module 3.3.4: time-frequency representations — filterbanks 2 / 5
  • 5. overview intro gammatone resonance summary auditory filterbanks introduction FT and related transforms are bad models of physiological properties of the human ear: frequency resolution (critical bands) frequency scale (pitch resolution) loudness & masking event perception & time integration ⇒ auditory filterbanks not as widely used as one might think because 1 computationally inefficient 2 analysis only: no invertibility (mostly) 3 not proven to be superior module 3.3.4: time-frequency representations — filterbanks 2 / 5
  • 6. overview intro gammatone resonance summary auditory filterbanks introduction FT and related transforms are bad models of physiological properties of the human ear: frequency resolution (critical bands) frequency scale (pitch resolution) loudness & masking event perception & time integration ⇒ auditory filterbanks not as widely used as one might think because 1 computationally inefficient 2 analysis only: no invertibility (mostly) 3 not proven to be superior module 3.3.4: time-frequency representations — filterbanks 2 / 5
  • 7. overview intro gammatone resonance summary auditory filterbanks gammatone filterbank h(i) = a · (i/fS)O−1 · cos 2π · fc i fS e2πi∆f/fS module 3.3.4: time-frequency representations — filterbanks 3 / 5 matlab source: plotGammatone.m
  • 8. overview intro gammatone resonance summary other filterbanks resonance filterbank module 3.3.4: time-frequency representations — filterbanks 4 / 5 matlab source: plotResonanceFilterbank.m
  • 9. overview intro gammatone resonance summary summary lecture content filterbank-based frequency transforms • possibly good model of human physiology • high time resolution • not invertible and inefficient • not proven to be superior module 3.3.4: time-frequency representations — filterbanks 5 / 5