SlideShare a Scribd company logo
1 of 12
Computational Spectro-
temporal Auditory Model
Taishih Chi
June 29, 2003
Auditory Model
• Overview – two stage processing
• Model description and formulation
• Examples of representations
• Reconstruction from model output
representations
• Discussions
Spectral
Estimation
Early Auditory
Spectral
Analysis
Primary Cortex (A1)
Sound
Auditory
Spectrum
Cortical
Representation
Auditory Model
Overview
• Temporal dynamics
reduction
• Monaural model
• Two stage functional
model
– Early stage
(spectrum estimation)
– Cortical stage
(spectrum analysis)
Early stage
Mathematical Formulation
Early Stage MATLAB Implementation
Matlab ToolBox Usage:
yfinal = wav2aud(s, [frmlen, tc, fac, shft], filt);
s : acoustic input signal
yfinal: auditory spectrogram; N(time) x M(freq.)
CF = 440 * 2 .^ ((-31:97)/24 + shft);
Cortical stage
Spectrotemporal Receptive Field
4
0.125
4
0.125
4
0.125
4
0.125
4 4
0.125
C
Frequency(kHz)Frequency(kHz)
Frequency(kHz)
Frequency(kHz)
Frequency(kHz)Frequency(kHz)
Time (ms) Time (ms) Time (ms)
Time (ms) Time (ms) Time (ms)
250 250 250
250250250
0 0 0
000
0.125
D E F
BA
(a)
Time (ms)
Log.Frequency
Downward; Ω:1 cyc/oct, ω:4 Hz
500 1000
0.25 CF
0.5 CF
1 CF
2 CF
4 CF
(b)
-1.25 0 1.25
0
Log. Frequency (octave)
hs
0 1 2 3 4 5
0
Time (sec)
ht
Cortical stage
Model Implementation
Cortical stage
Mathematical Formulation
where
then the spectrotemporal cortical response:
Cortical stage
Mathematical Formulation (cont’d)
Consider the complex wavelet transform
where
then
Cortical stage
Cortical Representation of Speech
Frequency(Hz)
Time (ms)
100 200 300 400 500 600 700 800 900 1000
125
250
500
1000
2000
Multiresolution Cortical Filters and Outputs
Upward Downward
Slow Rate
Coarse Scale
Slow Rate
Fine Scale
Fast Rate
Coarse Scale
Fast Rate
Fine Scale
Slow Rate
Coarse Scale
Slow Rate
Fine Scale
Fast Rate
Coarse Scale
Fast Rate
Fine Scale
Cortical Magnitude Representation of Speech
Frequency(Hz)
Time (ms)
Auditory Spectrogram
100 200 300 400 500 600 700 800 900 1000
125
250
500
1000
2000
Multiresolution Cortical Filters and Outputs
Upward Downward
Slow Rate
Coarse Scale
Slow Rate
Fine Scale
Fast Rate
Coarse Scale
Fast Rate
Fine Scale
Slow Rate
Coarse Scale
Slow Rate
Fine Scale
Fast Rate
Coarse Scale
Fast Rate
Fine Scale
Cortical Stage MATLAB Implementation
Matlab ToolBox Usage:
cr = aud2cor(y, para1, rv, sv, fname, DISP);
cr: 4D cortical representation (scale-rate(up-
down)-time-freq.)
y : auditory spectrogram, N(time) x M(freq.)
para1 = [paras FULLT FULLX BP],paras:see WAV2AUD
FULLT (FULLX): fullness of temporal (spectral)
margin.
BP: pure bandpass indicator.
rv: rate vector in Hz, e.g., 2.^(1:.5:5).
sv: scale vector in cyc/oct, e.g., 2.^(-2:.5:3).

More Related Content

What's hot

Speech signal time frequency representation
Speech signal time frequency representationSpeech signal time frequency representation
Speech signal time frequency representation
Nikolay Karpov
 

What's hot (19)

Matlab task1
Matlab task1Matlab task1
Matlab task1
 
Lecture9
Lecture9Lecture9
Lecture9
 
A Simple Communication System Design Lab #4 with MATLAB Simulink
A Simple Communication System Design Lab #4 with MATLAB SimulinkA Simple Communication System Design Lab #4 with MATLAB Simulink
A Simple Communication System Design Lab #4 with MATLAB Simulink
 
A Simple Communication System Design Lab #2 with MATLAB Simulink
A Simple Communication System Design Lab #2 with MATLAB SimulinkA Simple Communication System Design Lab #2 with MATLAB Simulink
A Simple Communication System Design Lab #2 with MATLAB Simulink
 
Neural source-filter waveform model
Neural source-filter waveform modelNeural source-filter waveform model
Neural source-filter waveform model
 
Fft ppt
Fft pptFft ppt
Fft ppt
 
Overview of sampling
Overview of samplingOverview of sampling
Overview of sampling
 
The Fast Fourier Transform (FFT)
The Fast Fourier Transform (FFT)The Fast Fourier Transform (FFT)
The Fast Fourier Transform (FFT)
 
Fft
FftFft
Fft
 
Non-Uniform sampling and reconstruction of multi-band signals
Non-Uniform sampling and reconstruction of multi-band signalsNon-Uniform sampling and reconstruction of multi-band signals
Non-Uniform sampling and reconstruction of multi-band signals
 
DSP_2018_FOEHU - Lec 02 - Sampling of Continuous Time Signals
DSP_2018_FOEHU - Lec 02 - Sampling of Continuous Time SignalsDSP_2018_FOEHU - Lec 02 - Sampling of Continuous Time Signals
DSP_2018_FOEHU - Lec 02 - Sampling of Continuous Time Signals
 
Speech signal time frequency representation
Speech signal time frequency representationSpeech signal time frequency representation
Speech signal time frequency representation
 
Sampling and Reconstruction of Signal using Aliasing
Sampling and Reconstruction of Signal using AliasingSampling and Reconstruction of Signal using Aliasing
Sampling and Reconstruction of Signal using Aliasing
 
Fast fourier transform
Fast fourier transformFast fourier transform
Fast fourier transform
 
Speaker Dependent WaveNet Vocoder
Speaker Dependent WaveNet VocoderSpeaker Dependent WaveNet Vocoder
Speaker Dependent WaveNet Vocoder
 
Time-Frequency Representation of Microseismic Signals using the SST
Time-Frequency Representation of Microseismic Signals using the SSTTime-Frequency Representation of Microseismic Signals using the SST
Time-Frequency Representation of Microseismic Signals using the SST
 
Tutorial on end-to-end text-to-speech synthesis: Part 1 – Neural waveform mod...
Tutorial on end-to-end text-to-speech synthesis: Part 1 – Neural waveform mod...Tutorial on end-to-end text-to-speech synthesis: Part 1 – Neural waveform mod...
Tutorial on end-to-end text-to-speech synthesis: Part 1 – Neural waveform mod...
 
Prior distribution design for music bleeding-sound reduction based on nonnega...
Prior distribution design for music bleeding-sound reduction based on nonnega...Prior distribution design for music bleeding-sound reduction based on nonnega...
Prior distribution design for music bleeding-sound reduction based on nonnega...
 
Fast Fourier Transform
Fast Fourier TransformFast Fourier Transform
Fast Fourier Transform
 

Similar to auditory model

Sampling and Reconstruction (Online Learning).pptx
Sampling and Reconstruction (Online Learning).pptxSampling and Reconstruction (Online Learning).pptx
Sampling and Reconstruction (Online Learning).pptx
HamzaJaved306957
 
Ff tand matlab-wanjun huang
Ff tand matlab-wanjun huangFf tand matlab-wanjun huang
Ff tand matlab-wanjun huang
jhonce
 
Slide Handouts with Notes
Slide Handouts with NotesSlide Handouts with Notes
Slide Handouts with Notes
Leon Nguyen
 
Digital communication
Digital communicationDigital communication
Digital communication
meashi
 
Efficient Implementation of Self-Organizing Map for Sparse Input Data
Efficient Implementation of Self-Organizing Map for Sparse Input DataEfficient Implementation of Self-Organizing Map for Sparse Input Data
Efficient Implementation of Self-Organizing Map for Sparse Input Data
ymelka
 

Similar to auditory model (20)

Radio Signal Classification with Deep Neural Networks
Radio Signal Classification with Deep Neural NetworksRadio Signal Classification with Deep Neural Networks
Radio Signal Classification with Deep Neural Networks
 
Course-Notes__Advanced-DSP.pdf
Course-Notes__Advanced-DSP.pdfCourse-Notes__Advanced-DSP.pdf
Course-Notes__Advanced-DSP.pdf
 
Advanced_DSP_J_G_Proakis.pdf
Advanced_DSP_J_G_Proakis.pdfAdvanced_DSP_J_G_Proakis.pdf
Advanced_DSP_J_G_Proakis.pdf
 
CHƯƠNG 2 KỸ THUẬT TRUYỀN DẪN SỐ - THONG TIN SỐ
CHƯƠNG 2 KỸ THUẬT TRUYỀN DẪN SỐ - THONG TIN SỐCHƯƠNG 2 KỸ THUẬT TRUYỀN DẪN SỐ - THONG TIN SỐ
CHƯƠNG 2 KỸ THUẬT TRUYỀN DẪN SỐ - THONG TIN SỐ
 
Sampling and Reconstruction (Online Learning).pptx
Sampling and Reconstruction (Online Learning).pptxSampling and Reconstruction (Online Learning).pptx
Sampling and Reconstruction (Online Learning).pptx
 
Speech Signal Processing
Speech Signal ProcessingSpeech Signal Processing
Speech Signal Processing
 
PS
PSPS
PS
 
Ff tand matlab-wanjun huang
Ff tand matlab-wanjun huangFf tand matlab-wanjun huang
Ff tand matlab-wanjun huang
 
Ff tand matlab-wanjun huang
Ff tand matlab-wanjun huangFf tand matlab-wanjun huang
Ff tand matlab-wanjun huang
 
Lecture_1 (1).pptx
Lecture_1 (1).pptxLecture_1 (1).pptx
Lecture_1 (1).pptx
 
Slide Handouts with Notes
Slide Handouts with NotesSlide Handouts with Notes
Slide Handouts with Notes
 
Radar 2009 a 11 waveforms and pulse compression
Radar 2009 a 11 waveforms and pulse compressionRadar 2009 a 11 waveforms and pulse compression
Radar 2009 a 11 waveforms and pulse compression
 
Online divergence switching for superresolution-based nonnegative matrix fact...
Online divergence switching for superresolution-based nonnegative matrix fact...Online divergence switching for superresolution-based nonnegative matrix fact...
Online divergence switching for superresolution-based nonnegative matrix fact...
 
Tdm fdm
Tdm fdmTdm fdm
Tdm fdm
 
Digital communication
Digital communicationDigital communication
Digital communication
 
디지털통신 7
디지털통신 7디지털통신 7
디지털통신 7
 
Lecture 1 (ADSP).pptx
Lecture 1 (ADSP).pptxLecture 1 (ADSP).pptx
Lecture 1 (ADSP).pptx
 
"Speech recognition" - Hidden Markov Models @ Papers We Love Bucharest
"Speech recognition" - Hidden Markov Models @ Papers We Love Bucharest"Speech recognition" - Hidden Markov Models @ Papers We Love Bucharest
"Speech recognition" - Hidden Markov Models @ Papers We Love Bucharest
 
Efficient Implementation of Self-Organizing Map for Sparse Input Data
Efficient Implementation of Self-Organizing Map for Sparse Input DataEfficient Implementation of Self-Organizing Map for Sparse Input Data
Efficient Implementation of Self-Organizing Map for Sparse Input Data
 
Sampling
SamplingSampling
Sampling
 

More from faiqa saleem (9)

Alchemist novl presntation by faiqa saleem
Alchemist novl presntation by faiqa saleemAlchemist novl presntation by faiqa saleem
Alchemist novl presntation by faiqa saleem
 
Fuzzy logic
Fuzzy logicFuzzy logic
Fuzzy logic
 
instruction cycle
instruction cycle instruction cycle
instruction cycle
 
manage teams
manage teams manage teams
manage teams
 
Presntation
PresntationPresntation
Presntation
 
Advancement in technology
Advancement in technologyAdvancement in technology
Advancement in technology
 
Motivation
MotivationMotivation
Motivation
 
Indian culture
Indian culture Indian culture
Indian culture
 
Actual meaning-of-change
Actual meaning-of-changeActual meaning-of-change
Actual meaning-of-change
 

Recently uploaded

Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Christo Ananth
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
dharasingh5698
 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
dharasingh5698
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Christo Ananth
 

Recently uploaded (20)

Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01
 
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
 
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELLPVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
 
chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringchapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineering
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
 

auditory model

  • 1. Computational Spectro- temporal Auditory Model Taishih Chi June 29, 2003
  • 2. Auditory Model • Overview – two stage processing • Model description and formulation • Examples of representations • Reconstruction from model output representations • Discussions Spectral Estimation Early Auditory Spectral Analysis Primary Cortex (A1) Sound Auditory Spectrum Cortical Representation
  • 3. Auditory Model Overview • Temporal dynamics reduction • Monaural model • Two stage functional model – Early stage (spectrum estimation) – Cortical stage (spectrum analysis)
  • 5. Early Stage MATLAB Implementation Matlab ToolBox Usage: yfinal = wav2aud(s, [frmlen, tc, fac, shft], filt); s : acoustic input signal yfinal: auditory spectrogram; N(time) x M(freq.) CF = 440 * 2 .^ ((-31:97)/24 + shft);
  • 6. Cortical stage Spectrotemporal Receptive Field 4 0.125 4 0.125 4 0.125 4 0.125 4 4 0.125 C Frequency(kHz)Frequency(kHz) Frequency(kHz) Frequency(kHz) Frequency(kHz)Frequency(kHz) Time (ms) Time (ms) Time (ms) Time (ms) Time (ms) Time (ms) 250 250 250 250250250 0 0 0 000 0.125 D E F BA
  • 7. (a) Time (ms) Log.Frequency Downward; Ω:1 cyc/oct, ω:4 Hz 500 1000 0.25 CF 0.5 CF 1 CF 2 CF 4 CF (b) -1.25 0 1.25 0 Log. Frequency (octave) hs 0 1 2 3 4 5 0 Time (sec) ht Cortical stage Model Implementation
  • 8. Cortical stage Mathematical Formulation where then the spectrotemporal cortical response:
  • 9. Cortical stage Mathematical Formulation (cont’d) Consider the complex wavelet transform where then
  • 10. Cortical stage Cortical Representation of Speech Frequency(Hz) Time (ms) 100 200 300 400 500 600 700 800 900 1000 125 250 500 1000 2000 Multiresolution Cortical Filters and Outputs Upward Downward Slow Rate Coarse Scale Slow Rate Fine Scale Fast Rate Coarse Scale Fast Rate Fine Scale Slow Rate Coarse Scale Slow Rate Fine Scale Fast Rate Coarse Scale Fast Rate Fine Scale
  • 11. Cortical Magnitude Representation of Speech Frequency(Hz) Time (ms) Auditory Spectrogram 100 200 300 400 500 600 700 800 900 1000 125 250 500 1000 2000 Multiresolution Cortical Filters and Outputs Upward Downward Slow Rate Coarse Scale Slow Rate Fine Scale Fast Rate Coarse Scale Fast Rate Fine Scale Slow Rate Coarse Scale Slow Rate Fine Scale Fast Rate Coarse Scale Fast Rate Fine Scale
  • 12. Cortical Stage MATLAB Implementation Matlab ToolBox Usage: cr = aud2cor(y, para1, rv, sv, fname, DISP); cr: 4D cortical representation (scale-rate(up- down)-time-freq.) y : auditory spectrogram, N(time) x M(freq.) para1 = [paras FULLT FULLX BP],paras:see WAV2AUD FULLT (FULLX): fullness of temporal (spectral) margin. BP: pure bandpass indicator. rv: rate vector in Hz, e.g., 2.^(1:.5:5). sv: scale vector in cyc/oct, e.g., 2.^(-2:.5:3).