HRTF Database Provides Cues for Sound Localization


Manu Mitra

HRTF Project Presentation


Humans learn to localize sounds with their ears from
birth and localize well even in adverse conditions.
The head-related transfer function (HRTF) is the ratio
of the Fourier transform of the signal at the listener's
eardrum to that of the signal at the center of the
listener's head with the listener absent. It characterizes
the changes that the listener's body induces in the
incoming sound.
HRTFs capture the sound localization cues created by
the scattering of incident sound waves by the body,
and play a central role in spatial audio systems.
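
The ratio definition above can be illustrated with a minimal sketch. The function name `hrtf_from_signals`, the `eps` guard, and the toy signals are assumptions made for this example; real measurements would need windowing and noise handling that are not shown here.

```python
import numpy as np

def hrtf_from_signals(ear_signal, reference_signal, eps=1e-12):
    """Estimate H(f) = FFT(ear) / FFT(reference) for equal-length recordings."""
    ear = np.fft.rfft(ear_signal)
    ref = np.fft.rfft(reference_signal)
    # eps (an assumption of this sketch) avoids dividing by exactly zero
    return ear / (ref + eps)

# Toy data: the reference is a unit impulse, the "eardrum" signal is a
# delayed and attenuated copy of it.
n = 64
reference = np.zeros(n)
reference[0] = 1.0
ear = np.zeros(n)
ear[3] = 0.5

H = hrtf_from_signals(ear, reference)
hrir = np.fft.irfft(H, n)  # back to the time domain: the impulse response
```

In this toy case the magnitude of `H` is flat and the recovered impulse response is the delayed, attenuated impulse itself.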

The CIPIC Interface Laboratory at U.C. Davis has
measured HRTFs at high spatial resolution for more
than 90 subjects.
In addition to including impulse responses for 1250
directions for each ear of each subject, the database
includes a set of anthropometric measurements that
can be used for scaling studies.
Release 1.0, a public-domain subset for 45 subjects
(including KEMAR with large and with small pinnae),
can be downloaded from the website
(http://interface.cipic.ucdavis.edu).
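
As a rough sketch of how 1250 directions can arise, the grid below assumes 25 azimuths and 50 elevations in interaural-polar coordinates. The specific angle values are an assumption taken from the commonly described CIPIC sampling, not from these slides, and `nearest_direction` is a hypothetical helper.

```python
import numpy as np

# Assumed sampling grid (not stated in the slides): 25 azimuths x 50 elevations
azimuths = np.concatenate(
    ([-80, -65, -55], np.arange(-45, 50, 5), [55, 65, 80])
).astype(float)
elevations = -45.0 + 5.625 * np.arange(50)

n_directions = len(azimuths) * len(elevations)

def nearest_direction(az_deg, el_deg):
    """Return (azimuth index, elevation index) of the closest grid point."""
    i = int(np.argmin(np.abs(azimuths - az_deg)))
    j = int(np.argmin(np.abs(elevations - el_deg)))
    return i, j

front = nearest_direction(0.0, 0.0)  # indices of the straight-ahead direction
```

A lookup like this would typically be used to pick one impulse response per ear from a per-subject array indexed by (azimuth, elevation).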

Head-related impulse responses (HRIRs) are the impulse responses of a linear, time-invariant system: the diffraction and reflection of sound around the human head, the outer ear, and the torso. An attractive property of HRTFs is that they can be modeled as minimum-phase systems.
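
One common way to obtain a minimum-phase version of an impulse response is the homomorphic (real-cepstrum) method, sketched below. The slides do not say which method was used, so this is an illustrative assumption; the function name, the FFT size, and the small floor inside the logarithm are choices made for this example.

```python
import numpy as np

def minimum_phase(h, n_fft=1024):
    """Minimum-phase impulse response with (nearly) the same magnitude as h."""
    h = np.asarray(h, dtype=float)
    assert n_fft % 2 == 0 and n_fft >= len(h)
    magnitude = np.abs(np.fft.fft(h, n_fft))
    # Real cepstrum: inverse FFT of the log-magnitude spectrum
    cepstrum = np.fft.ifft(np.log(np.maximum(magnitude, 1e-12))).real
    # Fold the cepstrum: keep quefrency 0, double the positive side,
    # zero the negative side
    fold = np.zeros(n_fft)
    fold[0] = 1.0
    fold[1:n_fft // 2] = 2.0
    fold[n_fft // 2] = 1.0
    spectrum_min = np.exp(np.fft.fft(cepstrum * fold))
    return np.fft.ifft(spectrum_min).real[:len(h)]

# Toy example: [0.5, 1.0] has its zero outside the unit circle;
# its minimum-phase counterpart is [1.0, 0.5].
h = np.array([0.5, 1.0])
h_min = minimum_phase(h)
```

The method assumes the magnitude response has no exact zeros on the unit circle; responses with deep nulls need a larger `n_fft` or some regularization.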

One of the advantages of measuring HRTF data at
high spatial resolution is that the data can be
represented as an image.
In Figure 5(a), each column in the image is one
impulse response at a particular azimuth, with
brightness coding the strength of the response.
In Figure 5(b), each column is the magnitude of
the HRTF in dB, after the power spectrum was
smoothed by a constant-Q filter (Q = 8).
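
The smoothing and image construction described above can be sketched as follows. The slides do not specify the filter shape, so this example assumes a rectangular averaging window whose width is the center frequency divided by Q; the 44.1 kHz sampling rate, the FFT size, and the toy impulse responses are also assumptions.

```python
import numpy as np

def constant_q_smooth_db(hrir, fs=44100.0, q=8.0, n_fft=512):
    """Power spectrum averaged over a band of width f/Q around each bin, in dB."""
    power = np.abs(np.fft.rfft(hrir, n_fft)) ** 2
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / fs)
    smoothed = np.empty_like(power)
    for k, f in enumerate(freqs):
        half_bw = f / (2.0 * q)  # bandwidth grows in proportion to frequency
        band = (freqs >= f - half_bw) & (freqs <= f + half_bw)
        smoothed[k] = power[band].mean()
    return freqs, 10.0 * np.log10(np.maximum(smoothed, 1e-20))

# Toy "measurements": three delayed unit impulses, one per direction
hrirs = np.zeros((3, 200))
for d in range(3):
    hrirs[d, 10 + d] = 1.0

# One column per direction, one row per frequency bin
image = np.column_stack([constant_q_smooth_db(h)[1] for h in hrirs])
```

Each column of `image` can then be displayed as gray-scale values, with direction on the horizontal axis and frequency on the vertical axis.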

In Figure (a), the gray-scale value represents the amplitude of the HRIR. In Figure (b), the gray-scale value is the magnitude of the HRTF in dB. The composition of the response in terms of head diffraction effects, head and torso reflections, pinna effects, and knee reflections can be seen in both the time domain and the frequency domain.

The decomposition techniques presented are based on
spectral peaks and nulls, i.e., poles and zeros.
These poles and zeros are caused by different parts of
the body, such as the head, torso, knees, and pinna.
The challenging task is to isolate the prominent
spectral nulls caused by each acoustic phenomenon.
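
Spectral peaks are often captured by fitting an all-pole model. The sketch below uses the autocorrelation method of linear prediction, a standard fitting technique; the slides do not state which method was used, so treat this as an assumption. The function name and the two-pole test signal are hypothetical.

```python
import numpy as np

def lpc_autocorr(x, order):
    """Fit 1 / (1 + a1 z^-1 + ... + ap z^-p) by the autocorrelation method."""
    x = np.asarray(x, dtype=float)
    # Autocorrelation at lags 0..order
    r = np.correlate(x, x, mode="full")[len(x) - 1:len(x) + order]
    # Toeplitz normal equations: R a = -r[1..order]
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    a = np.linalg.solve(R, -r[1:order + 1])
    return np.concatenate(([1.0], a))

# Toy signal: impulse response of a two-pole resonator with poles at
# radius 0.9 and angle pi/4
a_true = np.array([1.0, -2 * 0.9 * np.cos(np.pi / 4), 0.9 ** 2])
h = np.zeros(512)
h[0] = 1.0
h[1] = -a_true[1] * h[0]
for n in range(2, len(h)):
    h[n] = -a_true[1] * h[n - 1] - a_true[2] * h[n - 2]

a_est = lpc_autocorr(h, order=2)
poles = np.roots(a_est)  # pole angles correspond to spectral peak frequencies
```

The same function could be called with a higher order on a measured impulse response; the pole angles then indicate candidate resonance frequencies.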

In this section, features like pinna resonant
frequencies, pinna nulls, and the delay due to torso and
knee reflections are extracted using the above
decomposition technique.
The figure shows the frequency response of the
12th-order all-pole model for subject 10 at azimuth 0°
as a function of elevation, drawn as a mesh plot.
In the frequency domain, the torso reflection delay
appears as periodic comb-filter nulls.
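
The comb-filter effect follows from modeling a reflection as a delayed, attenuated copy of the direct sound, H(f) = 1 + a·exp(-j2πfτ), whose nulls fall at odd multiples of 1/(2τ). In the sketch below the 1 ms delay and the gain of 0.5 are illustrative assumptions, not measured torso values.

```python
import numpy as np

def comb_null_frequencies(delay_s, f_max):
    """Null frequencies (2k + 1) / (2 * delay) of a single reflection, up to f_max."""
    k = np.arange(int(np.floor(f_max * delay_s - 0.5)) + 1)
    return (2 * k + 1) / (2.0 * delay_s)

def reflection_magnitude(f, delay_s, gain):
    """|1 + gain * exp(-j 2 pi f delay)| for a direct path plus one reflection."""
    return np.abs(1.0 + gain * np.exp(-2j * np.pi * np.asarray(f) * delay_s))

delay = 1e-3  # assumed reflection delay in seconds
gain = 0.5    # assumed relative strength of the reflection
nulls = comb_null_frequencies(delay, f_max=5000.0)
depth = reflection_magnitude(nulls, delay, gain)
```

The nulls are spaced 1/τ apart, so measuring the spacing of the periodic nulls in a spectrum gives an estimate of the reflection delay.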

High-spatial-resolution HRTF measurements clarify
the physical sources of HRTF behavior.
The HRTF can be composed from, and decomposed
into, different components, and features that could be
perceptually important for sound source localization
can be extracted.
Using the extracted features, interpolation can be done
in the feature domain.
These features can be related to the physical
dimensions of the human anatomy and the pinna so
that the HRTF can be customized.

V. R. Algazi, R. O. Duda, D. M. Thompson, and C.
Avendano, “The CIPIC HRTF Database,” October 21-24,
2001, New Paltz, New York.
Vikas C. Raykar, Ramani Duraiswami, Larry Davis, and B.
Yegnanarayana, “Extracting Significant Features from
the HRTF,” July 6-9, 2003, Boston, MA.
Dmitry N. Zotkin, Jane Hwang, Ramani Duraiswami, and
Larry S. Davis, “HRTF Personalization Using
Anthropometric Measurements,” College Park, MD.

