SlideShare a Scribd company logo
Subjective Assessment of HRTF
Interpolation with Spherical Harmonics
Chris Pike and Tony Tew
Binaural
Filter
Processor
Mono
Input
Audio
Binaural
Filter
Generator
Source
Position
Binaural
Filter
Data
Binaural
Output
Audio
Binaural Rendering
• Spatialising a monophonic signal for
headphone playback with a filter for
each ear
• We’re discussing the role of filter
generation (via interpolation)
• Not how you swap between filters
(commutation) [Jot 1995]
2
Head-Related Transfer Function (HRTF)
3
Why Interpolate HRTFs?
• Reduce measurement requirements
• Reduce data storage requirements
• Obtain filters for precise source position

(a direction-continuous HRTF)
4
HRTF Interpolation Techniques
• Local techniques

e.g. linear weighting of local measurements around target position
• Global techniques

e.g. linear weighting of spectral or spatial basis functions
5
Interpolation with Spherical Harmonics
• First suggested by Jot (“binaural B-format”) [Jot 1995]
• Investigated practically by Evans at order 17 [Evans et al 1998]
• Theory further developed, extended to range extrapolation, extended
to irregular and incomplete sample grids with regularisation
[Duraiswami et al 2004, Pollow et al 2012]
• Metrics and implementation issues discussed by [Richter et al. 2014]
• Applied in binaural reproduction of plane waves with reduced modal
order (sound field synthesis) [Bernschütz et al 2014]
6
Motivation
• Use spherical harmonic interpolation to generate HRTFs at arbitrary
directions for investigation of binaural rendering of virtual sound
fields e.g. higher-order ambisonics
• Although the theory is well developed, there appears to be no
subjective validation of the interpolation method
• Research question: Can spherical harmonic interpolation of HRTFs
generate filters indistinguishable from measured data at complexity
that could be run in real-time for complex scenes?
7
Theoretical Approach
• Evaluate the HRTF with a solution of the Helmholtz equation
• Assume source at the ear and evaluate the outgoing acoustic
radiation at any point outside of the head surface (reciprocity
principal)
• Expand the pressure field as a series of spatial modes
8
Analysis and Synthesis
• Project HRTF measurements of constant distance onto the spherical
harmonic expansion coefficients using integration over the sphere
(spatial Fourier transform)
• For given sample weights (quadrature) the discrete spatial Fourier
transform is used
• Performed separately for each ear 

(unless the head is left-right symmetric)
• An HRTF can then be synthesised at arbitrary points using the inverse
spatial Fourier transform
9
Sources of Error
• Spatial aliasing

Sampled signal has energy in higher modes than sample grid offers
• Truncation error

Transform order lower than grid order e.g. for real-time efficiency
• Ill-conditioning in range extrapolation

Numerical instability at low frequency due to ratios of spherical
Hankel functions

10
Full Sphere HRTF Dataset
• Bernschütz measured a full-sphere far-field HRTF dataset on the
Neumann KU100 dummy head microphone [Bernschütz 2013]
• Three quadrature grids that sample the spherical harmonics well up
to given orders (Ng > 35)
• Used to obtain HRTFs at arbitrary source directions via spherical
harmonic interpolation [Bernschütz et al 2014]
• Suggests series order 35 is valid for temporal frequencies up to
20kHz
11
Quadratures
12
Gauss Ng =89 L=16020 Lebedev Ng =44 L=2702 Lebedev Ng =41 L=2354
Modal Intensity Distributions
13
Gauss Ng = 89 Lebedev Ng = 44
0 100 250 500 1000 2000 4000 8000 16000
Frequency (Hz)
-40
-30
-20
-10
0
10
20
Magnitude(dB)
Measured
5
10
20
35
44
Synthesis Truncation Order
14
Left ear at (0˚,0˚)
0 100 250 500 1000 2000 4000 8000 16000
Frequency (Hz)
-40
-30
-20
-10
0
10
20
Magnitude(dB)
Measured
5
10
20
35
44
Left ear at (45˚,0˚)
0 100 250 500 1000 2000 4000 8000 16000
Frequency (Hz)
-40
-30
-20
-10
0
10
20
Magnitude(dB)
Measured
5
10
20
35
44
0 100 250 500 1000 2000 4000 8000 16000
Frequency (Hz)
-40
-30
-20
-10
0
10
20
Magnitude(dB)
Measured
5
10
20
35
44
Synthesis Truncation Order
15
Left ear at (90˚,0˚) Left ear at (135˚,0˚)
0 100 250 500 1000 2000 4000 8000 16000
Frequency (Hz)
35
30
25
20
15
10
5
0
Order(N)
-50
-45
-40
-35
-30
-25
-20
-15
-10
-5
0
(dB)
Modal Intensity Distribution
• The intensity distribution energy at
increasing modal order for higher
temporal frequencies
• Truncation will remove this energy
and therefore we get a low-pass
effect, as observed by 

[Bernschütz et al 2014]
16
500 1k 2k 5k 10k 20k
Frequency (Hz)
0
5
10
15
20
25
30
35
Order(N)
50
50
50
50
90
90
90
90
95
95
95
95
98
98
98
98
Modal Energy Ratio
• Can look at truncation error using
contour plot of modal energy ratio
• At truncation order N=35 we retain
98% of energy up to 20kHz
17
0 100 250 500 1000 2000 4000 8000 16000
Frequency (Hz)
0
5
10
15
20
25
30
CentreofPower(J2)
off-centre
acoustic centring
onset removal
Acoustic Centring
• [Richter et al 2014] showed that an
acoustic centring can reduce the
energy in higher modal orders
• Translation of measurement points
to estimated acoustic centre
(including distance adjustment)
• Here a frequency-independent
offset of 9.75cm in y-axis was used
18
0 100 250 500 1000 2000 4000 8000 16000
Frequency (Hz)
0
5
10
15
20
25
30
CentreofPower(J2)
off-centre
acoustic centring
onset removal
Acoustic Centring
• Similar reduction in centre of power
is achieved with a conventional
broadband onset extraction from the
HRIRs
• A parametric broadband time-of-
arrival modal was used
[Ziegelwanger et al 2014]
• This has advantages in resynthesis:
efficiency and ITD personalisation
19
Acoustic Centring
20
500 1k 2k 5k 10k 20k
Frequency (Hz)
0
5
10
15
20
25
30
35
Order(N)
50
50
50
50
90
90
90
90
95
95
95
95
98
98
98
98
500 1k 2k 5k 10k 20k
Frequency (Hz)
0
5
10
15
20
25
30
35
Order(N)
50
50
90
90
90
95
95
95
95
98
98
98
98
98
98
Without onset removal With onset removal
Change in modal energy ratio
Listening Tests
• Using 2702-point Lebedev grid for spherical harmonic analysis
• Comparing synthesised HRTFs to measured HRTFs from Gauss set
where no measurement available in Lebedev set
• Three positions: front (2˚,0˚), rear-right (-100˚,0˚), up-left (30˚,30˚)
• Nearest measurement distances: 1.44˚, 2.41˚, 1.18˚ respectively
• Stimulus was a repeated pink noise burst

(750ms burst, 20ms half-cosine fade-in/-out, 1s silence)
21
Listening Test A
• Two alternative forced choice test (2AFC)
• Hypothesis: HRTFs synthesised at order N=35 (no onset removal)
are inaudible from those measured at the target position.
• Null hypothesis corresponds to group average detection rate of 65%
• Design balanced type-I and type-II error levels, keeping them below
5% after accounting for repeated tests with Sidak correction
[Leventhal 1986]
22
Listening Test A
• Critical number of 125 or more correct answers out of 216
• 12 assessors performed 18 repeats for each of the 3 conditions
• Could repeat and freely switch between stimuli as often as needed
23
(2°
,0°
) (-100°
,0°
) (30°
,30°
)
Position
0
10
20
30
40
50
60
70
80
90
100
Percentagecorrectanswers
2
2
3
2
2
3
2
2
2
3
2
2
Listening Test A - Results
24
• Front (2˚,0˚): 118 of 216 correct
• Rear-right (-100˚,0˚): 118 of 216 correct
• Up-left (30˚,30˚): 115 of 216 correct
Listening Test A - Discussion
• These results suggest that the 2702-point Lebedev grid dataset can
be used with spherical harmonic interpolation at order N=35 to
obtain HRTFs that are indistinguishable from real measurements
25
Listening Test B
• How do the differences to a target measurement compare between
relevant options?
• Multiple stimulus test with hidden reference, 

rating overall difference to the reference
• Measured reference compared to SH synthesised HRTFs at orders
35 and 5, both with and without separate onset modelling
• Nearest-neighbour measurement in 2702-point Lebedev grid also
compared
26
Listening Test B - Results
• Frontal position:
• N=5 with onsets left in was very
different
• N=5 with onsets processed
separately was very close to
reference but with perceivable
differences
• For all others no difference could be
heard
27
Listening Test B - Results
• Rear-right position:
• Both N=5 versions were very
different to the reference, the case
with separate onset processing was
perceived as more different
• Nearest-neighbour and N=35 with
separate onset processing may have
been perceived as different to the
reference
28
Listening Test B - Results
• Up-left position:
• Nearest-neighbour was very close to
reference but with perceivable
differences
• N=5 with onsets left in was very
different to the reference
• N=5 with onsets processed
separately was close to reference but
with perceivable differences
• For both N=35 cases no differences
were heard
29
Listening Test B - Discussion
• Confirms that N=35 with onsets included cannot be distinguished
from a real measurement
• Nearest-neighbour selection from the Lebedev grid is also very
close but sometimes audible
• With separate onset processing, N=35 is not significantly different
from the reference either
• At N=5 differences are clear, but separate onset processing makes
the differences a great deal smaller (except for the lateral position)
30
Conclusions
• Spherical harmonic interpolation allows generation of HRTF at
arbitrary field points that appears indistinguishable from real
measurements (for this dataset)
• Order limit N=35 is feasible for real-time implementation

(important for next experiment)
• Separate onset processing improves performance at lower-orders
(except at lateral positions) and allows ITD personalisation
31
Thank you
chris.pike@bbc.co.uk
32

More Related Content

What's hot

Rによるデータサイエンス13「樹木モデル」
Rによるデータサイエンス13「樹木モデル」Rによるデータサイエンス13「樹木モデル」
Rによるデータサイエンス13「樹木モデル」Takeshi Mikami
 
[DL輪読会]Focal Loss for Dense Object Detection
[DL輪読会]Focal Loss for Dense Object Detection[DL輪読会]Focal Loss for Dense Object Detection
[DL輪読会]Focal Loss for Dense Object Detection
Deep Learning JP
 
Average Sensitivity of Graph Algorithms
Average Sensitivity of Graph AlgorithmsAverage Sensitivity of Graph Algorithms
Average Sensitivity of Graph Algorithms
Yuichi Yoshida
 
CVIM#11 3. 最小化のための数値計算
CVIM#11 3. 最小化のための数値計算CVIM#11 3. 最小化のための数値計算
CVIM#11 3. 最小化のための数値計算
sleepy_yoshi
 
線形代数の視覚的理解のためのノート
線形代数の視覚的理解のためのノート線形代数の視覚的理解のためのノート
線形代数の視覚的理解のためのノート
Kenji Hiranabe
 
Metric learning ICML2010 tutorial
Metric learning  ICML2010 tutorialMetric learning  ICML2010 tutorial
Metric learning ICML2010 tutorialzukun
 
Bert(transformer,attention)
Bert(transformer,attention)Bert(transformer,attention)
Bert(transformer,attention)
norimatsu5
 
【DL輪読会】時系列予測 Transfomers の精度向上手法
【DL輪読会】時系列予測 Transfomers の精度向上手法【DL輪読会】時系列予測 Transfomers の精度向上手法
【DL輪読会】時系列予測 Transfomers の精度向上手法
Deep Learning JP
 
マルチバイブレータ
マルチバイブレータマルチバイブレータ
マルチバイブレータrobotclub_kut
 
Ml professional bandit_chapter2
Ml professional bandit_chapter2Ml professional bandit_chapter2
Ml professional bandit_chapter2
Takeru Maehara
 
わかりやすいパターン認識 4章
わかりやすいパターン認識 4章わかりやすいパターン認識 4章
わかりやすいパターン認識 4章
Motokawa Tetsuya
 
ニューラルチューリングマシン入門
ニューラルチューリングマシン入門ニューラルチューリングマシン入門
ニューラルチューリングマシン入門
naoto moriyama
 
ガウス過程入門
ガウス過程入門ガウス過程入門
ガウス過程入門
ShoShimoyama
 
[DL輪読会]Deep Learning 第17章 モンテカルロ法
[DL輪読会]Deep Learning 第17章 モンテカルロ法[DL輪読会]Deep Learning 第17章 モンテカルロ法
[DL輪読会]Deep Learning 第17章 モンテカルロ法
Deep Learning JP
 
東京都市大学 データ解析入門 10 ニューラルネットワークと深層学習 1
東京都市大学 データ解析入門 10 ニューラルネットワークと深層学習 1東京都市大学 データ解析入門 10 ニューラルネットワークと深層学習 1
東京都市大学 データ解析入門 10 ニューラルネットワークと深層学習 1
hirokazutanaka
 
PRML上巻勉強会 at 東京大学 資料 第1章後半
PRML上巻勉強会 at 東京大学 資料 第1章後半PRML上巻勉強会 at 東京大学 資料 第1章後半
PRML上巻勉強会 at 東京大学 資料 第1章後半
Ohsawa Goodfellow
 
混合整数ブラックボックス最適化に向けたCMA-ESの改良 / Optuna Meetup #2
混合整数ブラックボックス最適化に向けたCMA-ESの改良 / Optuna Meetup #2混合整数ブラックボックス最適化に向けたCMA-ESの改良 / Optuna Meetup #2
混合整数ブラックボックス最適化に向けたCMA-ESの改良 / Optuna Meetup #2
RHamano
 
コミュニティ分類アルゴリズムの高速化とソーシャルグラフへの応用
コミュニティ分類アルゴリズムの高速化とソーシャルグラフへの応用コミュニティ分類アルゴリズムの高速化とソーシャルグラフへの応用
コミュニティ分類アルゴリズムの高速化とソーシャルグラフへの応用
mosa siru
 

What's hot (20)

Rによるデータサイエンス13「樹木モデル」
Rによるデータサイエンス13「樹木モデル」Rによるデータサイエンス13「樹木モデル」
Rによるデータサイエンス13「樹木モデル」
 
[DL輪読会]Focal Loss for Dense Object Detection
[DL輪読会]Focal Loss for Dense Object Detection[DL輪読会]Focal Loss for Dense Object Detection
[DL輪読会]Focal Loss for Dense Object Detection
 
Chapter2.3.6
Chapter2.3.6Chapter2.3.6
Chapter2.3.6
 
Average Sensitivity of Graph Algorithms
Average Sensitivity of Graph AlgorithmsAverage Sensitivity of Graph Algorithms
Average Sensitivity of Graph Algorithms
 
CVIM#11 3. 最小化のための数値計算
CVIM#11 3. 最小化のための数値計算CVIM#11 3. 最小化のための数値計算
CVIM#11 3. 最小化のための数値計算
 
Prml
PrmlPrml
Prml
 
線形代数の視覚的理解のためのノート
線形代数の視覚的理解のためのノート線形代数の視覚的理解のためのノート
線形代数の視覚的理解のためのノート
 
Metric learning ICML2010 tutorial
Metric learning  ICML2010 tutorialMetric learning  ICML2010 tutorial
Metric learning ICML2010 tutorial
 
Bert(transformer,attention)
Bert(transformer,attention)Bert(transformer,attention)
Bert(transformer,attention)
 
【DL輪読会】時系列予測 Transfomers の精度向上手法
【DL輪読会】時系列予測 Transfomers の精度向上手法【DL輪読会】時系列予測 Transfomers の精度向上手法
【DL輪読会】時系列予測 Transfomers の精度向上手法
 
マルチバイブレータ
マルチバイブレータマルチバイブレータ
マルチバイブレータ
 
Ml professional bandit_chapter2
Ml professional bandit_chapter2Ml professional bandit_chapter2
Ml professional bandit_chapter2
 
わかりやすいパターン認識 4章
わかりやすいパターン認識 4章わかりやすいパターン認識 4章
わかりやすいパターン認識 4章
 
ニューラルチューリングマシン入門
ニューラルチューリングマシン入門ニューラルチューリングマシン入門
ニューラルチューリングマシン入門
 
ガウス過程入門
ガウス過程入門ガウス過程入門
ガウス過程入門
 
[DL輪読会]Deep Learning 第17章 モンテカルロ法
[DL輪読会]Deep Learning 第17章 モンテカルロ法[DL輪読会]Deep Learning 第17章 モンテカルロ法
[DL輪読会]Deep Learning 第17章 モンテカルロ法
 
東京都市大学 データ解析入門 10 ニューラルネットワークと深層学習 1
東京都市大学 データ解析入門 10 ニューラルネットワークと深層学習 1東京都市大学 データ解析入門 10 ニューラルネットワークと深層学習 1
東京都市大学 データ解析入門 10 ニューラルネットワークと深層学習 1
 
PRML上巻勉強会 at 東京大学 資料 第1章後半
PRML上巻勉強会 at 東京大学 資料 第1章後半PRML上巻勉強会 at 東京大学 資料 第1章後半
PRML上巻勉強会 at 東京大学 資料 第1章後半
 
混合整数ブラックボックス最適化に向けたCMA-ESの改良 / Optuna Meetup #2
混合整数ブラックボックス最適化に向けたCMA-ESの改良 / Optuna Meetup #2混合整数ブラックボックス最適化に向けたCMA-ESの改良 / Optuna Meetup #2
混合整数ブラックボックス最適化に向けたCMA-ESの改良 / Optuna Meetup #2
 
コミュニティ分類アルゴリズムの高速化とソーシャルグラフへの応用
コミュニティ分類アルゴリズムの高速化とソーシャルグラフへの応用コミュニティ分類アルゴリズムの高速化とソーシャルグラフへの応用
コミュニティ分類アルゴリズムの高速化とソーシャルグラフへの応用
 

Similar to Subjective Assessment of HRTF Interpolation with Spherical Harmonics - Chris Pike and Tony Tew

3D Spatial Response
3D Spatial Response3D Spatial Response
3D Spatial Response
Ramin Anushiravani
 
Slide Handouts with Notes
Slide Handouts with NotesSlide Handouts with Notes
Slide Handouts with NotesLeon Nguyen
 
Wavelet Transform and DSP Applications
Wavelet Transform and DSP ApplicationsWavelet Transform and DSP Applications
Wavelet Transform and DSP Applications
University of Technology - Iraq
 
US_pres.pptx
US_pres.pptxUS_pres.pptx
US_pres.pptx
amirjodeiry
 
TH4.T05.3_IGARSS-2011-3233-BOUCHER.ppt
TH4.T05.3_IGARSS-2011-3233-BOUCHER.pptTH4.T05.3_IGARSS-2011-3233-BOUCHER.ppt
TH4.T05.3_IGARSS-2011-3233-BOUCHER.pptgrssieee
 
Sound Power Basics
Sound Power BasicsSound Power Basics
Sound Power Basics
mktgpcb
 
Feasibility of EEG Super-Resolution Using Deep Convolutional Networks
Feasibility of EEG Super-Resolution Using Deep Convolutional NetworksFeasibility of EEG Super-Resolution Using Deep Convolutional Networks
Feasibility of EEG Super-Resolution Using Deep Convolutional Networks
Sangjun Han
 
Fast Sparse 2-D DFT Computation using Sparse-Graph Alias Codes
Fast Sparse 2-D DFT Computation using Sparse-Graph Alias CodesFast Sparse 2-D DFT Computation using Sparse-Graph Alias Codes
Fast Sparse 2-D DFT Computation using Sparse-Graph Alias Codes
Frank Ong
 
Defense - Sound space rendering based on the virtual Sound space rendering ba...
Defense - Sound space rendering based on the virtual Sound space rendering ba...Defense - Sound space rendering based on the virtual Sound space rendering ba...
Defense - Sound space rendering based on the virtual Sound space rendering ba...
JunjieShi3
 
PDF3.pdf
PDF3.pdfPDF3.pdf
PDF3.pdf
CharurubaN
 
noise.pdf
noise.pdfnoise.pdf
noise.pdf
ArunKumarDhupam
 
Fundamentals of music processing chapter 6 발표자료
Fundamentals of music processing chapter 6 발표자료Fundamentals of music processing chapter 6 발표자료
Fundamentals of music processing chapter 6 발표자료
Jeong Choi
 
Oscilloscope Lab
Oscilloscope LabOscilloscope Lab
Oscilloscope Lab
cyberns_
 
communication systems ppt on Frequency modulation
communication systems ppt on Frequency modulationcommunication systems ppt on Frequency modulation
communication systems ppt on Frequency modulation
NatarajVijapur
 
Sampling
SamplingSampling
Doppler Effect
Doppler EffectDoppler Effect
Doppler Effect
Nusirat Adedewe
 
Doppler Effect - Ultrasound
Doppler Effect - UltrasoundDoppler Effect - Ultrasound
Doppler Effect - Ultrasound
Victor Ekpo
 
wave-propagation
wave-propagationwave-propagation
wave-propagation
ATTO RATHORE
 
Mixed Time Frequency Approach for Multipoint Room Response Equalization
Mixed Time Frequency Approach for Multipoint Room Response EqualizationMixed Time Frequency Approach for Multipoint Room Response Equalization
Mixed Time Frequency Approach for Multipoint Room Response Equalization
a3labdsp
 

Similar to Subjective Assessment of HRTF Interpolation with Spherical Harmonics - Chris Pike and Tony Tew (20)

3D Spatial Response
3D Spatial Response3D Spatial Response
3D Spatial Response
 
Slide Handouts with Notes
Slide Handouts with NotesSlide Handouts with Notes
Slide Handouts with Notes
 
Measuring Jitter Using Phase Noise Techniques
Measuring Jitter Using Phase Noise TechniquesMeasuring Jitter Using Phase Noise Techniques
Measuring Jitter Using Phase Noise Techniques
 
Wavelet Transform and DSP Applications
Wavelet Transform and DSP ApplicationsWavelet Transform and DSP Applications
Wavelet Transform and DSP Applications
 
US_pres.pptx
US_pres.pptxUS_pres.pptx
US_pres.pptx
 
TH4.T05.3_IGARSS-2011-3233-BOUCHER.ppt
TH4.T05.3_IGARSS-2011-3233-BOUCHER.pptTH4.T05.3_IGARSS-2011-3233-BOUCHER.ppt
TH4.T05.3_IGARSS-2011-3233-BOUCHER.ppt
 
Sound Power Basics
Sound Power BasicsSound Power Basics
Sound Power Basics
 
Feasibility of EEG Super-Resolution Using Deep Convolutional Networks
Feasibility of EEG Super-Resolution Using Deep Convolutional NetworksFeasibility of EEG Super-Resolution Using Deep Convolutional Networks
Feasibility of EEG Super-Resolution Using Deep Convolutional Networks
 
Fast Sparse 2-D DFT Computation using Sparse-Graph Alias Codes
Fast Sparse 2-D DFT Computation using Sparse-Graph Alias CodesFast Sparse 2-D DFT Computation using Sparse-Graph Alias Codes
Fast Sparse 2-D DFT Computation using Sparse-Graph Alias Codes
 
Defense - Sound space rendering based on the virtual Sound space rendering ba...
Defense - Sound space rendering based on the virtual Sound space rendering ba...Defense - Sound space rendering based on the virtual Sound space rendering ba...
Defense - Sound space rendering based on the virtual Sound space rendering ba...
 
PDF3.pdf
PDF3.pdfPDF3.pdf
PDF3.pdf
 
noise.pdf
noise.pdfnoise.pdf
noise.pdf
 
Fundamentals of music processing chapter 6 발표자료
Fundamentals of music processing chapter 6 발표자료Fundamentals of music processing chapter 6 발표자료
Fundamentals of music processing chapter 6 발표자료
 
Oscilloscope Lab
Oscilloscope LabOscilloscope Lab
Oscilloscope Lab
 
communication systems ppt on Frequency modulation
communication systems ppt on Frequency modulationcommunication systems ppt on Frequency modulation
communication systems ppt on Frequency modulation
 
Sampling
SamplingSampling
Sampling
 
Doppler Effect
Doppler EffectDoppler Effect
Doppler Effect
 
Doppler Effect - Ultrasound
Doppler Effect - UltrasoundDoppler Effect - Ultrasound
Doppler Effect - Ultrasound
 
wave-propagation
wave-propagationwave-propagation
wave-propagation
 
Mixed Time Frequency Approach for Multipoint Room Response Equalization
Mixed Time Frequency Approach for Multipoint Room Response EqualizationMixed Time Frequency Approach for Multipoint Room Response Equalization
Mixed Time Frequency Approach for Multipoint Room Response Equalization
 

Recently uploaded

Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
AJAYKUMARPUND1
 
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
ydteq
 
Swimming pool mechanical components design.pptx
Swimming pool  mechanical components design.pptxSwimming pool  mechanical components design.pptx
Swimming pool mechanical components design.pptx
yokeleetan1
 
Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
Massimo Talia
 
Literature Review Basics and Understanding Reference Management.pptx
Literature Review Basics and Understanding Reference Management.pptxLiterature Review Basics and Understanding Reference Management.pptx
Literature Review Basics and Understanding Reference Management.pptx
Dr Ramhari Poudyal
 
basic-wireline-operations-course-mahmoud-f-radwan.pdf
basic-wireline-operations-course-mahmoud-f-radwan.pdfbasic-wireline-operations-course-mahmoud-f-radwan.pdf
basic-wireline-operations-course-mahmoud-f-radwan.pdf
NidhalKahouli2
 
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...
ssuser7dcef0
 
6th International Conference on Machine Learning & Applications (CMLA 2024)
6th International Conference on Machine Learning & Applications (CMLA 2024)6th International Conference on Machine Learning & Applications (CMLA 2024)
6th International Conference on Machine Learning & Applications (CMLA 2024)
ClaraZara1
 
Fundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptxFundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptx
manasideore6
 
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdfWater Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation & Control
 
MCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdfMCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdf
Osamah Alsalih
 
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
Amil Baba Dawood bangali
 
Water billing management system project report.pdf
Water billing management system project report.pdfWater billing management system project report.pdf
Water billing management system project report.pdf
Kamal Acharya
 
Understanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine LearningUnderstanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine Learning
SUTEJAS
 
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdfTop 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Teleport Manpower Consultant
 
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdfHybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
fxintegritypublishin
 
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTSHeap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Soumen Santra
 
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
thanhdowork
 
Unbalanced Three Phase Systems and circuits.pptx
Unbalanced Three Phase Systems and circuits.pptxUnbalanced Three Phase Systems and circuits.pptx
Unbalanced Three Phase Systems and circuits.pptx
ChristineTorrepenida1
 
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
MdTanvirMahtab2
 

Recently uploaded (20)

Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
 
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
 
Swimming pool mechanical components design.pptx
Swimming pool  mechanical components design.pptxSwimming pool  mechanical components design.pptx
Swimming pool mechanical components design.pptx
 
Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
 
Literature Review Basics and Understanding Reference Management.pptx
Literature Review Basics and Understanding Reference Management.pptxLiterature Review Basics and Understanding Reference Management.pptx
Literature Review Basics and Understanding Reference Management.pptx
 
basic-wireline-operations-course-mahmoud-f-radwan.pdf
basic-wireline-operations-course-mahmoud-f-radwan.pdfbasic-wireline-operations-course-mahmoud-f-radwan.pdf
basic-wireline-operations-course-mahmoud-f-radwan.pdf
 
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...
 
6th International Conference on Machine Learning & Applications (CMLA 2024)
6th International Conference on Machine Learning & Applications (CMLA 2024)6th International Conference on Machine Learning & Applications (CMLA 2024)
6th International Conference on Machine Learning & Applications (CMLA 2024)
 
Fundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptxFundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptx
 
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdfWater Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdf
 
MCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdfMCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdf
 
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
 
Water billing management system project report.pdf
Water billing management system project report.pdfWater billing management system project report.pdf
Water billing management system project report.pdf
 
Understanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine LearningUnderstanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine Learning
 
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdfTop 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
 
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdfHybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
 
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTSHeap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
 
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
 
Unbalanced Three Phase Systems and circuits.pptx
Unbalanced Three Phase Systems and circuits.pptxUnbalanced Three Phase Systems and circuits.pptx
Unbalanced Three Phase Systems and circuits.pptx
 
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
 

Subjective Assessment of HRTF Interpolation with Spherical Harmonics - Chris Pike and Tony Tew

  • 1. Subjective Assessment of HRTF Interpolation with Spherical Harmonics Chris Pike and Tony Tew
  • 2. Binaural Filter Processor Mono Input Audio Binaural Filter Generator Source Position Binaural Filter Data Binaural Output Audio Binaural Rendering • Spatialising a monophonic signal for headphone playback with a filter for each ear • We’re discussing the role of filter generation (via interpolation) • Not how you swap between filters (commutation) [Jot 1995] 2
  • 4. Why Interpolate HRTFs? • Reduce measurement requirements • Reduce data storage requirements • Obtain filters for precise source position
 (a direction-continuous HRTF) 4
  • 5. HRTF Interpolation Techniques • Local techniques
 e.g. linear weighting of local measurements around target position • Global techniques
 e.g. linear weighting of spectral or spatial basis functions 5
  • 6. Interpolation with Spherical Harmonics • First suggested by Jot (“binaural B-format”) [Jot 1995] • Investigated practically by Evans at order 17 [Evans et al 1998] • Theory further developed, extended to range extrapolation, extended to irregular and incomplete sample grids with regularisation [Duraiswami et al 2004, Pollow et al 2012] • Metrics and implementation issues discussed by [Richter et al. 2014] • Applied in binaural reproduction of plane waves with reduced modal order (sound field synthesis) [Bernschütz et al 2014] 6
  • 7. Motivation • Use spherical harmonic interpolation to generate HRTFs at arbitrary directions for investigation of binaural rendering of virtual sound fields e.g. higher-order ambisonics • Although the theory is well developed, there appears to be no subjective validation of the interpolation method • Research question: Can spherical harmonic interpolation of HRTFs generate filters indistinguishable from measured data at complexity that could be run in real-time for complex scenes? 7
  • 8. Theoretical Approach • Evaluate the HRTF with a solution of the Helmholtz equation • Assume source at the ear and evaluate the outgoing acoustic radiation at any point outside of the head surface (reciprocity principal) • Expand the pressure field as a series of spatial modes 8
  • 9. Analysis and Synthesis • Project HRTF measurements of constant distance onto the spherical harmonic expansion coefficients using integration over the sphere (spatial Fourier transform) • For given sample weights (quadrature) the discrete spatial Fourier transform is used • Performed separately for each ear 
 (unless the head is left-right symmetric) • An HRTF can then be synthesised at arbitrary points using the inverse spatial Fourier transform 9
  • 10. Sources of Error • Spatial aliasing
 Sampled signal has energy in higher modes than sample grid offers • Truncation error
 Transform order lower than grid order e.g. for real-time efficiency • Ill-conditioning in range extrapolation
 Numerical instability at low frequency due to ratios of spherical Hankel functions
 10
  • 11. Full Sphere HRTF Dataset • Bernschütz measured a full-sphere far-field HRTF dataset on the Neumann KU100 dummy head microphone [Bernschütz 2013] • Three quadrature grids that sample the spherical harmonics well up to given orders (Ng > 35) • Used to obtain HRTFs at arbitrary source directions via spherical harmonic interpolation [Bernschütz et al 2014] • Suggests series order 35 is valid for temporal frequencies up to 20kHz 11
  • 12. Quadratures 12 Gauss Ng =89 L=16020 Lebedev Ng =44 L=2702 Lebedev Ng =41 L=2354
  • 13. Modal Intensity Distributions 13 Gauss Ng = 89 Lebedev Ng = 44
  • 14. 0 100 250 500 1000 2000 4000 8000 16000 Frequency (Hz) -40 -30 -20 -10 0 10 20 Magnitude(dB) Measured 5 10 20 35 44 Synthesis Truncation Order 14 Left ear at (0˚,0˚) 0 100 250 500 1000 2000 4000 8000 16000 Frequency (Hz) -40 -30 -20 -10 0 10 20 Magnitude(dB) Measured 5 10 20 35 44 Left ear at (45˚,0˚)
  • 15. 0 100 250 500 1000 2000 4000 8000 16000 Frequency (Hz) -40 -30 -20 -10 0 10 20 Magnitude(dB) Measured 5 10 20 35 44 0 100 250 500 1000 2000 4000 8000 16000 Frequency (Hz) -40 -30 -20 -10 0 10 20 Magnitude(dB) Measured 5 10 20 35 44 Synthesis Truncation Order 15 Left ear at (90˚,0˚) Left ear at (135˚,0˚)
  • 16. 0 100 250 500 1000 2000 4000 8000 16000 Frequency (Hz) 35 30 25 20 15 10 5 0 Order(N) -50 -45 -40 -35 -30 -25 -20 -15 -10 -5 0 (dB) Modal Intensity Distribution • The intensity distribution energy at increasing modal order for higher temporal frequencies • Truncation will remove this energy and therefore we get a low-pass effect, as observed by 
 [Bernschütz et al 2014] 16
  • 17. 500 1k 2k 5k 10k 20k Frequency (Hz) 0 5 10 15 20 25 30 35 Order(N) 50 50 50 50 90 90 90 90 95 95 95 95 98 98 98 98 Modal Energy Ratio • Can look at truncation error using contour plot of modal energy ratio • At truncation order N=35 we retain 98% of energy up to 20kHz 17
  • 18. 0 100 250 500 1000 2000 4000 8000 16000 Frequency (Hz) 0 5 10 15 20 25 30 CentreofPower(J2) off-centre acoustic centring onset removal Acoustic Centring • [Richter et al 2014] showed that an acoustic centring can reduce the energy in higher modal orders • Translation of measurement points to estimated acoustic centre (including distance adjustment) • Here a frequency-independent offset of 9.75cm in y-axis was used 18
  • 19. 0 100 250 500 1000 2000 4000 8000 16000 Frequency (Hz) 0 5 10 15 20 25 30 CentreofPower(J2) off-centre acoustic centring onset removal Acoustic Centring • Similar reduction in centre of power is achieved with a conventional broadband onset extraction from the HRIRs • A parametric broadband time-of- arrival modal was used [Ziegelwanger et al 2014] • This has advantages in resynthesis: efficiency and ITD personalisation 19
  • 20. Acoustic Centring 20 500 1k 2k 5k 10k 20k Frequency (Hz) 0 5 10 15 20 25 30 35 Order(N) 50 50 50 50 90 90 90 90 95 95 95 95 98 98 98 98 500 1k 2k 5k 10k 20k Frequency (Hz) 0 5 10 15 20 25 30 35 Order(N) 50 50 90 90 90 95 95 95 95 98 98 98 98 98 98 Without onset removal With onset removal Change in modal energy ratio
  • 21. Listening Tests • Using 2702-point Lebedev grid for spherical harmonic analysis • Comparing synthesised HRTFs to measured HRTFs from Gauss set where no measurement available in Lebedev set • Three positions: front (2˚,0˚), rear-right (-100˚,0˚), up-left (30˚,30˚) • Nearest measurement distances: 1.44˚, 2.41˚, 1.18˚ respectively • Stimulus was a repeated pink noise burst
 (750ms burst, 20ms half-cosine fade-in/-out, 1s silence) 21
  • 22. Listening Test A • Two alternative forced choice test (2AFC) • Hypothesis: HRTFs synthesised at order N=35 (no onset removal) are inaudible from those measured at the target position. • Null hypothesis corresponds to group average detection rate of 65% • Design balanced type-I and type-II error levels, keeping them below 5% after accounting for repeated tests with Sidak correction [Leventhal 1986] 22
  • 23. Listening Test A • Critical number of 125 or more correct answers out of 216 • 12 assessors performed 18 repeats for each of the 3 conditions • Could repeat and freely switch between stimuli as often as needed 23
  • 24. (2° ,0° ) (-100° ,0° ) (30° ,30° ) Position 0 10 20 30 40 50 60 70 80 90 100 Percentagecorrectanswers 2 2 3 2 2 3 2 2 2 3 2 2 Listening Test A - Results 24 • Front (2˚,0˚): 118 of 216 correct • Rear-right (-100˚,0˚): 118 of 216 correct • Up-left (30˚,30˚): 115 of 216 correct
  • 25. Listening Test A - Discussion • These results suggest that the 2702-point Lebedev grid dataset can be used with spherical harmonic interpolation at order N=35 to obtain HRTFs that are indistinguishable from real measurements 25
  • 26. Listening Test B • How do the differences to a target measurement compare between relevant options? • Multiple stimulus test with hidden reference, 
 rating overall difference to the reference • Measured reference compared to SH synthesised HRTFs at orders 35 and 5, both with and without separate onset modelling • Nearest-neighbour measurement in 2702-point Lebedev grid also compared 26
  • 27. Listening Test B - Results • Frontal position: • N=5 with onsets left in was very different • N=5 with onsets processed separately was very close to reference but with perceivable differences • For all others no difference could be heard 27
  • 28. Listening Test B - Results • Rear-right position: • Both N=5 versions were very different to the reference, the case with separate onset processing was perceived as more different • Nearest-neighbour and N=35 with separate onset processing may have been perceived as different to the reference 28
  • 29. Listening Test B - Results • Up-left position: • Nearest-neighbour was very close to reference but with perceivable differences • N=5 with onsets left in was very different to the reference • N=5 with onsets processed separately was close to reference but with perceivable differences • For both N=35 cases no differences were heard 29
  • 30. Listening Test B - Discussion • Confirms that N=35 with onsets included cannot be distinguished from a real measurement • Nearest-neighbour selection from the Lebedev grid is also very close but sometimes audible • With separate onset processing, N=35 is not significantly different from the reference either • At N=5 differences are clear, but separate onset processing makes the differences a great deal smaller (except for the lateral position) 30
  • 31. Conclusions • Spherical harmonic interpolation allows generation of HRTF at arbitrary field points that appears indistinguishable from real measurements (for this dataset) • Order limit N=35 is feasible for real-time implementation
 (important for next experiment) • Separate onset processing improves performance at lower-orders (except at lateral positions) and allows ITD personalisation 31