SlideShare a Scribd company logo
1 of 32
Download to read offline
Subjective Assessment of HRTF
Interpolation with Spherical Harmonics
Chris Pike and Tony Tew
Binaural
Filter
Processor
Mono
Input
Audio
Binaural
Filter
Generator
Source
Position
Binaural
Filter
Data
Binaural
Output
Audio
Binaural Rendering
• Spatialising a monophonic signal for
headphone playback with a filter for
each ear
• We’re discussing the role of filter
generation (via interpolation)
• Not how you swap between filters
(commutation) [Jot 1995]
2
Head-Related Transfer Function (HRTF)
3
Why Interpolate HRTFs?
• Reduce measurement requirements
• Reduce data storage requirements
• Obtain filters for precise source position

(a direction-continuous HRTF)
4
HRTF Interpolation Techniques
• Local techniques

e.g. linear weighting of local measurements around target position
• Global techniques

e.g. linear weighting of spectral or spatial basis functions
5
Interpolation with Spherical Harmonics
• First suggested by Jot (“binaural B-format”) [Jot 1995]
• Investigated practically by Evans at order 17 [Evans et al 1998]
• Theory further developed, extended to range extrapolation, extended
to irregular and incomplete sample grids with regularisation
[Duraiswami et al 2004, Pollow et al 2012]
• Metrics and implementation issues discussed by [Richter et al. 2014]
• Applied in binaural reproduction of plane waves with reduced modal
order (sound field synthesis) [Bernschütz et al 2014]
6
Motivation
• Use spherical harmonic interpolation to generate HRTFs at arbitrary
directions for investigation of binaural rendering of virtual sound
fields e.g. higher-order ambisonics
• Although the theory is well developed, there appears to be no
subjective validation of the interpolation method
• Research question: Can spherical harmonic interpolation of HRTFs
generate filters indistinguishable from measured data at complexity
that could be run in real-time for complex scenes?
7
Theoretical Approach
• Evaluate the HRTF with a solution of the Helmholtz equation
• Assume source at the ear and evaluate the outgoing acoustic
radiation at any point outside of the head surface (reciprocity
principal)
• Expand the pressure field as a series of spatial modes
8
Analysis and Synthesis
• Project HRTF measurements of constant distance onto the spherical
harmonic expansion coefficients using integration over the sphere
(spatial Fourier transform)
• For given sample weights (quadrature) the discrete spatial Fourier
transform is used
• Performed separately for each ear 

(unless the head is left-right symmetric)
• An HRTF can then be synthesised at arbitrary points using the inverse
spatial Fourier transform
9
Sources of Error
• Spatial aliasing

Sampled signal has energy in higher modes than sample grid offers
• Truncation error

Transform order lower than grid order e.g. for real-time efficiency
• Ill-conditioning in range extrapolation

Numerical instability at low frequency due to ratios of spherical
Hankel functions

10
Full Sphere HRTF Dataset
• Bernschütz measured a full-sphere far-field HRTF dataset on the
Neumann KU100 dummy head microphone [Bernschütz 2013]
• Three quadrature grids that sample the spherical harmonics well up
to given orders (Ng > 35)
• Used to obtain HRTFs at arbitrary source directions via spherical
harmonic interpolation [Bernschütz et al 2014]
• Suggests series order 35 is valid for temporal frequencies up to
20kHz
11
Quadratures
12
Gauss Ng =89 L=16020 Lebedev Ng =44 L=2702 Lebedev Ng =41 L=2354
Modal Intensity Distributions
13
Gauss Ng = 89 Lebedev Ng = 44
0 100 250 500 1000 2000 4000 8000 16000
Frequency (Hz)
-40
-30
-20
-10
0
10
20
Magnitude(dB)
Measured
5
10
20
35
44
Synthesis Truncation Order
14
Left ear at (0˚,0˚)
0 100 250 500 1000 2000 4000 8000 16000
Frequency (Hz)
-40
-30
-20
-10
0
10
20
Magnitude(dB)
Measured
5
10
20
35
44
Left ear at (45˚,0˚)
0 100 250 500 1000 2000 4000 8000 16000
Frequency (Hz)
-40
-30
-20
-10
0
10
20
Magnitude(dB)
Measured
5
10
20
35
44
0 100 250 500 1000 2000 4000 8000 16000
Frequency (Hz)
-40
-30
-20
-10
0
10
20
Magnitude(dB)
Measured
5
10
20
35
44
Synthesis Truncation Order
15
Left ear at (90˚,0˚) Left ear at (135˚,0˚)
0 100 250 500 1000 2000 4000 8000 16000
Frequency (Hz)
35
30
25
20
15
10
5
0
Order(N)
-50
-45
-40
-35
-30
-25
-20
-15
-10
-5
0
(dB)
Modal Intensity Distribution
• The intensity distribution energy at
increasing modal order for higher
temporal frequencies
• Truncation will remove this energy
and therefore we get a low-pass
effect, as observed by 

[Bernschütz et al 2014]
16
500 1k 2k 5k 10k 20k
Frequency (Hz)
0
5
10
15
20
25
30
35
Order(N)
50
50
50
50
90
90
90
90
95
95
95
95
98
98
98
98
Modal Energy Ratio
• Can look at truncation error using
contour plot of modal energy ratio
• At truncation order N=35 we retain
98% of energy up to 20kHz
17
0 100 250 500 1000 2000 4000 8000 16000
Frequency (Hz)
0
5
10
15
20
25
30
CentreofPower(J2)
off-centre
acoustic centring
onset removal
Acoustic Centring
• [Richter et al 2014] showed that an
acoustic centring can reduce the
energy in higher modal orders
• Translation of measurement points
to estimated acoustic centre
(including distance adjustment)
• Here a frequency-independent
offset of 9.75cm in y-axis was used
18
0 100 250 500 1000 2000 4000 8000 16000
Frequency (Hz)
0
5
10
15
20
25
30
CentreofPower(J2)
off-centre
acoustic centring
onset removal
Acoustic Centring
• Similar reduction in centre of power
is achieved with a conventional
broadband onset extraction from the
HRIRs
• A parametric broadband time-of-
arrival modal was used
[Ziegelwanger et al 2014]
• This has advantages in resynthesis:
efficiency and ITD personalisation
19
Acoustic Centring
20
500 1k 2k 5k 10k 20k
Frequency (Hz)
0
5
10
15
20
25
30
35
Order(N)
50
50
50
50
90
90
90
90
95
95
95
95
98
98
98
98
500 1k 2k 5k 10k 20k
Frequency (Hz)
0
5
10
15
20
25
30
35
Order(N)
50
50
90
90
90
95
95
95
95
98
98
98
98
98
98
Without onset removal With onset removal
Change in modal energy ratio
Listening Tests
• Using 2702-point Lebedev grid for spherical harmonic analysis
• Comparing synthesised HRTFs to measured HRTFs from Gauss set
where no measurement available in Lebedev set
• Three positions: front (2˚,0˚), rear-right (-100˚,0˚), up-left (30˚,30˚)
• Nearest measurement distances: 1.44˚, 2.41˚, 1.18˚ respectively
• Stimulus was a repeated pink noise burst

(750ms burst, 20ms half-cosine fade-in/-out, 1s silence)
21
Listening Test A
• Two alternative forced choice test (2AFC)
• Hypothesis: HRTFs synthesised at order N=35 (no onset removal)
are inaudible from those measured at the target position.
• Null hypothesis corresponds to group average detection rate of 65%
• Design balanced type-I and type-II error levels, keeping them below
5% after accounting for repeated tests with Sidak correction
[Leventhal 1986]
22
Listening Test A
• Critical number of 125 or more correct answers out of 216
• 12 assessors performed 18 repeats for each of the 3 conditions
• Could repeat and freely switch between stimuli as often as needed
23
(2°
,0°
) (-100°
,0°
) (30°
,30°
)
Position
0
10
20
30
40
50
60
70
80
90
100
Percentagecorrectanswers
2
2
3
2
2
3
2
2
2
3
2
2
Listening Test A - Results
24
• Front (2˚,0˚): 118 of 216 correct
• Rear-right (-100˚,0˚): 118 of 216 correct
• Up-left (30˚,30˚): 115 of 216 correct
Listening Test A - Discussion
• These results suggest that the 2702-point Lebedev grid dataset can
be used with spherical harmonic interpolation at order N=35 to
obtain HRTFs that are indistinguishable from real measurements
25
Listening Test B
• How do the differences to a target measurement compare between
relevant options?
• Multiple stimulus test with hidden reference, 

rating overall difference to the reference
• Measured reference compared to SH synthesised HRTFs at orders
35 and 5, both with and without separate onset modelling
• Nearest-neighbour measurement in 2702-point Lebedev grid also
compared
26
Listening Test B - Results
• Frontal position:
• N=5 with onsets left in was very
different
• N=5 with onsets processed
separately was very close to
reference but with perceivable
differences
• For all others no difference could be
heard
27
Listening Test B - Results
• Rear-right position:
• Both N=5 versions were very
different to the reference, the case
with separate onset processing was
perceived as more different
• Nearest-neighbour and N=35 with
separate onset processing may have
been perceived as different to the
reference
28
Listening Test B - Results
• Up-left position:
• Nearest-neighbour was very close to
reference but with perceivable
differences
• N=5 with onsets left in was very
different to the reference
• N=5 with onsets processed
separately was close to reference but
with perceivable differences
• For both N=35 cases no differences
were heard
29
Listening Test B - Discussion
• Confirms that N=35 with onsets included cannot be distinguished
from a real measurement
• Nearest-neighbour selection from the Lebedev grid is also very
close but sometimes audible
• With separate onset processing, N=35 is not significantly different
from the reference either
• At N=5 differences are clear, but separate onset processing makes
the differences a great deal smaller (except for the lateral position)
30
Conclusions
• Spherical harmonic interpolation allows generation of HRTF at
arbitrary field points that appears indistinguishable from real
measurements (for this dataset)
• Order limit N=35 is feasible for real-time implementation

(important for next experiment)
• Separate onset processing improves performance at lower-orders
(except at lateral positions) and allows ITD personalisation
31
Thank you
chris.pike@bbc.co.uk
32

More Related Content

What's hot

№79 конус, қиық конус
№79  конус, қиық конус   №79  конус, қиық конус
№79 конус, қиық конус
proxiGlen
 
クックパッド春の超絶技巧パンまつり 超絶技巧プログラミング編 資料
クックパッド春の超絶技巧パンまつり 超絶技巧プログラミング編 資料クックパッド春の超絶技巧パンまつり 超絶技巧プログラミング編 資料
クックパッド春の超絶技巧パンまつり 超絶技巧プログラミング編 資料
mametter
 
ディジタル信号処理 課題解説 その8
ディジタル信号処理 課題解説 その8ディジタル信号処理 課題解説 その8
ディジタル信号処理 課題解説 その8
noname409
 

What's hot (20)

№79 конус, қиық конус
№79  конус, қиық конус   №79  конус, қиық конус
№79 конус, қиық конус
 
Rolling Hashを殺す話
Rolling Hashを殺す話Rolling Hashを殺す話
Rolling Hashを殺す話
 
クックパッド春の超絶技巧パンまつり 超絶技巧プログラミング編 資料
クックパッド春の超絶技巧パンまつり 超絶技巧プログラミング編 資料クックパッド春の超絶技巧パンまつり 超絶技巧プログラミング編 資料
クックパッド春の超絶技巧パンまつり 超絶技巧プログラミング編 資料
 
Code Formula 2014 予選A 解説
Code Formula 2014 予選A 解説Code Formula 2014 予選A 解説
Code Formula 2014 予選A 解説
 
指数時間アルゴリズム入門
指数時間アルゴリズム入門指数時間アルゴリズム入門
指数時間アルゴリズム入門
 
直交領域探索
直交領域探索直交領域探索
直交領域探索
 
ディジタル信号処理 課題解説 その8
ディジタル信号処理 課題解説 その8ディジタル信号処理 課題解説 その8
ディジタル信号処理 課題解説 その8
 
コードゴルフのススメ(C言語)
コードゴルフのススメ(C言語)コードゴルフのススメ(C言語)
コードゴルフのススメ(C言語)
 
五次方程式はやっぱり解ける #日曜数学会
五次方程式はやっぱり解ける #日曜数学会五次方程式はやっぱり解ける #日曜数学会
五次方程式はやっぱり解ける #日曜数学会
 
Chapter4 1 takmin
Chapter4 1 takminChapter4 1 takmin
Chapter4 1 takmin
 
ぷよぷよAIの新しい探索法
ぷよぷよAIの新しい探索法ぷよぷよAIの新しい探索法
ぷよぷよAIの新しい探索法
 
Convex Hull Trick
Convex Hull TrickConvex Hull Trick
Convex Hull Trick
 
Popcntによるハミング距離計算
Popcntによるハミング距離計算Popcntによるハミング距離計算
Popcntによるハミング距離計算
 
実用Brainf*ckプログラミング入門編
実用Brainf*ckプログラミング入門編実用Brainf*ckプログラミング入門編
実用Brainf*ckプログラミング入門編
 
ユークリッド最小全域木
ユークリッド最小全域木ユークリッド最小全域木
ユークリッド最小全域木
 
オセロの終盤ソルバーを100倍以上高速化した話
オセロの終盤ソルバーを100倍以上高速化した話オセロの終盤ソルバーを100倍以上高速化した話
オセロの終盤ソルバーを100倍以上高速化した話
 
Disco Presents ディスカバリーチャンネルプログラミングコンテスト2016 本選 解説
Disco Presents ディスカバリーチャンネルプログラミングコンテスト2016 本選 解説Disco Presents ディスカバリーチャンネルプログラミングコンテスト2016 本選 解説
Disco Presents ディスカバリーチャンネルプログラミングコンテスト2016 本選 解説
 
双方向パストレーシングレンダラedubpt解説
双方向パストレーシングレンダラedubpt解説双方向パストレーシングレンダラedubpt解説
双方向パストレーシングレンダラedubpt解説
 
AtCoder Regular Contest 048
AtCoder Regular Contest 048AtCoder Regular Contest 048
AtCoder Regular Contest 048
 
Matrix Multiplication in Strassen Algorithm
Matrix Multiplication in Strassen AlgorithmMatrix Multiplication in Strassen Algorithm
Matrix Multiplication in Strassen Algorithm
 

Similar to Subjective Assessment of HRTF Interpolation with Spherical Harmonics - Chris Pike and Tony Tew

Slide Handouts with Notes
Slide Handouts with NotesSlide Handouts with Notes
Slide Handouts with Notes
Leon Nguyen
 
TH4.T05.3_IGARSS-2011-3233-BOUCHER.ppt
TH4.T05.3_IGARSS-2011-3233-BOUCHER.pptTH4.T05.3_IGARSS-2011-3233-BOUCHER.ppt
TH4.T05.3_IGARSS-2011-3233-BOUCHER.ppt
grssieee
 
Fast Sparse 2-D DFT Computation using Sparse-Graph Alias Codes
Fast Sparse 2-D DFT Computation using Sparse-Graph Alias CodesFast Sparse 2-D DFT Computation using Sparse-Graph Alias Codes
Fast Sparse 2-D DFT Computation using Sparse-Graph Alias Codes
Frank Ong
 
communication systems ppt on Frequency modulation
communication systems ppt on Frequency modulationcommunication systems ppt on Frequency modulation
communication systems ppt on Frequency modulation
NatarajVijapur
 

Similar to Subjective Assessment of HRTF Interpolation with Spherical Harmonics - Chris Pike and Tony Tew (20)

3D Spatial Response
3D Spatial Response3D Spatial Response
3D Spatial Response
 
Slide Handouts with Notes
Slide Handouts with NotesSlide Handouts with Notes
Slide Handouts with Notes
 
Measuring Jitter Using Phase Noise Techniques
Measuring Jitter Using Phase Noise TechniquesMeasuring Jitter Using Phase Noise Techniques
Measuring Jitter Using Phase Noise Techniques
 
Wavelet Transform and DSP Applications
Wavelet Transform and DSP ApplicationsWavelet Transform and DSP Applications
Wavelet Transform and DSP Applications
 
US_pres.pptx
US_pres.pptxUS_pres.pptx
US_pres.pptx
 
TH4.T05.3_IGARSS-2011-3233-BOUCHER.ppt
TH4.T05.3_IGARSS-2011-3233-BOUCHER.pptTH4.T05.3_IGARSS-2011-3233-BOUCHER.ppt
TH4.T05.3_IGARSS-2011-3233-BOUCHER.ppt
 
Sound Power Basics
Sound Power BasicsSound Power Basics
Sound Power Basics
 
Feasibility of EEG Super-Resolution Using Deep Convolutional Networks
Feasibility of EEG Super-Resolution Using Deep Convolutional NetworksFeasibility of EEG Super-Resolution Using Deep Convolutional Networks
Feasibility of EEG Super-Resolution Using Deep Convolutional Networks
 
Fast Sparse 2-D DFT Computation using Sparse-Graph Alias Codes
Fast Sparse 2-D DFT Computation using Sparse-Graph Alias CodesFast Sparse 2-D DFT Computation using Sparse-Graph Alias Codes
Fast Sparse 2-D DFT Computation using Sparse-Graph Alias Codes
 
Defense - Sound space rendering based on the virtual Sound space rendering ba...
Defense - Sound space rendering based on the virtual Sound space rendering ba...Defense - Sound space rendering based on the virtual Sound space rendering ba...
Defense - Sound space rendering based on the virtual Sound space rendering ba...
 
PDF3.pdf
PDF3.pdfPDF3.pdf
PDF3.pdf
 
noise.pdf
noise.pdfnoise.pdf
noise.pdf
 
Fundamentals of music processing chapter 6 발표자료
Fundamentals of music processing chapter 6 발표자료Fundamentals of music processing chapter 6 발표자료
Fundamentals of music processing chapter 6 발표자료
 
Oscilloscope Lab
Oscilloscope LabOscilloscope Lab
Oscilloscope Lab
 
communication systems ppt on Frequency modulation
communication systems ppt on Frequency modulationcommunication systems ppt on Frequency modulation
communication systems ppt on Frequency modulation
 
Sampling
SamplingSampling
Sampling
 
Doppler Effect
Doppler EffectDoppler Effect
Doppler Effect
 
Doppler Effect - Ultrasound
Doppler Effect - UltrasoundDoppler Effect - Ultrasound
Doppler Effect - Ultrasound
 
wave-propagation
wave-propagationwave-propagation
wave-propagation
 
Mixed Time Frequency Approach for Multipoint Room Response Equalization
Mixed Time Frequency Approach for Multipoint Room Response EqualizationMixed Time Frequency Approach for Multipoint Room Response Equalization
Mixed Time Frequency Approach for Multipoint Room Response Equalization
 

Recently uploaded

result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
Tonystark477637
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Christo Ananth
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Christo Ananth
 

Recently uploaded (20)

Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTINGMANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 

Subjective Assessment of HRTF Interpolation with Spherical Harmonics - Chris Pike and Tony Tew

  • 1. Subjective Assessment of HRTF Interpolation with Spherical Harmonics Chris Pike and Tony Tew
  • 2. Binaural Filter Processor Mono Input Audio Binaural Filter Generator Source Position Binaural Filter Data Binaural Output Audio Binaural Rendering • Spatialising a monophonic signal for headphone playback with a filter for each ear • We’re discussing the role of filter generation (via interpolation) • Not how you swap between filters (commutation) [Jot 1995] 2
  • 4. Why Interpolate HRTFs? • Reduce measurement requirements • Reduce data storage requirements • Obtain filters for precise source position
 (a direction-continuous HRTF) 4
  • 5. HRTF Interpolation Techniques • Local techniques
 e.g. linear weighting of local measurements around target position • Global techniques
 e.g. linear weighting of spectral or spatial basis functions 5
  • 6. Interpolation with Spherical Harmonics • First suggested by Jot (“binaural B-format”) [Jot 1995] • Investigated practically by Evans at order 17 [Evans et al 1998] • Theory further developed, extended to range extrapolation, extended to irregular and incomplete sample grids with regularisation [Duraiswami et al 2004, Pollow et al 2012] • Metrics and implementation issues discussed by [Richter et al. 2014] • Applied in binaural reproduction of plane waves with reduced modal order (sound field synthesis) [Bernschütz et al 2014] 6
  • 7. Motivation • Use spherical harmonic interpolation to generate HRTFs at arbitrary directions for investigation of binaural rendering of virtual sound fields e.g. higher-order ambisonics • Although the theory is well developed, there appears to be no subjective validation of the interpolation method • Research question: Can spherical harmonic interpolation of HRTFs generate filters indistinguishable from measured data at complexity that could be run in real-time for complex scenes? 7
  • 8. Theoretical Approach • Evaluate the HRTF with a solution of the Helmholtz equation • Assume source at the ear and evaluate the outgoing acoustic radiation at any point outside of the head surface (reciprocity principal) • Expand the pressure field as a series of spatial modes 8
  • 9. Analysis and Synthesis • Project HRTF measurements of constant distance onto the spherical harmonic expansion coefficients using integration over the sphere (spatial Fourier transform) • For given sample weights (quadrature) the discrete spatial Fourier transform is used • Performed separately for each ear 
 (unless the head is left-right symmetric) • An HRTF can then be synthesised at arbitrary points using the inverse spatial Fourier transform 9
  • 10. Sources of Error • Spatial aliasing
 Sampled signal has energy in higher modes than sample grid offers • Truncation error
 Transform order lower than grid order e.g. for real-time efficiency • Ill-conditioning in range extrapolation
 Numerical instability at low frequency due to ratios of spherical Hankel functions
 10
  • 11. Full Sphere HRTF Dataset • Bernschütz measured a full-sphere far-field HRTF dataset on the Neumann KU100 dummy head microphone [Bernschütz 2013] • Three quadrature grids that sample the spherical harmonics well up to given orders (Ng > 35) • Used to obtain HRTFs at arbitrary source directions via spherical harmonic interpolation [Bernschütz et al 2014] • Suggests series order 35 is valid for temporal frequencies up to 20kHz 11
  • 12. Quadratures 12 Gauss Ng =89 L=16020 Lebedev Ng =44 L=2702 Lebedev Ng =41 L=2354
  • 13. Modal Intensity Distributions 13 Gauss Ng = 89 Lebedev Ng = 44
  • 14. 0 100 250 500 1000 2000 4000 8000 16000 Frequency (Hz) -40 -30 -20 -10 0 10 20 Magnitude(dB) Measured 5 10 20 35 44 Synthesis Truncation Order 14 Left ear at (0˚,0˚) 0 100 250 500 1000 2000 4000 8000 16000 Frequency (Hz) -40 -30 -20 -10 0 10 20 Magnitude(dB) Measured 5 10 20 35 44 Left ear at (45˚,0˚)
  • 15. 0 100 250 500 1000 2000 4000 8000 16000 Frequency (Hz) -40 -30 -20 -10 0 10 20 Magnitude(dB) Measured 5 10 20 35 44 0 100 250 500 1000 2000 4000 8000 16000 Frequency (Hz) -40 -30 -20 -10 0 10 20 Magnitude(dB) Measured 5 10 20 35 44 Synthesis Truncation Order 15 Left ear at (90˚,0˚) Left ear at (135˚,0˚)
  • 16. 0 100 250 500 1000 2000 4000 8000 16000 Frequency (Hz) 35 30 25 20 15 10 5 0 Order(N) -50 -45 -40 -35 -30 -25 -20 -15 -10 -5 0 (dB) Modal Intensity Distribution • The intensity distribution energy at increasing modal order for higher temporal frequencies • Truncation will remove this energy and therefore we get a low-pass effect, as observed by 
 [Bernschütz et al 2014] 16
  • 17. 500 1k 2k 5k 10k 20k Frequency (Hz) 0 5 10 15 20 25 30 35 Order(N) 50 50 50 50 90 90 90 90 95 95 95 95 98 98 98 98 Modal Energy Ratio • Can look at truncation error using contour plot of modal energy ratio • At truncation order N=35 we retain 98% of energy up to 20kHz 17
  • 18. 0 100 250 500 1000 2000 4000 8000 16000 Frequency (Hz) 0 5 10 15 20 25 30 CentreofPower(J2) off-centre acoustic centring onset removal Acoustic Centring • [Richter et al 2014] showed that an acoustic centring can reduce the energy in higher modal orders • Translation of measurement points to estimated acoustic centre (including distance adjustment) • Here a frequency-independent offset of 9.75cm in y-axis was used 18
  • 19. 0 100 250 500 1000 2000 4000 8000 16000 Frequency (Hz) 0 5 10 15 20 25 30 CentreofPower(J2) off-centre acoustic centring onset removal Acoustic Centring • Similar reduction in centre of power is achieved with a conventional broadband onset extraction from the HRIRs • A parametric broadband time-of- arrival modal was used [Ziegelwanger et al 2014] • This has advantages in resynthesis: efficiency and ITD personalisation 19
  • 20. Acoustic Centring 20 500 1k 2k 5k 10k 20k Frequency (Hz) 0 5 10 15 20 25 30 35 Order(N) 50 50 50 50 90 90 90 90 95 95 95 95 98 98 98 98 500 1k 2k 5k 10k 20k Frequency (Hz) 0 5 10 15 20 25 30 35 Order(N) 50 50 90 90 90 95 95 95 95 98 98 98 98 98 98 Without onset removal With onset removal Change in modal energy ratio
  • 21. Listening Tests • Using 2702-point Lebedev grid for spherical harmonic analysis • Comparing synthesised HRTFs to measured HRTFs from Gauss set where no measurement available in Lebedev set • Three positions: front (2˚,0˚), rear-right (-100˚,0˚), up-left (30˚,30˚) • Nearest measurement distances: 1.44˚, 2.41˚, 1.18˚ respectively • Stimulus was a repeated pink noise burst
 (750ms burst, 20ms half-cosine fade-in/-out, 1s silence) 21
  • 22. Listening Test A • Two alternative forced choice test (2AFC) • Hypothesis: HRTFs synthesised at order N=35 (no onset removal) are inaudible from those measured at the target position. • Null hypothesis corresponds to group average detection rate of 65% • Design balanced type-I and type-II error levels, keeping them below 5% after accounting for repeated tests with Sidak correction [Leventhal 1986] 22
  • 23. Listening Test A • Critical number of 125 or more correct answers out of 216 • 12 assessors performed 18 repeats for each of the 3 conditions • Could repeat and freely switch between stimuli as often as needed 23
  • 24. (2° ,0° ) (-100° ,0° ) (30° ,30° ) Position 0 10 20 30 40 50 60 70 80 90 100 Percentagecorrectanswers 2 2 3 2 2 3 2 2 2 3 2 2 Listening Test A - Results 24 • Front (2˚,0˚): 118 of 216 correct • Rear-right (-100˚,0˚): 118 of 216 correct • Up-left (30˚,30˚): 115 of 216 correct
  • 25. Listening Test A - Discussion • These results suggest that the 2702-point Lebedev grid dataset can be used with spherical harmonic interpolation at order N=35 to obtain HRTFs that are indistinguishable from real measurements 25
  • 26. Listening Test B • How do the differences to a target measurement compare between relevant options? • Multiple stimulus test with hidden reference, 
 rating overall difference to the reference • Measured reference compared to SH synthesised HRTFs at orders 35 and 5, both with and without separate onset modelling • Nearest-neighbour measurement in 2702-point Lebedev grid also compared 26
  • 27. Listening Test B - Results • Frontal position: • N=5 with onsets left in was very different • N=5 with onsets processed separately was very close to reference but with perceivable differences • For all others no difference could be heard 27
  • 28. Listening Test B - Results • Rear-right position: • Both N=5 versions were very different to the reference, the case with separate onset processing was perceived as more different • Nearest-neighbour and N=35 with separate onset processing may have been perceived as different to the reference 28
  • 29. Listening Test B - Results • Up-left position: • Nearest-neighbour was very close to reference but with perceivable differences • N=5 with onsets left in was very different to the reference • N=5 with onsets processed separately was close to reference but with perceivable differences • For both N=35 cases no differences were heard 29
  • 30. Listening Test B - Discussion • Confirms that N=35 with onsets included cannot be distinguished from a real measurement • Nearest-neighbour selection from the Lebedev grid is also very close but sometimes audible • With separate onset processing, N=35 is not significantly different from the reference either • At N=5 differences are clear, but separate onset processing makes the differences a great deal smaller (except for the lateral position) 30
  • 31. Conclusions • Spherical harmonic interpolation allows generation of HRTF at arbitrary field points that appears indistinguishable from real measurements (for this dataset) • Order limit N=35 is feasible for real-time implementation
 (important for next experiment) • Separate onset processing improves performance at lower-orders (except at lateral positions) and allows ITD personalisation 31