This document describes validation experiments for a real-time virtual auditory system that dynamically updates head-related impulse responses based on head position. Acoustic waveforms were measured on a KEMAR manikin for static and dynamic sounds presented over loudspeakers or headphones. Waveforms matched well across conditions, indicating the system provides accurate binaural cues. A behavioral experiment also found subjects could accurately track dynamic sounds played virtually or over loudspeakers. The results validate that the system correctly updates HRTFs for dynamic presentation of sounds.
Advances in geophysical sensor data acquisitionIXSEA-DELPH
This presentation will explain what is different and new about the ECHOES products and introduce a new approach to Sub-Bottom Profiler (SBP) data acquisition and processing which has the potential to make a real difference to workflow.
The document provides information about integrating sound level meters and measuring sound levels. It discusses decibels, frequency and time weighting, sound pressure levels, equivalent sound levels, and the key features and specifications of the SLM 100 sound level meter, including its data logging and plotting capabilities.
The document discusses different types of audio filters including low-pass, high-pass, band-pass, and notch (band-stop) filters. It provides examples of each filter type using Csound code, including applying filters to synthesized tones and noise to filter out specific frequencies. The document also demonstrates dynamically changing the center frequency and bandwidth of band-pass filters over time.
The Sampling Theorem allows for audio signals to be reconstructed from evenly-spaced samples as long as the signal contains no frequencies higher than half the sampling rate. It is the basis for digital audio and allows audio to be recorded, stored, and transmitted digitally. Limitations like aliasing are addressed through techniques like using higher sample rates, anti-aliasing filters, and dithering noise. Oversampling is also used to lessen the need for dither and filters by sampling at a rate far above the Nyquist frequency.
DPCM and ADPCM are audio compression techniques that exploit the fact that differences between successive audio samples are typically smaller than the sample amplitudes. DPCM encodes differences between original and predicted samples while ADPCM varies the number of bits used based on difference size. Higher compression can be achieved through predictive coding and linear predictive coding, which analyze audio to determine perceptual features like pitch, period, and loudness for encoding. Perceptual coding considers how the human ear perceives sound by exploiting frequency and temporal masking effects.
The document provides information about in-plant training at Doordarshan. It discusses various components of the television production process including: [1] the TV studio with cameras, lighting, audio equipment and vision mixer; [2] the earth station that receives signals from satellites; and [3] TV transmitters and antennas that broadcast the signals. The document also summarizes the functions of different departments at Doordarshan like awareness, knowledge and entertainment and the hierarchy of production centers from national to regional to local levels.
The document summarizes studies performed on BGO crystals as a potential material for the barrel electromagnetic calorimeter of a particle physics experiment. Measurements were made of the energy resolution of a BGO crystal using a 137Cs radioactive source and varying the integration time of the readout electronics. With an integration time of 100 ns, an energy resolution of 2.3% at 50 MeV was achieved, corresponding to a light yield of at least 37 photons per MeV. Future tests planned include performing similar measurements with CsI crystals and studying the energy resolution using cosmic rays. BGO is considered a viable option for the calorimeter due to its radiation hardness, cost advantages over LYSO, and ability to achieve good energy resolution with integration
Advances in geophysical sensor data acquisitionIXSEA-DELPH
This presentation will explain what is different and new about the ECHOES products and introduce a new approach to Sub-Bottom Profiler (SBP) data acquisition and processing which has the potential to make a real difference to workflow.
The document provides information about integrating sound level meters and measuring sound levels. It discusses decibels, frequency and time weighting, sound pressure levels, equivalent sound levels, and the key features and specifications of the SLM 100 sound level meter, including its data logging and plotting capabilities.
The document discusses different types of audio filters including low-pass, high-pass, band-pass, and notch (band-stop) filters. It provides examples of each filter type using Csound code, including applying filters to synthesized tones and noise to filter out specific frequencies. The document also demonstrates dynamically changing the center frequency and bandwidth of band-pass filters over time.
The Sampling Theorem allows for audio signals to be reconstructed from evenly-spaced samples as long as the signal contains no frequencies higher than half the sampling rate. It is the basis for digital audio and allows audio to be recorded, stored, and transmitted digitally. Limitations like aliasing are addressed through techniques like using higher sample rates, anti-aliasing filters, and dithering noise. Oversampling is also used to lessen the need for dither and filters by sampling at a rate far above the Nyquist frequency.
DPCM and ADPCM are audio compression techniques that exploit the fact that differences between successive audio samples are typically smaller than the sample amplitudes. DPCM encodes differences between original and predicted samples while ADPCM varies the number of bits used based on difference size. Higher compression can be achieved through predictive coding and linear predictive coding, which analyze audio to determine perceptual features like pitch, period, and loudness for encoding. Perceptual coding considers how the human ear perceives sound by exploiting frequency and temporal masking effects.
The document provides information about in-plant training at Doordarshan. It discusses various components of the television production process including: [1] the TV studio with cameras, lighting, audio equipment and vision mixer; [2] the earth station that receives signals from satellites; and [3] TV transmitters and antennas that broadcast the signals. The document also summarizes the functions of different departments at Doordarshan like awareness, knowledge and entertainment and the hierarchy of production centers from national to regional to local levels.
The document summarizes studies performed on BGO crystals as a potential material for the barrel electromagnetic calorimeter of a particle physics experiment. Measurements were made of the energy resolution of a BGO crystal using a 137Cs radioactive source and varying the integration time of the readout electronics. With an integration time of 100 ns, an energy resolution of 2.3% at 50 MeV was achieved, corresponding to a light yield of at least 37 photons per MeV. Future tests planned include performing similar measurements with CsI crystals and studying the energy resolution using cosmic rays. BGO is considered a viable option for the calorimeter due to its radiation hardness, cost advantages over LYSO, and ability to achieve good energy resolution with integration
Defense - Sound space rendering based on the virtual Sound space rendering ba...JunjieShi3
This document summarizes a thesis on developing and implementing an auditory display technique called ADVISE (Auditory Display based on the Virtual Sphere Model).
The document first reviews ADVISE and how it can render spatial audio for both virtual and real scenes using techniques like higher-order ambisonics and binaural rendering. It then reviews the adaptive rectangular decomposition (ARD) method for computational room acoustics simulation.
The document outlines the objective of improving ADVISE by developing a complete room acoustics program using ARD and integrating it into ADVISE. It describes implementing ADVISE to demonstrate spatial audio rendering for a sample music hall scene in Unity. Frequency limitations and computational costs of the
Feasibility of EEG Super-Resolution Using Deep Convolutional NetworksSangjun Han
This document summarizes a master's thesis presentation on using deep convolutional networks for EEG spatial super-resolution. The study used simulated EEG data to test how different noise types and upscaling ratios affect the super-resolution process. Key findings include that super-resolution recovered low-resolution signals beyond the level of high-resolution signals for white noise, but only to the level of high-resolution signals for real noise. Higher upscaling ratios yielded better quality signals for white noise. Whitening real noise helped super-resolution, especially for source analysis at low SNR. The study used simulations to isolate the effects of noise types since real EEG noise sources cannot be extracted.
The document provides information about EEG basics. It discusses:
1. EEG records spontaneous electrical activity generated by cortical neurons. The EEG machine has multiple channels that simultaneously record signals from electrode pairs.
2. EEG machines have three main components - the sensor layer with electrodes, the acquisition layer consisting of amplifiers, and the connectivity layer. Standard electrode placement uses the 10-20 system.
3. EEG signals are amplified, filtered, digitized and can be analyzed to detect abnormal electrical discharges that may indicate conditions like epilepsy. Features like amplitude, frequency, and symmetry are evaluated.
Recording Distortion Product Otoacoustic Emissions using the Adaptive Noise C...RicardoVallejo30
Assessing the effectiveness of the Path Medical QSCREEN ® Adaptive Noise Cancellation Technology on various DPOAE test parameters (overall test time, overall pass/refer, test time and pass/refer rate per frequency).
1) Equalizer matching involves finding the power spectrum of an example audio, then multiplying the input audio's magnitude spectrogram by a filter matching the example's power spectrum.
2) Noise matching involves denoising the input and example separately, then recombining their clean and noise components using the original signal-to-noise ratio.
3) Reverberation matching uses convolutive non-negative matrix factorization to decompose the input into a dry sound and reverb kernel, and convolve the estimated dry input with the example's reverb kernel.
This document discusses electroencephalography (EEG) and evoked potentials. It begins by describing EEG as a method for recording electrical brain activity using electrodes placed on the scalp. It then discusses how EEG is used to diagnose neurological conditions like epilepsy and brain tumors. The document outlines different brain wave patterns observed in EEG like alpha, beta, theta, and delta waves. It also discusses how EEG is performed and interpreted, potential artifacts, and different electrode montages. Finally, it describes evoked potentials as the electrical response of the brain to sensory stimulation, and summarizes different types of evoked potentials including visual, somatosensory, and auditory potentials.
ECHOES & DELPH Seismic - Advances in geophysical sensor data acquisitionIXSEA-DELPH
This presentation will explain what is different and new about the ECHOES products and introduce a new approach to Sub-Bottom Profiler (SBP) data acquisition and processing which has the potential to make a real difference to workflow.
Audiometry class by Dr. Kavitha Ashok Kumar MSU MalaysiaKavitha Ashokb
1. Pure tone audiometry is an objective test that measures air and bone conduction thresholds to evaluate the type and severity of hearing loss. It is helpful for documentation and diagnosis.
2. Impedance audiometry objectively measures middle ear function through tympanometry and acoustic reflex testing. It can detect middle ear pathologies and is a fast screening test.
3. Otoacoustic emissions are sounds originating from the cochlea that can help diagnose cochlear hearing loss through an objective, noninvasive test done in both children and adults.
Hybrid Reverberator Using Multiple Impulse Responses for Audio Rendering Impr...a3labdsp
The document proposes an algorithm for audio rendering using multiple impulse responses that allows for reproduction of a moving listener position. It analyzes impulse response tails to generate a prototype tail and uses a hybrid reverberation structure including FIR and IIR filters to synthesize the reverberation effect in real-time. Experimental results on a church impulse response database show the approach can accurately reproduce reverberation time and clarity measurements compared to real impulse responses. Informal listening tests found no perceptible differences between the proposed approach and an existing technique.
This document discusses preparing and analyzing EEG data. It covers the history of EEG, how to record EEG data by applying electrodes to standardized scalp locations and synchronizing recordings with stimulus presentations. It also discusses preprocessing steps like downsampling, filtering, re-referencing, artifact removal through ICA and visual inspection, and epoching data to create ERPs. Examples of ERP components like the N170 face processing component and error-related negativity are also mentioned.
This document outlines the topics that will be covered in an ESP 179 noise fundamentals lecture, including:
1) A recap of noise fundamentals such as decibel scales, human noise perception, and noise level descriptors.
2) A discussion of noise laws, regulations, standards, both subjective and objective approaches.
3) An overview of environmental noise studies and the CEQA noise assessment process.
4) Details on noise study methodologies, thresholds for determining noise impacts, and mitigation strategies.
5) Examples of case studies involving development projects, highway projects, and bridge/pile driving construction.
The document discusses spatial hearing and head-related transfer functions (HRTFs) for virtual audio. It covers measuring HRTFs using a KEMAR manikin, constructing filters based on measured HRTFs to localize sound, issues with non-individualized HRTFs, synthetic HRTF approaches, and techniques for externalization like reverberation and decorrelation. Applications mentioned include immersive environments, hearing aids, and representational sounds.
The document discusses spatial audio and sound localization techniques. It describes how head-related transfer functions (HRTFs) capture the spatial cues the human auditory system uses to localize sound sources. HRTFs are measured impulse responses between sound sources and the ears. Convolving a sound source with HRTFs for that source's location produces binaural audio. Ambisonics uses spherical harmonics to represent soundfields with a small number of channels and can render spatial audio in virtual reality.
Geometric distortion of functional MR images especially of DWI hinders the visualization of the frontal & temporal regions. This is a simple & reliable approach to get rid of. This needs no special software or hardware & applies to old systems.
The Steadfast and Reliable Bull: Taurus Zodiac Signmy Pandit
Explore the steadfast and reliable nature of the Taurus Zodiac Sign. Discover the personality traits, key dates, and horoscope insights that define the determined and practical Taurus, and learn how their grounded nature makes them the anchor of the zodiac.
Storytelling is an incredibly valuable tool to share data and information. To get the most impact from stories there are a number of key ingredients. These are based on science and human nature. Using these elements in a story you can deliver information impactfully, ensure action and drive change.
Defense - Sound space rendering based on the virtual Sound space rendering ba...JunjieShi3
This document summarizes a thesis on developing and implementing an auditory display technique called ADVISE (Auditory Display based on the Virtual Sphere Model).
The document first reviews ADVISE and how it can render spatial audio for both virtual and real scenes using techniques like higher-order ambisonics and binaural rendering. It then reviews the adaptive rectangular decomposition (ARD) method for computational room acoustics simulation.
The document outlines the objective of improving ADVISE by developing a complete room acoustics program using ARD and integrating it into ADVISE. It describes implementing ADVISE to demonstrate spatial audio rendering for a sample music hall scene in Unity. Frequency limitations and computational costs of the
Feasibility of EEG Super-Resolution Using Deep Convolutional NetworksSangjun Han
This document summarizes a master's thesis presentation on using deep convolutional networks for EEG spatial super-resolution. The study used simulated EEG data to test how different noise types and upscaling ratios affect the super-resolution process. Key findings include that super-resolution recovered low-resolution signals beyond the level of high-resolution signals for white noise, but only to the level of high-resolution signals for real noise. Higher upscaling ratios yielded better quality signals for white noise. Whitening real noise helped super-resolution, especially for source analysis at low SNR. The study used simulations to isolate the effects of noise types since real EEG noise sources cannot be extracted.
The document provides information about EEG basics. It discusses:
1. EEG records spontaneous electrical activity generated by cortical neurons. The EEG machine has multiple channels that simultaneously record signals from electrode pairs.
2. EEG machines have three main components - the sensor layer with electrodes, the acquisition layer consisting of amplifiers, and the connectivity layer. Standard electrode placement uses the 10-20 system.
3. EEG signals are amplified, filtered, digitized and can be analyzed to detect abnormal electrical discharges that may indicate conditions like epilepsy. Features like amplitude, frequency, and symmetry are evaluated.
Recording Distortion Product Otoacoustic Emissions using the Adaptive Noise C...RicardoVallejo30
Assessing the effectiveness of the Path Medical QSCREEN ® Adaptive Noise Cancellation Technology on various DPOAE test parameters (overall test time, overall pass/refer, test time and pass/refer rate per frequency).
1) Equalizer matching involves finding the power spectrum of an example audio, then multiplying the input audio's magnitude spectrogram by a filter matching the example's power spectrum.
2) Noise matching involves denoising the input and example separately, then recombining their clean and noise components using the original signal-to-noise ratio.
3) Reverberation matching uses convolutive non-negative matrix factorization to decompose the input into a dry sound and reverb kernel, and convolve the estimated dry input with the example's reverb kernel.
This document discusses electroencephalography (EEG) and evoked potentials. It begins by describing EEG as a method for recording electrical brain activity using electrodes placed on the scalp. It then discusses how EEG is used to diagnose neurological conditions like epilepsy and brain tumors. The document outlines different brain wave patterns observed in EEG like alpha, beta, theta, and delta waves. It also discusses how EEG is performed and interpreted, potential artifacts, and different electrode montages. Finally, it describes evoked potentials as the electrical response of the brain to sensory stimulation, and summarizes different types of evoked potentials including visual, somatosensory, and auditory potentials.
ECHOES & DELPH Seismic - Advances in geophysical sensor data acquisitionIXSEA-DELPH
This presentation will explain what is different and new about the ECHOES products and introduce a new approach to Sub-Bottom Profiler (SBP) data acquisition and processing which has the potential to make a real difference to workflow.
Audiometry class by Dr. Kavitha Ashok Kumar MSU MalaysiaKavitha Ashokb
1. Pure tone audiometry is an objective test that measures air and bone conduction thresholds to evaluate the type and severity of hearing loss. It is helpful for documentation and diagnosis.
2. Impedance audiometry objectively measures middle ear function through tympanometry and acoustic reflex testing. It can detect middle ear pathologies and is a fast screening test.
3. Otoacoustic emissions are sounds originating from the cochlea that can help diagnose cochlear hearing loss through an objective, noninvasive test done in both children and adults.
Hybrid Reverberator Using Multiple Impulse Responses for Audio Rendering Impr...a3labdsp
The document proposes an algorithm for audio rendering using multiple impulse responses that allows for reproduction of a moving listener position. It analyzes impulse response tails to generate a prototype tail and uses a hybrid reverberation structure including FIR and IIR filters to synthesize the reverberation effect in real-time. Experimental results on a church impulse response database show the approach can accurately reproduce reverberation time and clarity measurements compared to real impulse responses. Informal listening tests found no perceptible differences between the proposed approach and an existing technique.
This document discusses preparing and analyzing EEG data. It covers the history of EEG, how to record EEG data by applying electrodes to standardized scalp locations and synchronizing recordings with stimulus presentations. It also discusses preprocessing steps like downsampling, filtering, re-referencing, artifact removal through ICA and visual inspection, and epoching data to create ERPs. Examples of ERP components like the N170 face processing component and error-related negativity are also mentioned.
This document outlines the topics that will be covered in an ESP 179 noise fundamentals lecture, including:
1) A recap of noise fundamentals such as decibel scales, human noise perception, and noise level descriptors.
2) A discussion of noise laws, regulations, standards, both subjective and objective approaches.
3) An overview of environmental noise studies and the CEQA noise assessment process.
4) Details on noise study methodologies, thresholds for determining noise impacts, and mitigation strategies.
5) Examples of case studies involving development projects, highway projects, and bridge/pile driving construction.
The document discusses spatial hearing and head-related transfer functions (HRTFs) for virtual audio. It covers measuring HRTFs using a KEMAR manikin, constructing filters based on measured HRTFs to localize sound, issues with non-individualized HRTFs, synthetic HRTF approaches, and techniques for externalization like reverberation and decorrelation. Applications mentioned include immersive environments, hearing aids, and representational sounds.
The document discusses spatial audio and sound localization techniques. It describes how head-related transfer functions (HRTFs) capture the spatial cues the human auditory system uses to localize sound sources. HRTFs are measured impulse responses between sound sources and the ears. Convolving a sound source with HRTFs for that source's location produces binaural audio. Ambisonics uses spherical harmonics to represent soundfields with a small number of channels and can render spatial audio in virtual reality.
Geometric distortion of functional MR images especially of DWI hinders the visualization of the frontal & temporal regions. This is a simple & reliable approach to get rid of. This needs no special software or hardware & applies to old systems.
The Steadfast and Reliable Bull: Taurus Zodiac Signmy Pandit
Explore the steadfast and reliable nature of the Taurus Zodiac Sign. Discover the personality traits, key dates, and horoscope insights that define the determined and practical Taurus, and learn how their grounded nature makes them the anchor of the zodiac.
Storytelling is an incredibly valuable tool to share data and information. To get the most impact from stories there are a number of key ingredients. These are based on science and human nature. Using these elements in a story you can deliver information impactfully, ensure action and drive change.
Discover timeless style with the 2022 Vintage Roman Numerals Men's Ring. Crafted from premium stainless steel, this 6mm wide ring embodies elegance and durability. Perfect as a gift, it seamlessly blends classic Roman numeral detailing with modern sophistication, making it an ideal accessory for any occasion.
https://rb.gy/usj1a2
The Genesis of BriansClub.cm Famous Dark WEb PlatformSabaaSudozai
BriansClub.cm, a famous platform on the dark web, has become one of the most infamous carding marketplaces, specializing in the sale of stolen credit card data.
Industrial Tech SW: Category Renewal and CreationChristian Dahlen
Every industrial revolution has created a new set of categories and a new set of players.
Multiple new technologies have emerged, but Samsara and C3.ai are only two companies which have gone public so far.
Manufacturing startups constitute the largest pipeline share of unicorns and IPO candidates in the SF Bay Area, and software startups dominate in Germany.
4 Benefits of Partnering with an OnlyFans Agency for Content Creators.pdfonlyfansmanagedau
In the competitive world of content creation, standing out and maximising revenue on platforms like OnlyFans can be challenging. This is where partnering with an OnlyFans agency can make a significant difference. Here are five key benefits for content creators considering this option:
The Most Inspiring Entrepreneurs to Follow in 2024.pdfthesiliconleaders
In a world where the potential of youth innovation remains vastly untouched, there emerges a guiding light in the form of Norm Goldstein, the Founder and CEO of EduNetwork Partners. His dedication to this cause has earned him recognition as a Congressional Leadership Award recipient.
[To download this presentation, visit:
https://www.oeconsulting.com.sg/training-presentations]
This presentation is a curated compilation of PowerPoint diagrams and templates designed to illustrate 20 different digital transformation frameworks and models. These frameworks are based on recent industry trends and best practices, ensuring that the content remains relevant and up-to-date.
Key highlights include Microsoft's Digital Transformation Framework, which focuses on driving innovation and efficiency, and McKinsey's Ten Guiding Principles, which provide strategic insights for successful digital transformation. Additionally, Forrester's framework emphasizes enhancing customer experiences and modernizing IT infrastructure, while IDC's MaturityScape helps assess and develop organizational digital maturity. MIT's framework explores cutting-edge strategies for achieving digital success.
These materials are perfect for enhancing your business or classroom presentations, offering visual aids to supplement your insights. Please note that while comprehensive, these slides are intended as supplementary resources and may not be complete for standalone instructional purposes.
Frameworks/Models included:
Microsoft’s Digital Transformation Framework
McKinsey’s Ten Guiding Principles of Digital Transformation
Forrester’s Digital Transformation Framework
IDC’s Digital Transformation MaturityScape
MIT’s Digital Transformation Framework
Gartner’s Digital Transformation Framework
Accenture’s Digital Strategy & Enterprise Frameworks
Deloitte’s Digital Industrial Transformation Framework
Capgemini’s Digital Transformation Framework
PwC’s Digital Transformation Framework
Cisco’s Digital Transformation Framework
Cognizant’s Digital Transformation Framework
DXC Technology’s Digital Transformation Framework
The BCG Strategy Palette
McKinsey’s Digital Transformation Framework
Digital Transformation Compass
Four Levels of Digital Maturity
Design Thinking Framework
Business Model Canvas
Customer Journey Map
Call8328958814 satta matka Kalyan result satta guessing➑➌➋➑➒➎➑➑➊➍
Satta Matka Kalyan Main Mumbai Fastest Results
Satta Matka ❋ Sattamatka ❋ New Mumbai Ratan Satta Matka ❋ Fast Matka ❋ Milan Market ❋ Kalyan Matka Results ❋ Satta Game ❋ Matka Game ❋ Satta Matka ❋ Kalyan Satta Matka ❋ Mumbai Main ❋ Online Matka Results ❋ Satta Matka Tips ❋ Milan Chart ❋ Satta Matka Boss❋ New Star Day ❋ Satta King ❋ Live Satta Matka Results ❋ Satta Matka Company ❋ Indian Matka ❋ Satta Matka 143❋ Kalyan Night Matka..
Cover Story - China's Investment Leader - Dr. Alyce SUmsthrill
In World Expo 2010 Shanghai – the most visited Expo in the World History
https://www.britannica.com/event/Expo-Shanghai-2010
China’s official organizer of the Expo, CCPIT (China Council for the Promotion of International Trade https://en.ccpit.org/) has chosen Dr. Alyce Su as the Cover Person with Cover Story, in the Expo’s official magazine distributed throughout the Expo, showcasing China’s New Generation of Leaders to the World.
NIMA2024 | De toegevoegde waarde van DEI en ESG in campagnes | Nathalie Lam |...BBPMedia1
Nathalie zal delen hoe DEI en ESG een fundamentele rol kunnen spelen in je merkstrategie en je de juiste aansluiting kan creëren met je doelgroep. Door middel van voorbeelden en simpele handvatten toont ze hoe dit in jouw organisatie toegepast kan worden.
HR search is critical to a company's success because it ensures the correct people are in place. HR search integrates workforce capabilities with company goals by painstakingly identifying, screening, and employing qualified candidates, supporting innovation, productivity, and growth. Efficient talent acquisition improves teamwork while encouraging collaboration. Also, it reduces turnover, saves money, and ensures consistency. Furthermore, HR search discovers and develops leadership potential, resulting in a strong pipeline of future leaders. Finally, this strategic approach to recruitment enables businesses to respond to market changes, beat competitors, and achieve long-term success.
Anny Serafina Love - Letter of Recommendation by Kellen Harkins, MS.AnnySerafinaLove
This letter, written by Kellen Harkins, Course Director at Full Sail University, commends Anny Love's exemplary performance in the Video Sharing Platforms class. It highlights her dedication, willingness to challenge herself, and exceptional skills in production, editing, and marketing across various video platforms like YouTube, TikTok, and Instagram.
Digital Marketing with a Focus on Sustainabilitysssourabhsharma
Digital Marketing best practices including influencer marketing, content creators, and omnichannel marketing for Sustainable Brands at the Sustainable Cosmetics Summit 2024 in New York
3. Motivation
To validate a real-time system that updates head-related
impulse responses
Goal is to show that the acoustic waveforms measured
on KEMAR match between real and virtual presentations
Applications:
Explore the effects on sound localization with the presentation
of dynamic sound
4. Introduction: What is Real/Virtual Audio?
Real Audio consists of presenting sounds over
loudspeakers
Virtual Audio consists of presenting acoustic waveforms
over headphones.
Advantages
Cost-effective
Portable
Doesn’t depend on room effects
Disadvantages
Unrealistic
5. Introduction: Sound Localization
Interaural Time Difference – ITD – Differences between
sound arrival times at the two ears
Predominant cue in the low frequencies < 2kHz
Interaural Level Difference – ILD – Differences between
sound levels in the two ears
Predominant cue in the higher frequencies due to head
shadowing ~> 2kHz
Encoded in Head-Related Transfer Function (HRTF)
ILD in Magnitude
ITD in Phase
6. Background of RTVAS System
Developed by Jacob Scarpaci (2006)
Uses a Real-Time Kernel in Linux to update HRTF filters
Key to system is that the HRTF being convolved
with input signal is the difference between where
the sound should be and where the subject’s
head position is.
7. Project Motivation/Aims
Goal is to validate that the Real-Time Virtual Auditory
System, developed by Jacob Scarpaci (2006), correctly
updates HRTFs in accordance with head location relative
to sound location.
Approach to validation:
Compare acoustic waveforms measured on KEMAR when
presented with sound over headphones to those presented
over loudspeakers.
Mathematical, signals approach
Perform a behavioral task where subjects are to track dynamic
sound played over headphones or loudspeakers.
Perceptual approach
8. Methods: Real Presentation - Panning
Loudspeaker setup to create a virtual speaker
Nonlinear (Leakey, 1959)
(shown as dashed outline) by interpolation
CH 1 = 1/ 2 (sin(! ) / 2 sin(! pos ))
between two symmetrically located speakers
about 0 degrees azimuth. CH 2 = 1/ 2 + (sin(! ) / 2 sin(! pos ))
9. HRTF Measurement
Empirical KEMAR
17th order MLS used to measure HRTF at every degree from -90 to 90 degrees.
All measurements were windowed to 226 coefficients using a modified
Hanning window to remove reverberations.
Minimum-Phase plus Linear Phase Interpolation
Interpolated from every 5 degree empirical measurements.
Magnitude function was derived using a linear weighted average of the log
magnitude functions from the empirical measurements.
Minimum Phase function was derived from the magnitude function.
Linear Phase component was added corresponding to the ITD calculated for
that position.
10. Acoustic Waveform Comparison: Static
Sound/Static Head Methods
Presented either a speech waveform or noise waveform at three
different static locations: 5, 23, and -23 degrees
During the free-field presentation the positions were created by
using the panning technique (outlined previously) from speakers.
Used 4 different KEMAR HRTF sets in the virtual presentation
Empirical, Min-Phase Interp., Empirical Headphone TF, Min-Phase
Headphone TF
Recorded sounds on KEMAR with microphones located at the
position corresponding to the human eardrum.
11. Static Sound/Static Head: Analysis
Correlated the waveforms recorded over loudspeakers
with the waveforms recorded over headphones for a
given set of HRTFs.
Correlated time, magnitude, and phase functions
Allowed for a maximum delay of 4ms in time to allow for
transmission delays
Broke signals into third-octave bands with the following
center frequencies:
[200 250 315 400 500 630 800 1000 1250 1600 2000 2500 3150 4000
5000 6300 8000 10000]
Correlated time, magnitude, and phase within each band and calculated
the delay(lag) needed to impose on one signal to achieve maximum
correlation.
Looked at differences in binaural cues within each band
18. Dynamic Sound/Static Head: Methods
Presented a speech or a noise waveform either over
loudspeakers or headphones using panning or convolution
algorithm
Sound was presented from 0 to 30 degrees
Used same 4 HRTF sets
25. Static Sound/Dynamic Head: Methods
Speech or noise waveform was presented over
loudspeakers or headphone at a fixed position, 30
degrees.
4 HRTF sets were used
KEMAR was moved from 30 degrees to 0 degree position
while sound was presented.
Head position was monitored using Intersense® IS900
VWT head tracker.
26. Static Sound/Dynamic Head: Analysis
Similar data analysis was performed in this case as in the
previous two cases.
Only tracks that followed the same trajectory were
correlated.
Acceptance Criteria was less than 1 or 1.5 degree difference
between the tracks.
31. Difference in ITDs from Free-Field and
Headphones for Static Noise/Dynamic Head
32. Difference in ILDs from Free-Field and
Headphones for Static Noise/Dynamic Head.
33. Waveform Comparison Discussion
Interaural cues match up very well across the different
conditions as well as between loudspeakers and
headphones.
Result from higher correlations in the magnitude and phase
functions.
Differences (correlation) in waveforms may not matter
perceptually if receiving same binaural cues.
Output algorithm in the RTVAS seems to present correct
directional oriented sounds as well as correctly adjusting
to head movement.
34. Psychophysical Experiment: Details
6 Normal Hearing Subjects
4 Male, 2 Female
Sound was presented over headphones or loudspeakers
Task was to track, using their head, a moving sound
source.
HRTFs tested were, Empirical KEMAR, Minimum-Phase
KEMAR, Individual (Interpolated using Minimum-Phase)
35. Psychophysical Experiment: Details cont.
Sound Details
White noise
Frequency content was 200Hz to 10kHz
Presented at 65dB SPL
5 second in duration
Track Details
15(sin((2pi/5)t)+ sin((2pi/2)t*rand))
36. Psychophysical Experiment: Virtual Setup
Head Movement Training – Subjects just moved head (no sound)
5 repetitions where subjects’ task was to put the square (representing
head) in another box.
Also centers head.
Training – All using Empirical KEMAR
10 trials where subject was shown, via plot, the path of the sound before
it played.
10 trials where the same track as before was presented but no visual
cue was available.
10 trials where subject was shown, via plot, the path but path was
random from trial to trial.
10 trials where tracks were random and no visualization.
37. Psychophysical Experiment: Setup cont.
Experiment (Headphones)
10 trials using Empirical KEMAR HRTFs
10 trials using Minimum-Phase KEMAR HRTFs
10 trials using Individual HRTFs
Repeated 3 times
Loudspeaker Training
Same as headphones but trials were reduced to 5.
Loudspeaker Experiment
30 trials repeated only once
Subjects were instructed to press a button as soon as they
heard the sound. This started the head tracking.
46. Psychophysical Experiment: Discussion
Coherence
The coherence or correlation measure is statistically insignificant in
the empirical and minimum phase interpolation case from that over
loudspeakers.
Coherence of individual HRTFs was surprisingly worse.
Coherence also stays strong as the complexity of the track varies.
Latency
Individual HRTFs show a more variability in latency.
Might be able to track changes quicker using their own HRTFs
Loudspeaker latency is negative which means that subjects are
predicting the path.
Could be because subjects are predicting the path since sound always go
to the right first as well as a result from the delay in pressing the button
47. Psychophysical Experiment: Discussion
Cont.
RMS
No significant difference in total RMS error as well as RMS
undershoot error between Empirical and Minimum-Phase
HRTFs from loudspeakers.
Subjects generally undershoot the path of the sound.
Could be a motor problem, i.e. laziness, as well as perception.
48. Overall Conclusions
Coherence of acoustic recordings may not be the best
measure for validation
Reverberation or panning techniques
If perception is the only thing that matters, than have to
conclude that algorithm works
49. Future Work
Look at different methods for presenting dynamic sound
over loudspeakers.
Try different room environments.
Closer look at differences between headphones
Particularly looking at open canal tube-phones to see if
subjects could distinguish between real and virtual sources.
Various psychophysical experiments that involve dynamic
sound (speech, masking)
Sound localization
Source separation
50. Acknowledgements
Other
Committee
Dave Freedman
Steven Colburn
Jake Scarpaci
Barb Shinn-Cunningham
Nathaniel Durlach My Subjects
Binaural Gang All in Attendance
Todd Jennings
Le Wang
Tim Streeter
Varun Parmar
Akshay Navaladi
Antje Ihlefeld
53. Methods: Real Presentation Continued
Input stimulus was a 17th
Title: Speaker
Presentation
order mls sequence sampled
Source Created
at 50kHz.
Speaker Position
-5
Corresponds to a duration of
10 0
~2.6sec
5
-10
15 0
Waveforms were recorded
10
-20
on KEMAR (Knowles
-10
Electronic Manikin for
30 0
10
Acoustic Research)
20
-40
-30
-10
45 0
10
30
40
54. Results: Real Presentation
• HRTFs measured when sound was presented over loudspeakers using the
linear and nonlinear interpolation functions
Linear Nonlinear
55. Results: Correlation Coefficients at all Spatial Locations for
Interpolated Sound over Loudspeakers
Correlation
Title: Correlation Coefficients
Linear Function Non-linear Function
Speaker Virtual
Left Right Left Right
Location Position
between a
-40 0.98799 0.9758 0.98655 0.97769
-30 0.97427 0.96611 0.97534 0.96777
virtual point
-10 0.96842 0.94612 0.96858 0.9466
45 0 0.95736 0.91602 0.95693 0.91709
10 0.96374 0.95282 0.96384 0.95276
source and a
30 0.97532 0.97095 0.97644 0.97084
40 0.98397 0.98194 0.98268 0.98177
real source
-20 0.98372 0.97316 0.98385 0.97357
-10 0.98054 0.9564 0.98054 0.95649
30 0 0.97184 0.93755 0.97171 0.93774
10 0.97151 0.96414 0.97147 0.96448
20 0.97844 0.97768 0.97883 0.97762
-10 0.993 0.97775 0.99301 0.97787
15 0 0.97821 0.95517 0.97817 0.95503
10 0.98406 0.98576 0.98412 0.98572
-5 0.99326 0.97585 0.99328 0.97601
10 0 0.98927 0.96086 0.98924 0.96077
5 0.99319 0.98977 0.99312 0.98977
Very strong correlation, generally, for all spatial locations
Weaker correlation as speakers become more spatially separated
Weakest correlation when created sound is furthest from both speakers (0
degrees)
56. Spatial Separation of Loudspeakers
Correlation coefficients
for a virtually created
sound source at -10
degrees at various
spatial separations of the
loudspeakers
Correlation declines as the loudspeakers become more spatially separated
57. Example of Psuedo-Anechoic HRTFs
• Correlation coefficients are slightly better when reverberations are taken out of the impulse
responses
• Linear Reverberant: 0.98054, 0.9564 (Left, Right Ears)
• Linear Psuedo-Anechoic: 0.98545, 0.96019 (Left, Right Ears)
• Nonlinear Reverberant: 0.98054, 0.95649 (Left, Right Ears)
• Nonlinear Pseudo-Anechoic: 0.9855, 0.96007 (Left, Right Ears)
58. Correlation Coefficients at all Spatial Locations for Interpolated
Sound over Loudspeakers (Pseudo-Anechoic)
Table 3. Correlation Coefficients for Psuedo-Anechoic HRTFs
Linear Function Non-linear Function
Speaker Virtual
Left Right Left Right
Location Position
-40 0.96567 0.99168 0.96416 0.98421
-30 0.96223 0.95356 0.96138 0.95815
-10 0.96348 0.93433 0.96299 0.93902
45 0 0.95471 0.89491 0.95436 0.89968
10 0.95856 0.93652 0.95913 0.93953
30 0.97678 0.945 0.97825 0.94013
40 0.99563 0.9814 0.99 0.98018
-20 0.98762 0.97555 0.98767 0.97663
-10 0.98545 0.96019 0.9855 0.96007
30 0 0.97281 0.93616 0.97284 0.93623
10 0.97927 0.96945 0.97912 0.96968
20 0.97904 0.98188 0.97846 0.98183
-10 0.99608 0.98114 0.99592 0.98167
15 0 0.97891 0.95475 0.9788 0.95461
10 0.9928 0.98922 0.99287 0.9892
-5 0.99738 0.98141 0.99736 0.98162
10 0 0.99329 0.96323 0.99333 0.9632
5 0.99731 0.9946 0.99736 0.99462
Correlations generally are better when reverberant energy is taken out of the impulse
responses.