Object-based audio improves spatial accuracy by treating sounds as independent audio objects that can be precisely positioned in 3D space, rather than baking them into fixed speaker channels. Key benefits: maximum spatial precision, the ability to leverage hardware capabilities through object rendering, and authoring once while sounds adapt to any output configuration.
5. HRTF (Head-Related Transfer Function)
HRTF models how a given sound is filtered by the diffraction and reflection properties of the head, pinna, and torso before it reaches the eardrum and inner ear.
It varies significantly from person to person.
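At its core, HRTF rendering is convolution of a sound with a per-ear impulse response. The sketch below is a minimal, dependency-free illustration with toy impulse responses; real HRIRs come from measured datasets, and the function names here are illustrative, not any engine's API.

```python
def convolve(signal, kernel):
    """Direct-form convolution of two sample lists."""
    out = [0.0] * (len(signal) + len(kernel) - 1)
    for i, s in enumerate(signal):
        for j, k in enumerate(kernel):
            out[i + j] += s * k
    return out

def binauralize(mono, hrir_left, hrir_right):
    """Render a mono buffer to binaural stereo by convolving it with a
    head-related impulse response (HRIR) pair for one source direction."""
    return convolve(mono, hrir_left), convolve(mono, hrir_right)

# Toy HRIRs: the right ear hears the sound later and quieter, crudely
# mimicking a source off to the listener's left.
hrir_l = [1.0, 0.2]
hrir_r = [0.0, 0.0, 0.6, 0.1]

left, right = binauralize([1.0, 0.0, 0.0, 0.0], hrir_l, hrir_r)
```

A real renderer would pick (or interpolate) an HRIR pair per source direction and update it as the source moves.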
16. What is a System Audio Object?
An audio buffer accompanied by metadata: position, distance, azimuth, elevation, focus, spread.
!!! NOT to be confused with Game Objects.
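Conceptually, a System Audio Object is just a buffer plus metadata. The sketch below is a hypothetical Python model (the field names are illustrative, not a real Wwise or platform API); it also shows how azimuth, elevation, and distance can be derived from a 3D position.

```python
import math
from dataclasses import dataclass

@dataclass
class AudioObject:
    """An audio buffer accompanied by positioning metadata (illustrative)."""
    buffer: list            # mono PCM samples
    position: tuple         # (x, y, z), listener-relative; y = forward
    spread: float = 0.0     # 0 = point source, 1 = fully diffuse
    focus: float = 1.0

    def direction(self):
        """Derive azimuth and elevation (degrees) and distance from position."""
        x, y, z = self.position
        dist = math.sqrt(x * x + y * y + z * z)
        azimuth = math.degrees(math.atan2(x, y))   # 0 deg = straight ahead
        elevation = math.degrees(math.asin(z / dist)) if dist else 0.0
        return azimuth, elevation, dist

obj = AudioObject(buffer=[0.0] * 512, position=(1.0, 0.0, 0.0))
az, el, dist = obj.direction()
```

The endpoint's renderer consumes this metadata when positioning the buffer in 3D space.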
21. Platform Channel Mix and System Audio Object Limits

Format                          | Max Static Objects (Channel Bed)* | Dynamic: PS5 | Dynamic: Xbox Series X|S, UWP apps & >=2303 GDK | Dynamic: Xbox Series X|S, XDK & <2303 GDK | Dynamic: Xbox One
PS5 3D Audio                    | 7.1 / Ambisonic 5th Order*        | 128          | -    | -   | -
PS5 Non-3D Audio                | 7.1 / Headphones                  | N/A          | -    | -   | -
Windows Sonic (Headphones)      | 17 (8.1.4.4)*                     | -            | 220  | 20  | 15
Dolby Atmos (Headphones)        | 17 (8.1.4.4)*                     | -            | 128  | 20  | 16
DTS Headphone:X (Headphones)    | 17 (8.1.4.4)*                     | -            | 200  | 20  | 16
Dolby Atmos Home Theater (HDMI) | 12 (7.1.4)*                       | -            | 20   | 20  | 20
DTS:X for Home Theater (HDMI)   | 17 (8.1.4.4)*                     | -            | 20   | 20  | 16

* Can be spatialized.
● Support for Spread
● Time sync across static/dynamic objects
Hardware Spatial Audio Capabilities - Resource Limits
24. Delivering the "best" mix
"The best possible way to ensure a good mix is to audition it"
● Stereo
● 5.1
● 7.1.4
Rely on the adaptability of your sound engine to deliver the best mix for the listening configuration.
26. Design / Authoring Configurations vs. Auditioning Configurations
Different modes of listening, etc.
27. Designed for the Best Possible Audio
● Output configurations dynamically conformed to the endpoint
● Informed by authoring
● Simplified complexity
28. Object-based Audio
A way to deliver audio, along with its metadata, to an endpoint.
Benefits
● Best possible spatial precision
● Author once for all outputs
● Can be hardware accelerated
● Opens the door to HRTF
Considerations
● HRTF/binauralization will "color" your sound. Some sounds might be best represented as NOT an Audio Object.
● Should you do both a binaural and a non-binaural mix?
● If a system doesn't have Audio Objects, the sound engine should simply fall back on channel-based output.
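The fallback rule above can be sketched as a small routing function. This is a hypothetical simplification of what a sound engine like Wwise does automatically, not actual engine code; the parameter names are assumptions.

```python
def route_sound(has_3d_position, endpoint_supports_objects,
                objects_in_use, object_limit):
    """Prefer a System Audio Object when the endpoint supports them and a
    slot is free; otherwise fall back to the channel-based main mix."""
    if not endpoint_supports_objects:
        return "main_mix"                     # channel-based fallback
    if has_3d_position and objects_in_use < object_limit:
        return "audio_object"
    return "main_mix"                         # rendered into the spatialized bed
```

The same authored content works in both cases; only the delivery path changes.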
30. Wwise Audio Pipeline
[Diagram: inside Wwise, Audio Objects, a Passthrough Mix, and a Main Mix are produced according to a user-defined output configuration and delivered to the endpoint's audio configuration.]
31. Wwise Spatial Audio Pipeline Overview
[Diagram: SFX, Music, VO, and Ambience voices route, with DSP, into three paths inside Wwise: a channel-based Passthrough Mix (ex. Stereo), a channel-based Spatial Bed / Main Mix (ex. Ambisonics, virtualized), and Audio Objects (+ Metadata). The user-defined output configuration (speakers, headphones, spatialization) initializes Wwise to match the endpoint, where the Audio Device applies its own mix and DSP on the Master Audio Bus output.]
37. Main Mix: Automatic Output Determination
https://www.audiokinetic.com/library/edge/?source=Help&id=system_audio_device
Wwise will help determine the output type:
Passthrough Mix
● It has a mono or stereo channel configuration.
● It does not have a 3D position.
If not…
Audio Object
● It has a 3D position.
● Its Speaker Panning / 3D Spatialization Mix is set to 100%.
● It has a standard channel configuration* that does not have any height channels.
● It would not exceed the number of available Audio Objects.
If not…
Main Mix
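The determination rules above can be restated as a small decision function. This is a hypothetical sketch of the documented rules, not Wwise code; the parameter names are assumptions.

```python
def determine_output(channels, has_height, has_3d_position,
                     spatialization_mix, objects_in_use, object_limit):
    """Decide which output path a sound takes, per the rules on slide 37."""
    # Passthrough Mix: mono or stereo, and no 3D position.
    if channels <= 2 and not has_3d_position:
        return "passthrough"
    # Audio Object: 3D-positioned, fully spatialized, no height channels,
    # and an object slot is still available.
    if (has_3d_position and spatialization_mix == 1.0
            and not has_height and objects_in_use < object_limit):
        return "audio_object"
    # Otherwise it is rendered into the Main Mix (spatialized bed).
    return "main_mix"
```

Adding Wwise System Output Settings Metadata to a sound overrides this automatic choice.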
38. Manual Output Assignment using Metadata
● Use Default
● Mix to Main
● Mix to Passthrough
45. How do I prioritize which sounds become Audio Objects?
46. Audio/Auxiliary Bus: 3D Audio Bed Mixer
● Can reduce the number of Audio Objects passing through the bus
● Mixes Audio Objects over the defined limit, depending on the behavior settings
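One way to handle Audio Objects over the limit is to rank sounds by priority, keep the top of the list as dynamic objects, and return the remainder to the spatialized bed. The sketch below is a simplified model of that idea, not the actual bed-mixer behavior settings.

```python
def split_by_priority(objects, limit):
    """Keep the highest-priority sounds as dynamic Audio Objects; the rest
    are mixed into the spatialized bed. 'objects' is a list of
    (name, priority) pairs."""
    ranked = sorted(objects, key=lambda o: o[1], reverse=True)
    return ranked[:limit], ranked[limit:]

keep, to_bed = split_by_priority(
    [("footstep", 90), ("ambience", 10), ("gunshot", 100), ("bird", 30)],
    limit=2)
```

Sounds moved to the bed still get spatialized and binauralized, just at the bed's channel resolution rather than as discrete objects.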
51. Bus Configurations
● Same as parent
● Audio Objects
● Same as main mix
● Same as passthrough mix
52. Hardware Acceleration & Capabilities

PS5™
Format       | Channel Mix                | Audio Objects
3D Audio     | 7.1 / Ambisonic 5th Order  | Many
Non-3D Audio | 7.1 / Headphones           | N/A

Windows 10 / Xbox One / HoloLens (*LFE not counted)
Format                                       | Channel Mix  | Windows 10 | Xbox One | HoloLens
Windows Sonic for Headphones                 | 16 (8.1.4.4) | 112        | 16       | 31
Dolby Atmos (Headphones & Built-in Speakers) | 16 (8.1.4.4) | 16         | 16       | N/A
Dolby Atmos (HDMI)                           | 12 (7.1.4)   | 20         | 20       | N/A
DTS Headphone:X (Headphones)                 | 16 (8.1.4.4) | 32         | 32       | N/A

* iOS / Android TBD
https://learn.microsoft.com/en-us/windows/win32/coreaudio/spatial-sound
53. Per-Platform Changes: Bus Configuration
Optimizing the Bus Configuration to account for differences in the availability of System Audio Objects across platforms (e.g. 100's of Objects on one platform vs. 15 Objects on another; Override vs. Default per platform).
55. HRTF Renderers on Windows

Windows Sonic
● Included with Windows; free and already installed.
● https://support.microsoft.com/en-us/windows/how-to-turn-on-spatial-sound-in-windows-10-ca2700a0-6519-448d-5434-56f499d59c96

Dolby Atmos
● Get with Dolby Access: https://apps.microsoft.com/store/detail/dolby-access/9N0866FS04W8?hl=en-us&gl=us
● What's Dolby Atmos? https://youtu.be/XfSj4wIcLIY
● Dolby Atmos + Wwise: https://games.dolby.com/atmos/wwise/
● Paid plugin. Only supports 16 Audio Objects, but work is underway on extending it.

DTS
● Get with DTS Sound Unbound: https://apps.microsoft.com/store/detail/dts-sound-unbound/9PJ0NKL8MCSJ?hl=en-us&gl=us
● Paid plugin and the most expensive. Only supports up to 32 Audio Objects.

* Mac not yet supported by Wwise.
56. Audio/ Auxiliary Bus: Effects
Effect Plug-ins can use Audio Objects Metadata for processing
● Effects can modify Audio Object configurations
● Audio Objects can be gathered and signals processed per-object
● Example: Wwise Compressor
○ Audio Objects are gathered and evaluated as a group
○ Volume offset is calculated and applied to each Audio Object
○ Metadata is preserved
[Diagram: Audio Objects enter an Audio Bus running the Wwise Compressor (DSP). The Audio Objects are gathered, mixed to calculate the volume reduction, and the resulting volume offset is applied to each Audio Object.]
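The Compressor example above boils down to: analyze a downmix once, derive one gain, apply it to every object, and leave metadata untouched. The sketch below is a toy model with linear peak levels and a hard knee; it illustrates the flow, not the actual Wwise implementation.

```python
def compress_objects(objects, threshold, ratio):
    """Gather Audio Objects, analyze an internal downmix once, and apply a
    common volume offset to each object. Metadata is preserved."""
    # Analysis phase: peak of the summed downmix of all object buffers.
    downmix = [sum(frame) for frame in zip(*(o["buffer"] for o in objects))]
    peak = max(abs(s) for s in downmix)
    if peak <= threshold:
        gain = 1.0
    else:
        # Hard-knee compression of the amount over threshold.
        gain = (threshold + (peak - threshold) / ratio) / peak
    # Apply the common gain per object; positioning metadata is untouched.
    for o in objects:
        o["buffer"] = [s * gain for s in o["buffer"]]
    return gain

objs = [{"buffer": [0.5, 0.0], "position": (1, 0, 0)},
        {"buffer": [0.5, 0.0], "position": (0, 1, 0)}]
gain = compress_objects(objs, threshold=0.5, ratio=2.0)
```

Because the gain comes from one shared analysis pass, the relative balance and positions of the objects are preserved.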
57. Incompatible Effects
(Demonstrated in Wwise)
The following Effects are not supported by busses that are processing Audio Objects:
● Wwise Convolution Reverb, Wwise Matrix Reverb, and Wwise RoomVerb: Running one instance of the Effect for each Audio Object would cause performance issues.
● Wwise Peak Limiter: Peak limiting at the Audio Object level would be unreliable. When authoring Audio Objects, use the Mastering Suite plug-in on an Audio Device to apply peak limiting.
● Wwise Recorder: The Recorder cannot run multiple instances.
● Auro Headphone: Not supported.
https://www.audiokinetic.com/library/edge/?source=Help&id=using_effects_with_audio_objects#effects_on_mixing_bus
58. Bus Instance Object Processors
(Demonstrated in Wwise)
Certain Wwise Effects support Audio Objects intrinsically. Such Effects are called Object Processors and are instantiated only once per bus instance:
● Wwise Compressor: The Compressor is instantiated once and performs the analysis phase once on an internal downmix. The gain reduction is common to all Audio Objects.
● Mastering Suite: The multiband compressor works much like the Compressor.
● Wwise Meter: Analysis is performed once on an internal downmix.
● Wwise Reflect: When set to output Audio Objects, it produces one Audio Object per reflection.
https://www.audiokinetic.com/library/edge/?source=Help&id=using_effects_with_audio_objects#effects_on_mixing_bus
59. Wwise Routing - Defined by the Bus
[Diagram: three Audio Objects are routed through a "Same as Parent" bus to the Master Audio Bus and on to the Audio Device, with 3D Audio active (7.1.4 bed + 2 objects). Two objects with Metadata: Default are preserved to the endpoint as Audio Objects; one object with Metadata: Same as Main Mix is mixed into the 7.1.4 Main Mix.]
Any audio object without positioning will end up in the Main Mix or Passthrough.
60. Wwise Routing - Defined by the Bus
[Diagram: three Audio Objects are routed through a "Same as Parent" bus to the Master Audio Bus and on to the Audio Device, with 3D Audio active (7.1.4 bed + 2.0 passthrough + 1 object). One object with Metadata: Default is preserved to the endpoint as an Audio Object; one with Metadata: Same as Main Mix is mixed into the 7.1.4 Main Mix; one with Metadata: Same as Passthrough is mixed into the 2.0 Passthrough Mix.]
Editor's Notes
Many thanks to DevGAMM for inviting me to talk.
My name is Mads, pronounced without saying the D, and I work at Audiokinetic.
Today I'll be talking about How to Improve Spatial Accuracy of your mix using Audio objects, which is also referred to as …
Object-based Audio
Binaural Audio
3D Audio
HRTF
or sometimes Spatial Audio, even though that might also be something quite different.
There are minor differences in between the terms, but in general we'll refer to this as either Object-based Audio or that you can send Audio Objects.
So "why" do you have to think about Object-based Audio?
Technology is constantly evolving … and before we know it … something like Object-based Audio could easily be the norm.
Especially if you've got a couple of years ahead on your current game production, then by the time your game is released, many might EXPECT your game to feature 3D audio.
Just to make sure everyone knows the terms 3D audio and HRTF, then…
3D Audio is a way to filter sounds so they sound like they come from above, behind, etc. and for calculating how this filtering should be applied…
… a head-related transfer function is calculated, for instance based on a head like this. Sounds are then filtered to simulate the effect of these body properties, so HRTF'ed sounds will feel much more like they are out in the real world, not on top of your head.
And something like 3D audio seems to soon become the norm.
"How many of you have experienced 3D audio?"
3D audio is available in consoles, computers… even the small Apple in-ears you see everywhere have "Spatial Audio" mode, which is basically 3D audio.
So whether you learn about it now or later, you'll probably have to know / account for it.
3D audio is not really a new thing… BUT now it's much more accessible.
Hardware support
Adoption
Tool support
E.g. in Wwise you've been able to do this for years, but in 21.1 it was made much easier to setup and more importantly PROFILE. Troubleshooting what's wrong and why.
Why is 3D audio getting so popular? Well, who doesn't have a pair of headphones? And it seems there will be many more headphones out there in the future.
So all it takes is a pair of headphones, and maybe a personal HRTF, which could be used in 3D Audio.
We like to refer to this as Object-based Audio …
… and I'll tell you more about the differentiation later …
… but let's start by going through the benefits and potential problems.
First, and maybe most important of all, is it's possible to maximize Spatial Accuracy.
To explain this, let's consider a hypothetical problem.
1x 4 channel ambience
1x 1 channel bird sound positioned in space.
With channels, the bird would simply be mixed into the channels closest to the sound position.
What's the problem?
The Problem is that the 3d positioned sound is tied to the channel mix.
For example, if you choose to move one speaker …
… the sound would now come from back right, instead of front right.
The problem here is not about speaker positions, but that this is ALSO what you'll deliver to 3D Audio.
… 3D Audio, the HRTF renderer ONLY has the channel mix to apply filters on.
So the front right channel will get filtered as being front right … and the back right channel will get filtered as back right.
At this stage, it's impossible for HRTF to untangle the sound from the channels for HRTF calculations.
It might provide OK results, but the spatial accuracy will be limited to the channel count.
Imagine this… what if we placed a separate speaker at the bird's position? Then the sound would not be dependent on your channel mix, but would always be at the exact position it needs to be.
So when sending this separate speaker to the HRTF rendering, it will be able to MUCH more precisely represent it, no matter the channel mix.
This is also how you should think about the idea of 3D audio. Maybe keep sending the channel mix, but for the more precise sounds, you send them separately outside of the HRTF rendering.
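The "bird mixed into the nearest channels" problem can be seen in a simple constant-power panner: the source position is baked into channel gains, and those gains are all a downstream HRTF renderer ever sees. The sketch below pans a mono source between the two nearest speakers of a quad layout; the speaker azimuths are assumptions for illustration.

```python
import math

def pan_quad(azimuth_deg, speaker_azimuths=(45, 135, 225, 315)):
    """Constant-power pan of a mono source between the two nearest
    speakers. Spatial accuracy is limited to the channel layout: the
    exact source direction is lost once it becomes channel gains."""
    az = azimuth_deg % 360
    spk = sorted(speaker_azimuths)
    gains = {a: 0.0 for a in spk}
    # Find the adjacent speaker pair bracketing the source azimuth.
    for i in range(len(spk)):
        lo, hi = spk[i], spk[(i + 1) % len(spk)]
        span = (hi - lo) % 360
        offset = (az - lo) % 360
        if offset <= span:
            frac = offset / span
            gains[lo] = math.cos(frac * math.pi / 2)  # constant-power law
            gains[hi] = math.sin(frac * math.pi / 2)
            break
    return gains

g = pan_quad(90)   # a source straight ahead of a 45/135/225/315 quad layout
```

An Audio Object instead carries the azimuth itself to the renderer, so no precision is lost to the intermediate channel layout.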
But what is an Audio Object > read slide.
So basically, it's a sound with some information on it, and once it reaches the HRTF renderer, this will get consumed.
Positioned in this direction.
At that height.
With that much spread.
There's been many technological advancements recently to bring you 3D Audio…
Yes I will help you binauralize your audio… but technically it's a separate decoder you can send your sounds to … and it will process these sounds.
Without it, we'd have to use the CPU to decode and handle sounds, in that constant struggle with the other teams on your game's development that also need CPU processing. But with a separate decoder, you get dedicated processing of your sounds.
Both of these consoles have audio decoders to which we can send sound to be processed: not only audio objects, but also the channel mix and more.
Back to Audio Objects: how many can we send to this audio decoder?
Should some sounds forgo an object slot when other, more vital sounds need one? Use priorities to define this.
Doubling numbers in audio objects since last year.
And this is of course something you tightly monitor in your sound engine tool.
Another benefit would be the ability to author once for all outputs.
Let's say you're in the final stage of development and you need to do a mix of the game.
Well … Best possible way to ensure a good mix is to audition it …
… but having to redo a mix for every output configuration is not always an option …
… so it's not uncommon to just mix with the highest resolution (7.1.4) and then "Rely on the adaptability…"
7.1.4 is not the best for all outputs, because it cannot use object-based audio.
So in other words, many are simply mixing for the highest channel configuration and then "cross fingers" on it downscaling well.
But now that we have object-based audio we can use an even higher "resolution", with just the use of a pair of headphones.
To better understand it, let's compare it to image file-types.
A JPG has a fixed resolution, just like channels; no matter what size, it will always be the same resolution. Here the small dots are ALSO pixels, but you don't notice them.
For larger formats like posters, you might instead use vector images, which can be scaled up infinitely, so no more pixelation when making huge wall prints or posters.
Audio Objects are kinda like vector images. No matter the output resolution, it will always look sharp.
Therefore the takeaway is … Object-based Audio allows you to deliver the best possible mix for a wide range of listening configurations.
So if you are using 3D audio, then you'd already use one of the high resolution for spatial precision, which should translate well into whatever other output configuration might be used.
HRTF will affect your sounds…
Low end is filtered out
Filters introduce
Coloration
Phase shifting
Ask yourself, what's more important? the interactive or aesthetic fidelity
Should you do a binauralized mix and one that is not?
Well… the technology is moving rapidly towards having these kind of 3D audio on by default.
Take Apple in-ears. How many of you actually "disable" the spatial audio feature?
Enough conceptual discussion, let's talk about how it's setup … and here I'll use Wwise, which has clearly defined paths and profiling tools …
… but remember that Wwise has been adapted for the technology … so the differentiation between objects and channels is not exclusive to Wwise.
In Wwise, three different paths are defined.
When the game starts, Wwise is informed of the endpoint's audio configuration and dynamically configures its output accordingly.
Let's have a deeper look at how to use the pipeline.
Passthrough: Channel based + stereo. Most commonly used for Music.
Spatialized bed: For channel based stuff that doesn't need to become audio objects. Like an ambience, where channels are spread around you.
Audio Objects: For the sounds that need the best possible spatial accuracy.
Both Spatialized bed and Audio Objects will get HRTF'ed in 3D Audio.[MAYBE SHOW IN VISUALS]
As an exercise, let's create a soundscape of sounds.
First, on top of you … with no positioning and not spatialized … you can send sounds to the passthrough mix. This would most likely be music or UI sounds as they don't need to be spatialized, but rather just play the channels in 2D, as they were exported from your DAW.
Then on the outside, we add the more diffuse sounds for colouring the soundscape, like a campfire, ambience from woodlands, or birds.
And in between we assign the more vital sounds, or important to locate in space, to Audio Objects … so we ensure whatever the output is, we represent those sounds with the best possible accuracy.
This might seem like a lot of additional work … but if you are using a sound engine like Wwise … it will automatically determine the "proper" output.
We don't want to get too much into this … but basically … Wwise will see whether a sound would be ideal to have as an audio object, passthrough, or main mix.
However, if you don't agree with this, you can add Metadata to override it.
Here's an example of metadata added to a sound … you choose what output it should have … and this information will carry on to the endpoint to be consumed there.
What might be a very important detail is that 3D Audio does not necessarily include Audio Objects.
Some will use 3D audio on channel-based audio alone … for instance, send an ambisonics channel mix to the HRTF rendering and that's it.
Both channels (main mix) and Audio Objects are HRTF'ed, but of course you get the highest precision by using Audio Objects.
That said, binaural processing is then limited to the channel count of the main mix.
What you're about to see is…
Wwise connected to Unreal sample game
Object-based Audio (Windows Sonic) turned on
Wwise Audio Lab
Go to campfire
Zoom > System Audio Device
Only main mix populate
Turn on Windows Sonic
All meters populate
Audio Object 3D Viewer
Zoom-in
Turn: Directionality of sounds
Click object > sphere
Sending 200 sounds, which of the 111 will become audio objects?
First-come, first-serve basis. BUT…
Don't limit yourself.
Should some sounds forgo an object slot when other, more vital sounds need one? Use priorities to define this.
Doubling numbers in audio objects since last year.
Should you want to try these examples out yourself, you can grab WAL from the Audiokinetic Launcher.
For 3D audio, you can always use Windows Sonic.
Questions?
Bus configurations can be set on Audio and Aux busses to optimize the sources routed to a bus.
The “Same as parent” configuration indicates that the Bus will inherit the Bus Configuration of its parent in the hierarchy. The bus configuration of the Master Audio Bus is implicitly set to Parent, because it inherits the bus configuration of the associated audio output device.
The “Same as Main Mix” & “Same as Passthrough Mix” configurations indicate that the Bus will inherit the Bus Configuration of the initialized Main or Passthrough Mix.
Additional Bus Configurations include the usual channel-based configurations, including: 2.0, 5.1, 7.1, 7.1.4, up to 5th Order Ambisonic and many more.
If an Audio or Aux Bus is set to anything other than Parent or Audio Objects, it forces a submix at this level of the hierarchy and will consume any Audio Object Metadata.
(Also see: Understanding the Voice Pipeline and Understanding Bus Configurations)
Per-platform changes can be made across many of the properties in Wwise, including Bus Configurations, which allows for optimizations that adjust to platform-specific functionality.
Starting in Wwise 2021.1, effects no longer require mixing and can process audio objects individually.
Additionally, effects can use Audio Object Metadata for processing.
Effects can modify Audio Object configurations
Audio Objects can be gathered and signals processed per-object
Example: Wwise Compressor
Audio Objects are gathered and evaluated as a group
Volume offset is calculated and applied to each Audio Object
Metadata is preserved
In this example, All Audio Objects are routed to an Audio Bus with a “Same as Parent” Configuration.
2 Audio Objects and their Metadata will be preserved to the Endpoint and positioned binaurally as System Audio Objects.
1 Audio Object has Wwise System Output Settings Metadata added with the Mix Behavior of “Same as Main Mix” and will be mixed to the 7.1.4 configuration and then output to the endpoint where it will be virtualized in a 7.1.4 configuration and binauralized.
In this example, All Audio Objects are routed to an Audio Bus with a “Same as Parent” Configuration.
1 Audio Object and its Metadata will be preserved to the Endpoint and positioned binaurally as a System Audio Object.
1 Audio Object has Wwise System Output Settings Metadata added with the Mix Behavior of “Same as Main Mix” and will be mixed to the 7.1.4 configuration and then output to the endpoint where it will be virtualized in a 7.1.4 configuration and binauralized.
1 Audio Object has Wwise System Output Settings Metadata added with the Mix Behavior of “Same as Passthrough Mix” and will be mixed to the 2.0 configuration set by the Endpoint for the Passthrough Mix that will remain unfiltered.
Making decisions about what sounds or parts of a sound should remain unfiltered is often dependent on the content and intention of the interactive scenario.
Preserving Metadata, including 3D position, to the Audio Device without destroying it through the process of mixing is essential. An Audio Object without a 3D position will be mixed to the Main Mix by the Audio Device.