SlideShare a Scribd company logo
1 of 45
FILE FORMAT FOR
AUDIO PROGRAMME
MATERIALS WITH
METADATA
Mostafa Tavallaei
This presentation contains:
– BWF as an old but useful format
– The specification of the BW64 (Broadcast Wave 64Bit)
audio file format
– ADM, Audio Definition Model
2
BWF
■ The BWF file format (Broadcast Wave Format) ,
Recommendation ITU-R BS.1352
■ The (BWF) is based on the Microsoft WAVE audio file format
which is a type of file specified in the Microsoft “Resource
Interchange File Format”, RIFF
■ The basic building block of the RIFF file format, called a
chunk, contains a group of tightly related pieces of
information. 3
BWF
■ It consists of a chunk identifier, an integer value
representing the length in bytes and the information
carried.
■ A RIFF file is made up of a collection of chunks.
■ For the BWF, some restrictions are applied to the original
WAVE format.
■ In addition the BWF file includes a <Broadcast Audio
Extension> chunk Fig. 1 4
5
BWF
Contents of a broadcast wave format file:
A broadcast wave format file shall start with the mandatory
Microsoft RIFF “WAVE” header and at least the following
chunks:
6
BWF
BWF has a number of limitations, most notably:
• Maximum file size of less than 4Gbytes.
• No support for advanced multichannel audio.
• Inadequate support for technical metadata.
7
BW64 format described in next slides aims to overcome these
limitations, and maintain as much compatibility as possible
with the Recommendation ITU-R BS.1352 format with many of
the core elements shared.
BW64 RECOMMENDATION ITU-R BS.2088-0
■ The BW64 (Broadcast Wave 64Bit) format is based on
RIFF
■ including the new chunks
– <ds64>
– <axml>
– <chna>
8
“Long-form file format for the international exchange of
audio programme materials with metadata”
which enable the file to carry large multichannel files and
metadata including the Audio Definition Model (ADM) specified
in Recommendation ITU-R BS.2076.
BW64
format description:
A BW64 format file should start with the mandatory “WAVE” header
and at least the following chunks:
<WAVE-form> ->
BW64(‘WAVE’
<ds64-ck> // ds64 chunk for 64-bit addressing
<fmt-ck> // Format of the audio signal: PCM/non-PCM
<chna-ck> // chna chunk for ADM look-up
<axml-ck> // axml chunk for carrying XML metadata
<wave-data>) // sound data
9
BW64
■ There is an increasing demand on Audio Definition Model (ADM)
metadata according to Recommendation ITU-R BS.2076.
■ This Recommendation includes a definition of the <axml>
chunk for storing and transferring metadata as XML
■ The primary purpose of the <chna> is to provide the references
from each track in a BW64 file to the IDs in the ADM metadata
defined in Recommendation ITU-R BS.2076
10
BW64
format description:
Existing chunks defined as part of the RIFF/WAVE standard
The RIFF/WAVE standard uses a number of chunks that are already defined.
These are:
• <RIFF>
• <fmt>
• <data>
11
BW64
format description:
The RIFF/WAVE is a subset of the ITU-R BS.1352 format.
Recommendation ITU-R BS.1352 contains these additional
chunks:
• <bext>
• <ubxt>
These chunks will not be included in the BW64 format,
which provides a more flexible solution carrying broadcast
metadata.
12
BW64
format description:
New Chunks and Structs in the BW64 format
• <BW64>
• <ds64>
• <JUNK>
• <axml>
• <chna>
13
BW64
format description:
Using the <ds64> (data size 64) chunk to enable the use of files greater
than 4 Gbyte in size
14
• The reason for the 4 Gbyte barrier is the 32-bit addressing in RIFF/WAVE and
BWF
• With 32 bits a maximum of 4294967296 bytes = 4 Gbyte can be addressed
• The structure of a basic conventional RIFF/WAVE file is shown in Fig. 2, where
the chunkSize fields are 32-bit numbers representing the sizes of their chunks
BW64
15
FIGURE 2
Basic RIFF/WAVE file structure
Just changing the size of
every field in a BWF to
64-bit would produce a
file that is not compatible
with the standard
RIFF/WAVE format – an
obvious but important
observation
BW64
format description:
The approach adopted is to define a new 64-bit based RIFF called BW64
that is identical to the original RIFF/WAVE format, except for the
following changes:
16
• The ID ‘BW64’ is used instead of ‘RIFF’ in the first four bytes of the file
• A mandatory <ds64> chunk is added, which has to be the first chunk after the “BW64 chunk”
BW64
format description:
17
The ‘ds64’ chunk has two mandatory 64-bit integer values, which
replace two 32-bit fields of the RIFF/WAVE format:
• bw64Size (replaces the size field of the <RIFF> chunk)
• dataSize (replaces the size field of the <data> chunk)
For all two 32-bit fields of the RIFF/WAVE format the following rule applies:
 If the 32-bit value in the field is not “-1” (= FFFFFFFF hex) then this 32-bit value is used.
 If the 32-bit value in the field is “-1” the 64-bit value in the ‘ds64’ chunk is used instead.
BW64
format description:
18
The complete structure of the BW64 file format is illustrated in Fig. 3,
where the chunkSize values for the <BW64> and <data> chunks are
set to −1, to allow them to use 64-bit size values from the <ds64>
chunk.
BW64
format description:
19
FIGURE 3
BW64 file structure
BW64
format description:
20
Achieving compatibility between RIFF/WAVE and BW64
• Some production audio files will inevitably be smaller than 4 Gbyte and they should
therefore stay in the short-form RIFF/WAVE format
• A recording application cannot know in advance whether the recorded audio it is compiling
will exceed 4 Gbyte or not at end of recording
• The solution is to enable the recording application to switch from RIFF/WAVE to BW64 on
the fly at the 4 Gbyte size limit, while the recording is still going on
BW64
format description:
21
Achieving compatibility between RIFF/WAVE and BW64
• This is achieved by reserving additional space in the RIFF/WAVE by inserting a <JUNK>
chunk that is of the same size as a <ds64> chunk
• This reserved space has no meaning for short-form WAVE, but will become the <ds64>
chunk, if a transition to BW64 is necessary.
The diagram in Fig. 4 shows the <JUNK> placeholder chunk placed before the <fmt > chunk
BW64
format description:
22
FIGURE 4
File structure with JUNK chunk
BW64
format description:
23
At the beginning of a recording, a BW64-aware application will create a standard
RIFF/WAVE with a ’JUNK’ chunk as the first chunk. While recording, it will check the
RIFF and data sizes. If they exceed 4 Gbyte, the application will:
• Replace the chunkID <JUNK> with <ds64> chunk. (This transforms the previous <JUNK> chunk
into a <ds64> chunk).
• Insert the RIFF size, 'data' chunk size and sample count in the <ds64> chunk
• Set RIFF size, 'data' chunk size and sample count in the 32 bit fields to −1 = FFFFFFFF hex
• Replaces the ID ‘RIFF’ with ‘BW64’ in the first four bytes of the file
• Continue with the recording.
BW64
DS64 and JUNK chunks
24
Definitions:
• The <ds64> chunk carries 64-bit size values for the file size, <data> chunk and an
array of 64-bit size values of other definable chunks
• The structure for the <ds64> chunk is shown below, followed by the structure for
ChunkSize64 table that carries the sizes for definable chunks (other than <data>)
• The empty array brackets indicate a variable number of elements can be used
(including zero)
BW64
25
BW64
Elements of the <ds64> chunk
26
BW64
Elements of the <ds64> chunk
27
BW64
Elements of the <ds64> chunk
28
The ChunkSize64 table is specified as follows.
An array of ChunkSize64 structs is used to store the length of any chunk other than
<data> in the optional part of the <ds64> chunk.
Currently, the only chunk type other than <data> is likely to exceed a size of 4 Gbytes
would be the <axml> chunk
Compatibility with Recommendation ITU-R
BS.1352
29
• BWF carries broadcast metadata in the <bext>
and <ubxt> chunks.
• These chunks have fixed length fields and are
limited to the specified fields, thus preventing
any other broadcast related metadata being
carried.
• The <axml> chunk in BW64 can carry any
XML metadata, so can be used to carry
broadcast metadata, including the parameters
in the <bext> and <ubxt> chunks
XML (Extensible Markup Language)
■ XML is similar to HTML
■ Both XML and HTML contain markup symbols to describe the
contents of a page or file
■ HTML, however, describes the content only in terms of how it is to
be displayed and interacted with. For example, the letter "p"
placed within markup tags starts a new paragraph
■ XML describes the content in terms of what data is being described
■ XML is considered extensible because, unlike HTML, the markup
symbols are unlimited and self-defining
■ Introduction to XML
30
ADM RECOMMENDATION ITU-R BS.2076
Audio Definition Model
31
• This Recommendation describes the structure of a metadata model
that allows the format and content of audio files to be reliably
described.
• This model specifies how XML metadata can be generated to
provide the definitions of tracks in an audio file
• Each individual track within a file or stream should be able to be
correctly rendered, processed or distributed according to the
accompanying metadata
ADM RECOMMENDATION ITU-R BS.2076
■ Audio Definition Model is an open standard to ensure
compatibility across all systems
■ This purpose of this model is to formalize the description
of audio.
■ It is not a format for carrying audio.
32
ADM RECOMMENDATION ITU-R BS.2076
Cooking Analogy
• To help explain what the ADM actually does, it may be useful to
consider a cooking analogy. The recipe for a cake will contain a list
of ingredients, instructions on how to combine those ingredients and
how to bake the cake.
• The ADM is like a set of rules for writing the list of ingredients; it
gives a clear description of each item, for example: 2 eggs, 400g
flour, 200g butter, 200g sugar.
• The ADM provides the instructions for combining ingredients but
does not tell you how to do the mixing or baking; in the audio world
that is what the renderer does.
• The ADM is in general compatible with wave-file based formats such
as BW64
33
ADM RECOMMENDATION ITU-R BS.2076
■ This model will initially use XML as its specification
language
■ When it is used with BWF, the XML can be embedded in
the <axml> chunk of the file
■ The model is divided into two sections:
• format part
• content part
34
ADM RECOMMENDATION ITU-R BS.2076
■ The format part describes the technical nature of the audio so it
can be decoded or rendered correctly.
■ The content part describes what is contained in the audio, so will
describe things like the language of any dialogue, the loudness and
so on.
■ Some of the format elements may be defined before we have any
audio signals, whereas the content parts can usually only be
completed after the signals have been generated.
35
AudioContent
■ An audioContent element describes the content of one component
of a programme (e.g. background music
■ It refers to audioObjects to tie the content to its format.
■ This element includes loudness metadata.
36
AudioContent
Attributes:
37
Sub-elements:
AudioContent
Elements:
38
AudioContent
Dialogue:
39
AudioContent
Dialogue:
40
AudioProgramme
■ An audioProgramme element refers to a set of one or more
audioContents that are combined to create a full audio programme.
■ It contains start and end timecodes for the programme which can
be used for alignment with video timecodes.
■ Loudness metadata is also included to allow the programme’s
loudness to be used.
41
AudioProgramme
42
AudioProgramme
43
Loudness project scheme for radio
automation assessment:
44
Questions??
45
Thanks for your
attention

More Related Content

Similar to file format for audio programme materials with metadata.pptx

Programming segment 8086 Memory model 8086
Programming segment 8086 Memory model 8086Programming segment 8086 Memory model 8086
Programming segment 8086 Memory model 8086FazalHameed14
 
Performance Evaluation of H.264 AVC Using CABAC Entropy Coding For Compound I...
Performance Evaluation of H.264 AVC Using CABAC Entropy Coding For Compound I...Performance Evaluation of H.264 AVC Using CABAC Entropy Coding For Compound I...
Performance Evaluation of H.264 AVC Using CABAC Entropy Coding For Compound I...DR.P.S.JAGADEESH KUMAR
 
Group presentation.pptx
Group presentation.pptxGroup presentation.pptx
Group presentation.pptxYonas D. Ebren
 
Cost Efficient H.320 Video Conferencing over ISDN including ...
Cost Efficient H.320 Video Conferencing over ISDN including ...Cost Efficient H.320 Video Conferencing over ISDN including ...
Cost Efficient H.320 Video Conferencing over ISDN including ...Videoguy
 
Cost Efficient H.320 Video Conferencing over ISDN including ...
Cost Efficient H.320 Video Conferencing over ISDN including ...Cost Efficient H.320 Video Conferencing over ISDN including ...
Cost Efficient H.320 Video Conferencing over ISDN including ...Videoguy
 
Spatial Scalable Video Compression Using H.264
Spatial Scalable Video Compression Using H.264Spatial Scalable Video Compression Using H.264
Spatial Scalable Video Compression Using H.264IOSR Journals
 
P C I L O C A L B U S
P C I  L O C A L  B U S P C I  L O C A L  B U S
P C I L O C A L B U S jainfu2
 
DCN 5th ed. slides ch22 Next Generation IP.pdf
DCN 5th ed. slides ch22 Next Generation IP.pdfDCN 5th ed. slides ch22 Next Generation IP.pdf
DCN 5th ed. slides ch22 Next Generation IP.pdfBilal Munir Mughal
 
/conferences/spr2004/presentations/eubanks/eubanks_mpeg4.ppt
/conferences/spr2004/presentations/eubanks/eubanks_mpeg4.ppt/conferences/spr2004/presentations/eubanks/eubanks_mpeg4.ppt
/conferences/spr2004/presentations/eubanks/eubanks_mpeg4.pptVideoguy
 
Ame 2280 mq for z os latest features
Ame 2280 mq for z os latest featuresAme 2280 mq for z os latest features
Ame 2280 mq for z os latest featuresPete Siddall
 
Protected addressing mode and Paging
Protected addressing mode and PagingProtected addressing mode and Paging
Protected addressing mode and Pagingmarada0033
 
Introduction to Storage.ppt
Introduction to Storage.pptIntroduction to Storage.ppt
Introduction to Storage.pptParam Eswaran
 
AVC VIDEO SECURITY ON WIRELESS CHANNEL
AVC VIDEO SECURITY ON WIRELESS CHANNELAVC VIDEO SECURITY ON WIRELESS CHANNEL
AVC VIDEO SECURITY ON WIRELESS CHANNELijma
 

Similar to file format for audio programme materials with metadata.pptx (20)

video compression2
video compression2video compression2
video compression2
 
video compression2
video compression2video compression2
video compression2
 
Programming segment 8086 Memory model 8086
Programming segment 8086 Memory model 8086Programming segment 8086 Memory model 8086
Programming segment 8086 Memory model 8086
 
Performance Evaluation of H.264 AVC Using CABAC Entropy Coding For Compound I...
Performance Evaluation of H.264 AVC Using CABAC Entropy Coding For Compound I...Performance Evaluation of H.264 AVC Using CABAC Entropy Coding For Compound I...
Performance Evaluation of H.264 AVC Using CABAC Entropy Coding For Compound I...
 
Group presentation.pptx
Group presentation.pptxGroup presentation.pptx
Group presentation.pptx
 
Cost Efficient H.320 Video Conferencing over ISDN including ...
Cost Efficient H.320 Video Conferencing over ISDN including ...Cost Efficient H.320 Video Conferencing over ISDN including ...
Cost Efficient H.320 Video Conferencing over ISDN including ...
 
Cost Efficient H.320 Video Conferencing over ISDN including ...
Cost Efficient H.320 Video Conferencing over ISDN including ...Cost Efficient H.320 Video Conferencing over ISDN including ...
Cost Efficient H.320 Video Conferencing over ISDN including ...
 
E010132529
E010132529E010132529
E010132529
 
Spatial Scalable Video Compression Using H.264
Spatial Scalable Video Compression Using H.264Spatial Scalable Video Compression Using H.264
Spatial Scalable Video Compression Using H.264
 
proposal
proposalproposal
proposal
 
P C I L O C A L B U S
P C I  L O C A L  B U S P C I  L O C A L  B U S
P C I L O C A L B U S
 
Chap12
Chap12Chap12
Chap12
 
H263.ppt
H263.pptH263.ppt
H263.ppt
 
DCN 5th ed. slides ch22 Next Generation IP.pdf
DCN 5th ed. slides ch22 Next Generation IP.pdfDCN 5th ed. slides ch22 Next Generation IP.pdf
DCN 5th ed. slides ch22 Next Generation IP.pdf
 
/conferences/spr2004/presentations/eubanks/eubanks_mpeg4.ppt
/conferences/spr2004/presentations/eubanks/eubanks_mpeg4.ppt/conferences/spr2004/presentations/eubanks/eubanks_mpeg4.ppt
/conferences/spr2004/presentations/eubanks/eubanks_mpeg4.ppt
 
Ame 2280 mq for z os latest features
Ame 2280 mq for z os latest featuresAme 2280 mq for z os latest features
Ame 2280 mq for z os latest features
 
Protected addressing mode and Paging
Protected addressing mode and PagingProtected addressing mode and Paging
Protected addressing mode and Paging
 
Chapter 1 archietecture of 8086
Chapter 1 archietecture of 8086Chapter 1 archietecture of 8086
Chapter 1 archietecture of 8086
 
Introduction to Storage.ppt
Introduction to Storage.pptIntroduction to Storage.ppt
Introduction to Storage.ppt
 
AVC VIDEO SECURITY ON WIRELESS CHANNEL
AVC VIDEO SECURITY ON WIRELESS CHANNELAVC VIDEO SECURITY ON WIRELESS CHANNEL
AVC VIDEO SECURITY ON WIRELESS CHANNEL
 

Recently uploaded

Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdfankushspencer015
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfJiananWang21
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...Call Girls in Nagpur High Profile
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueBhangaleSonal
 
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...SUHANI PANDEY
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . pptDineshKumar4165
 
chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringchapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringmulugeta48
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...roncy bisnoi
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxfenichawla
 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698
 

Recently uploaded (20)

Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
 
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
 
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringchapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineering
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
 

file format for audio programme materials with metadata.pptx

  • 1. FILE FORMAT FOR AUDIO PROGRAMME MATERIALS WITH METADATA Mostafa Tavallaei
  • 2. This presentation contains: – BWF as an old but useful format – The specification of the BW64 (Broadcast Wave 64Bit) audio file format – ADM, Audio Definition Model 2
  • 3. BWF ■ The BWF file format (Broadcast Wave Format) , Recommendation ITU-R BS.1352 ■ The (BWF) is based on the Microsoft WAVE audio file format which is a type of file specified in the Microsoft “Resource Interchange File Format”, RIFF ■ The basic building block of the RIFF file format, called a chunk, contains a group of tightly related pieces of information. 3
  • 4. BWF ■ It consists of a chunk identifier, an integer value representing the length in bytes and the information carried. ■ A RIFF file is made up of a collection of chunks. ■ For the BWF, some restrictions are applied to the original WAVE format. ■ In addition the BWF file includes a <Broadcast Audio Extension> chunk Fig. 1 4
  • 5. 5
  • 6. BWF Contents of a broadcast wave format file: A broadcast wave format file shall start with the mandatory Microsoft RIFF “WAVE” header and at least the following chunks: 6
  • 7. BWF BWF has a number of limitations, most notably: • Maximum file size of less than 4Gbytes. • No support for advanced multichannel audio. • Inadequate support for technical metadata. 7 BW64 format described in next slides aims to overcome these limitations, and maintain as much compatibility as possible with the Recommendation ITU-R BS.1352 format with many of the core elements shared.
  • 8. BW64 RECOMMENDATION ITU-R BS.2088-0 ■ The BW64 (Broadcast Wave 64Bit) format is based on RIFF ■ including the new chunks – <ds64> – <axml> – <chna> 8 “Long-form file format for the international exchange of audio programme materials with metadata” which enable the file to carry large multichannel files and metadata including the Audio Definition Model (ADM) specified in Recommendation ITU-R BS.2076.
  • 9. BW64 format description: A BW64 format file should start with the mandatory “WAVE” header and at least the following chunks: <WAVE-form> -> BW64(‘WAVE’ <ds64-ck> // ds64 chunk for 64-bit addressing <fmt-ck> // Format of the audio signal: PCM/non-PCM <chna-ck> // chna chunk for ADM look-up <axml-ck> // axml chunk for carrying XML metadata <wave-data>) // sound data 9
  • 10. BW64 ■ There is an increasing demand on Audio Definition Model (ADM) metadata according to Recommendation ITU-R BS.2076. ■ This Recommendation includes a definition of the <axml> chunk for storing and transferring metadata as XML ■ The primary purpose of the <chna> is to provide the references from each track in a BW64 file to the IDs in the ADM metadata defined in Recommendation ITU-R BS.2076 10
  • 11. BW64 format description: Existing chunks defined as part of the RIFF/WAVE standard The RIFF/WAVE standard uses a number of chunks that are already defined. These are: • <RIFF> • <fmt> • <data> 11
  • 12. BW64 format description: The RIFF/WAVE is a subset of the ITU-R BS.1352 format. Recommendation ITU-R BS.1352 contains these additional chunks: • <bext> • <ubxt> These chunks will not be included in the BW64 format, which provides a more flexible solution carrying broadcast metadata. 12
  • 13. BW64 format description: New Chunks and Structs in the BW64 format • <BW64> • <ds64> • <JUNK> • <axml> • <chna> 13
  • 14. BW64 format description: Using the <ds64> (data size 64) chunk to enable the use of files greater than 4 Gbyte in size 14 • The reason for the 4 Gbyte barrier is the 32-bit addressing in RIFF/WAVE and BWF • With 32 bits a maximum of 4294967296 bytes = 4 Gbyte can be addressed • The structure of a basic conventional RIFF/WAVE file is shown in Fig. 2, where the chunkSize fields are 32-bit numbers representing the sizes of their chunks
  • 15. BW64 15 FIGURE 2 Basic RIFF/WAVE file structure Just changing the size of every field in a BWF to 64-bit would produce a file that is not compatible with the standard RIFF/WAVE format – an obvious but important observation
  • 16. BW64 format description: The approach adopted is to define a new 64-bit based RIFF called BW64 that is identical to the original RIFF/WAVE format, except for the following changes: 16 • The ID ‘BW64’ is used instead of ‘RIFF’ in the first four bytes of the file • A mandatory <ds64> chunk is added, which has to be the first chunk after the “BW64 chunk”
  • 17. BW64 format description: 17 The ‘ds64’ chunk has two mandatory 64-bit integer values, which replace two 32-bit fields of the RIFF/WAVE format: • bw64Size (replaces the size field of the <RIFF> chunk) • dataSize (replaces the size field of the <data> chunk) For all two 32-bit fields of the RIFF/WAVE format the following rule applies:  If the 32-bit value in the field is not “-1” (= FFFFFFFF hex) then this 32-bit value is used.  If the 32-bit value in the field is “-1” the 64-bit value in the ‘ds64’ chunk is used instead.
  • 18. BW64 format description: 18 The complete structure of the BW64 file format is illustrated in Fig. 3, where the chunkSize values for the <BW64> and <data> chunks are set to −1, to allow them to use 64-bit size values from the <ds64> chunk.
  • 20. BW64 format description: 20 Achieving compatibility between RIFF/WAVE and BW64 • Some production audio files will inevitably be smaller than 4 Gbyte and they should therefore stay in the short-form RIFF/WAVE format • A recording application cannot know in advance whether the recorded audio it is compiling will exceed 4 Gbyte or not at end of recording • The solution is to enable the recording application to switch from RIFF/WAVE to BW64 on the fly at the 4 Gbyte size limit, while the recording is still going on
  • 21. BW64 format description: 21 Achieving compatibility between RIFF/WAVE and BW64 • This is achieved by reserving additional space in the RIFF/WAVE by inserting a <JUNK> chunk that is of the same size as a <ds64> chunk • This reserved space has no meaning for short-form WAVE, but will become the <ds64> chunk, if a transition to BW64 is necessary. The diagram in Fig. 4 shows the <JUNK> placeholder chunk placed before the <fmt > chunk
  • 22. BW64 format description: 22 FIGURE 4 File structure with JUNK chunk
  • 23. BW64 format description: 23 At the beginning of a recording, a BW64-aware application will create a standard RIFF/WAVE with a ’JUNK’ chunk as the first chunk. While recording, it will check the RIFF and data sizes. If they exceed 4 Gbyte, the application will: • Replace the chunkID <JUNK> with <ds64> chunk. (This transforms the previous <JUNK> chunk into a <ds64> chunk). • Insert the RIFF size, 'data' chunk size and sample count in the <ds64> chunk • Set RIFF size, 'data' chunk size and sample count in the 32 bit fields to −1 = FFFFFFFF hex • Replaces the ID ‘RIFF’ with ‘BW64’ in the first four bytes of the file • Continue with the recording.
  • 24. BW64 DS64 and JUNK chunks 24 Definitions: • The <ds64> chunk carries 64-bit size values for the file size, <data> chunk and an array of 64-bit size values of other definable chunks • The structure for the <ds64> chunk is shown below, followed by the structure for ChunkSize64 table that carries the sizes for definable chunks (other than <data>) • The empty array brackets indicate a variable number of elements can be used (including zero)
  • 26. BW64 Elements of the <ds64> chunk 26
  • 27. BW64 Elements of the <ds64> chunk 27
  • 28. BW64 Elements of the <ds64> chunk 28 The ChunkSize64 table is specified as follows. An array of ChunkSize64 structs is used to store the length of any chunk other than <data> in the optional part of the <ds64> chunk. Currently, the only chunk type other than <data> is likely to exceed a size of 4 Gbytes would be the <axml> chunk
  • 29. Compatibility with Recommendation ITU-R BS.1352 29 • BWF carries broadcast metadata in the <bext> and <ubxt> chunks. • These chunks have fixed length fields and are limited to the specified fields, thus preventing any other broadcast related metadata being carried. • The <axml> chunk in BW64 can carry any XML metadata, so can be used to carry broadcast metadata, including the parameters in the <bext> and <ubxt> chunks
  • 30. XML (Extensible Markup Language) ■ XML is similar to HTML ■ Both XML and HTML contain markup symbols to describe the contents of a page or file ■ HTML, however, describes the content only in terms of how it is to be displayed and interacted with. For example, the letter "p" placed within markup tags starts a new paragraph ■ XML describes the content in terms of what data is being described ■ XML is considered extensible because, unlike HTML, the markup symbols are unlimited and self-defining ■ Introduction to XML 30
  • 31. ADM RECOMMENDATION ITU-R BS.2076 Audio Definition Model 31 • This Recommendation describes the structure of a metadata model that allows the format and content of audio files to be reliably described. • This model specifies how XML metadata can be generated to provide the definitions of tracks in an audio file • Each individual track within a file or stream should be able to be correctly rendered, processed or distributed according to the accompanying metadata
  • 32. ADM RECOMMENDATION ITU-R BS.2076 ■ Audio Definition Model is an open standard to ensure compatibility across all systems ■ This purpose of this model is to formalize the description of audio. ■ It is not a format for carrying audio. 32
  • 33. ADM RECOMMENDATION ITU-R BS.2076 Cooking Analogy • To help explain what the ADM actually does, it may be useful to consider a cooking analogy. The recipe for a cake will contain a list of ingredients, instructions on how to combine those ingredients and how to bake the cake. • The ADM is like a set of rules for writing the list of ingredients; it gives a clear description of each item, for example: 2 eggs, 400g flour, 200g butter, 200g sugar. • The ADM provides the instructions for combining ingredients but does not tell you how to do the mixing or baking; in the audio world that is what the renderer does. • The ADM is in general compatible with wave-file based formats such as BW64 33
  • 34. ADM RECOMMENDATION ITU-R BS.2076 ■ This model will initially use XML as its specification language ■ When it is used with BWF, the XML can be embedded in the <axml> chunk of the file ■ The model is divided into two sections: • format part • content part 34
  • 35. ADM RECOMMENDATION ITU-R BS.2076 ■ The format part describes the technical nature of the audio so it can be decoded or rendered correctly. ■ The content part describes what is contained in the audio, so will describe things like the language of any dialogue, the loudness and so on. ■ Some of the format elements may be defined before we have any audio signals, whereas the content parts can usually only be completed after the signals have been generated. 35
  • 36. AudioContent ■ An audioContent element describes the content of one component of a programme (e.g. background music ■ It refers to audioObjects to tie the content to its format. ■ This element includes loudness metadata. 36
  • 41. AudioProgramme ■ An audioProgramme element refers to a set of one or more audioContents that are combined to create a full audio programme. ■ It contains start and end timecodes for the programme which can be used for alignment with video timecodes. ■ Loudness metadata is also included to allow the programme’s loudness to be used. 41
  • 44. Loudness project scheme for radio automation assessment: 44

Editor's Notes

  1. Compulsory: اجباری
  2. Inadiquate: ناکافی