SlideShare a Scribd company logo
1 of 30
Download to read offline
AOM - AV1
How does it work?
July 2017
Pascal Massimino - Google
pascal.massimino@gmail.com
skal@
who?
Start with existing
FANBASES
AOM update - Membership
AOM: 25+ members and growing fast
Most recently joined: IBM, Hulu
Still targeting high-profile potential
members
Start with existing
FANBASES
AOMedia launched in Q4 2015
AV1 royalty-free video codec in active development
Goal: 40-50% gain over VP9/HEVC
Rapid hardware adoption
AOM update - AV1
Start with existing
FANBASES
Recent Developments
Successful 2-day Codec WG face-to-face in February
Attendance from 70+ engineers from member companies
AV1 currently delivering 25-35% gains across resolutions
50+ tools currently under consideration or fully adopted
Planning Towards Launch
Goal to achieve:
● Soft freeze of tools in Q3 2017
● Final bitstream freeze at EOY 2017
Dependencies on:
● Quality gains
● Completion of IP review
● Final approval of AOM Board
Start with existing
FANBASES
AV1 Adoption
YouTube will begin using AV1 immediately
Netflix will be an “early adopter” (video)
AV1 ecosystem (testing tools + SW optimizations) gearing up
vital: test streams!
Some AV1 Coding Tools
Introduction
AV1 Development model
● Tools are coded as experiment, under a flag (off by default)
● Eng review (~weekly)
● IP review (including external review)
● -> decision: go/no-go.
● Go: Flag is removed -> on by default.
● No-go: code is removed
AV1 is not compatible with VP9, or any prior ‘VP10’.
It is an open source project:
Code: https://aomedia.googlesource.com/ (and Issue tracker)
Video compression overview
Best Predictor
Inter/Intra
Source block
Prediction
Residue
T Q
Entropy
coding of
coefficients
Restoration
Info
Entropy
coding of
predictor
How to
update Ref
Buffers
CompressedBitstream
Loss
introduced
SI
Deblocking
Info
Level
AV1: just more of these
AV1: just more of these
Some experiments
● Allows coding of multi symbol alphabets:
○ 15-bit probabilities
○ Alphabet size up to 16 symbols
● Raw (equiprobable) bits:
○ Encoder writes from end of buffer & merges at finalize:
● One buffer per frame or tile
(longer explanation in video)
AV1 Entropy Coding
Multi-symbol Arithmetic Range Coder*
● Binary syntax elements combined into non-binary symbols:
○ Reduces symbol throughput in the entropy coder
○ Allows (non-boolean) per-symbol probability update
● Reduced signaling overhead:
○ Explicit probability signaling not required, reduced header size
○ Better performance at lower data rates
○ Reduced overhead in error-resilient case
AV1 Entropy Coding
Symbol Adaptive Coding
Maximum coding block size 64x64 (a “Superblock”)
Recursive sub-partitioning down to 4x4 (luma)
Coding parameters maintained at 4x4 level
Prediction:
Down to 2x2 for Chroma YUV 4:2:0 inter
Single reference frame at 4x4, for all YUV planes
Transform:
Minimum transform size 4x4 for intra- & inter-prediction
AV1 Block Structure
Dynamic partitioning
AV1 Block Structure
Prediction: Block Recursive Partitioning
"Superblock"
64x64
32x64
Vert
4x4
64x64
64x32 Horiz 32x32
prediction angle = nominal_angle + (angle_delta * angle_step)
nominal_angle ∊ {45, 63, 90, 117, 135, 153, 180, 207} degrees
Method:
○ Back-project P(r,c) to reference pixel along selected direction
○ Apply 2-tap linear interpolation filter with 1/256th
pixel precision
○ LUT tabulates dy corresponding to each possible dx in each available angle
90 entries
Each is a 16-bit integer, including 8-bits of precision
AV1 Intra Prediction
Generic Directional Predictor
angle_step angle_delta Total number of
angles
3 [-3, +3] 8 * 7 = 56
Example for a 4x4 block:
Lj
= Left, Ti
= Top, TL = Top-left neighbors closest to Pi,j
Paeth Predictor:
One of Lj
, Ti
, TL closest in value to Lj
+ Ti
- TL
Well-defined tie-break order: Lj,
Ti,
TL
AV1 Intra Prediction
Paeth Predictor (replaces TM-pred)
TL Ti
Lj
Pi,j
● Example for a 4x4 block:
○ Lj
= Left, Ti
= Top, Rj
= Right, Bi
= Bottom, TR = Top-right, BL = Bottom Left neighbors
outside of block closest to Pi,j
● Smooth Predictor:
○ Assume Rj
= TR, Bi
= BL
○ Pi,j
is a weighted combination of Li
, Rj
, Ti
, Bi
○ Weights equivalent to quadratic interpolation
○ Useful for blocks with smooth gradient
Variants: SMOOTH_H_PREV , SMOOTH_V_PRED
(0.5% intra BDRate improvement)
AV1 Intra Prediction
Smooth Predictor
Ti
TR
Lj
Pi,j
Rj
BL Bi
● Palette sizes of 2 to 8 ‘base’ color entries supported
● Separate palettes for Y, U, V planes
● Palette mode available only when block size >= 8x8:
● Palette size (N) and the N ‘base’ entries are encoded in
the bitstream
● A palette index is encoded, using a context model, for
each pixel in the block
● Pixels predicted in ‘wavefront’ order to allow parallel
computation
AV1 Intra Prediction
Palette Predictor - For Screen Content
● Specify the index of a MV from a list of MVs computed from neighbors
○ Examine MVs at the 8x8 block level
● Rank the MVs in the list by:
○ Distance from current block
○ Amount of overlap with current block
● Encode the index of the selected MV
mv4 mv5 mv5
mv0 mv0 mv1 mv1 mv2
mv5 mv2
mv6
mv3
mv5 mv1
AV1 Inter Prediction
Dynamic Reference Motion Vector Prediction (REFMV)
Category 1 Category 2
mv1, mv0, mv2, mv3 mv5, mv4, mv6
AV1 Inter Prediction
More Reference frames
Display Order
1 2
5
4
6 8
7
9
3
ARF
KF/
GF
BRF
LAST_
BIPRED
LAST_
BIPRED
LAST_
BIPRED
BRF BRF
LF
● Upto 6 reference frames per block (VP9 specified 3)
AV1 Inter Prediction
Overlapped Block Motion Compensation
● Use neighbors’ predictors to refine the prediction of the current block
● Potential benefits:
○ Better prediction near to the block boundary
○ Smoother predictions / residues
● For each neighboring inter block:
○ Build a predictor, p2
(x, y), using its MV
○ Combine p0
(x, y) and p2
(x, y): pob
(x, y) = w0
(x, y)p0
(x, y) + w2
(x, y)p2
(x, y)
■ Where w0
and w2
are defined by a 1-D raised cosine function
COMPLEX!
AV1 Inter Prediction
Global & Warped Motion
● Global Motion:
○ GM parameters computed for each reference frame using feature matching + RANSAC
○ Supported motion models (& DoF):
■ Translation (2), similarity (4), affine (6), homographic (8)
● Warping:
○ Implemented as 2 shears:
○ Using 8-tap filters
● Motion parameters coded in the bitstream
Combined GM + WM + OBMC
-> 2.0 - 6.0% gain (esp. Handheld camera)
AV1 Inter Prediction
Other Tools
● Guided restoration (filters and upscaling):
○ Downscale before encoding, upscale before updating reference
○ Parameters transmitted in the bitstream (learned Wiener filters)
○ Super-resolution = simple linear upscale + loop-restoration
○ 10-30% rate reduction for super-res
○ ~3% average gain for switchable restoration 7x7 filters
○ simple
Encode
Source
Size
Output
Size
EncodeSize
To Decoder Decode
To
Refs
To
Refs
Superres
Downscale
(non-
normative)
Resize
(optional,
non-normative)
Linear
Upscale
(normat
ive)
Loop
restore
(normat
ive)
Linear
Upscale
(normat
ive)
Loop
restore
(normat
ive)
Deblocking
Deblocking
● Dual filter:
○ Independent selection of horizontal and vertical interpolation filters
● Masked transform:
○ Only transform a portion of a square block
AV1 Inter Prediction
Other Tools
offset
offset
direction
4 Transform types:
DCT, ADST, Flipped (reverse) ADST, Identity
Upto 16 transform combinations with independent horizontal & vertical selection
Reduce available combinations based on block size (for efficiency)
Identity transform useful for screen content coding
Rectangular transforms added:
4x8, 8x4, 8x16, 16x8, 16x32, 32x16
Used only on rectangular partitions of corresponding size
AV1 Transform
Extended Transform Set
Non-linear quantization matrix:
Can also support a linear model
Delta QP values can be signaled at the Superblock level (64x64)
AV1 Quantization
Non-Linear Quantization & Delta QP Signaling
AV1 Hardware IP
Google’s codec HW group developing a free AV1 HW decoder
Target features
Full hardware acceleration
AV1 and/or VP9 decode support
8 and 10 bits per pixel
2160p 60 fps at 600 MHz / core
3 Mgates logic estimated silicon area
AV1 Hardware IP
Licensing
Availability
Early pre-releases available from Q4 / 2017
Final release at the time of AV1 bitstream freeze
Licensing
royalty free license
Includes C and Verilog source code
Modification rights
Thanks!
questions?

More Related Content

Recently uploaded

The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is insideshinachiaurasa2
 
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...masabamasaba
 
tonesoftg
tonesoftgtonesoftg
tonesoftglanshi9
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsArshad QA
 
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfonteinmasabamasaba
 
%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrandmasabamasaba
 
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park %in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park masabamasaba
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisamasabamasaba
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Steffen Staab
 
8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech studentsHimanshiGarg82
 
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...masabamasaba
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️Delhi Call girls
 
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...masabamasaba
 
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...WSO2
 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrainmasabamasaba
 
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfonteinmasabamasaba
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsJhone kinadey
 
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...masabamasaba
 
WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2
 
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital TransformationWSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital TransformationWSO2
 

Recently uploaded (20)

The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
 
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
 
tonesoftg
tonesoftgtonesoftg
tonesoftg
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
 
%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand
 
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park %in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students
 
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
 
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
 
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
 
WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?
 
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital TransformationWSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
 

Featured

PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at WorkGetSmarter
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...DevGAMM Conference
 

Featured (20)

Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
 

AOM - AV1How does it work?

  • 1. AOM - AV1 How does it work? July 2017 Pascal Massimino - Google pascal.massimino@gmail.com
  • 3. Start with existing FANBASES AOM update - Membership AOM: 25+ members and growing fast Most recently joined: IBM, Hulu Still targeting high-profile potential members
  • 4. Start with existing FANBASES AOMedia launched in Q4 2015 AV1 royalty-free video codec in active development Goal: 40-50% gain over VP9/HEVC Rapid hardware adoption AOM update - AV1
  • 5. Start with existing FANBASES Recent Developments Successful 2-day Codec WG face-to-face in February Attendance from 70+ engineers from member companies AV1 currently delivering 25-35% gains across resolutions 50+ tools currently under consideration or fully adopted
  • 6. Planning Towards Launch Goal to achieve: ● Soft freeze of tools in Q3 2017 ● Final bitstream freeze at EOY 2017 Dependencies on: ● Quality gains ● Completion of IP review ● Final approval of AOM Board
  • 7. Start with existing FANBASES AV1 Adoption YouTube will begin using AV1 immediately Netflix will be an “early adopter” (video) AV1 ecosystem (testing tools + SW optimizations) gearing up vital: test streams!
  • 9. Introduction AV1 Development model ● Tools are coded as experiment, under a flag (off by default) ● Eng review (~weekly) ● IP review (including external review) ● -> decision: go/no-go. ● Go: Flag is removed -> on by default. ● No-go: code is removed AV1 is not compatible with VP9, or any prior ‘VP10’. It is an open source project: Code: https://aomedia.googlesource.com/ (and Issue tracker)
  • 10. Video compression overview Best Predictor Inter/Intra Source block Prediction Residue T Q Entropy coding of coefficients Restoration Info Entropy coding of predictor How to update Ref Buffers CompressedBitstream Loss introduced SI Deblocking Info Level AV1: just more of these
  • 11. AV1: just more of these Some experiments
  • 12. ● Allows coding of multi symbol alphabets: ○ 15-bit probabilities ○ Alphabet size up to 16 symbols ● Raw (equiprobable) bits: ○ Encoder writes from end of buffer & merges at finalize: ● One buffer per frame or tile (longer explanation in video) AV1 Entropy Coding Multi-symbol Arithmetic Range Coder*
  • 13. ● Binary syntax elements combined into non-binary symbols: ○ Reduces symbol throughput in the entropy coder ○ Allows (non-boolean) per-symbol probability update ● Reduced signaling overhead: ○ Explicit probability signaling not required, reduced header size ○ Better performance at lower data rates ○ Reduced overhead in error-resilient case AV1 Entropy Coding Symbol Adaptive Coding
  • 14. Maximum coding block size 64x64 (a “Superblock”) Recursive sub-partitioning down to 4x4 (luma) Coding parameters maintained at 4x4 level Prediction: Down to 2x2 for Chroma YUV 4:2:0 inter Single reference frame at 4x4, for all YUV planes Transform: Minimum transform size 4x4 for intra- & inter-prediction AV1 Block Structure Dynamic partitioning
  • 15. AV1 Block Structure Prediction: Block Recursive Partitioning "Superblock" 64x64 32x64 Vert 4x4 64x64 64x32 Horiz 32x32
  • 16. prediction angle = nominal_angle + (angle_delta * angle_step) nominal_angle ∊ {45, 63, 90, 117, 135, 153, 180, 207} degrees Method: ○ Back-project P(r,c) to reference pixel along selected direction ○ Apply 2-tap linear interpolation filter with 1/256th pixel precision ○ LUT tabulates dy corresponding to each possible dx in each available angle 90 entries Each is a 16-bit integer, including 8-bits of precision AV1 Intra Prediction Generic Directional Predictor angle_step angle_delta Total number of angles 3 [-3, +3] 8 * 7 = 56
  • 17. Example for a 4x4 block: Lj = Left, Ti = Top, TL = Top-left neighbors closest to Pi,j Paeth Predictor: One of Lj , Ti , TL closest in value to Lj + Ti - TL Well-defined tie-break order: Lj, Ti, TL AV1 Intra Prediction Paeth Predictor (replaces TM-pred) TL Ti Lj Pi,j
  • 18. ● Example for a 4x4 block: ○ Lj = Left, Ti = Top, Rj = Right, Bi = Bottom, TR = Top-right, BL = Bottom Left neighbors outside of block closest to Pi,j ● Smooth Predictor: ○ Assume Rj = TR, Bi = BL ○ Pi,j is a weighted combination of Li , Rj , Ti , Bi ○ Weights equivalent to quadratic interpolation ○ Useful for blocks with smooth gradient Variants: SMOOTH_H_PREV , SMOOTH_V_PRED (0.5% intra BDRate improvement) AV1 Intra Prediction Smooth Predictor Ti TR Lj Pi,j Rj BL Bi
  • 19. ● Palette sizes of 2 to 8 ‘base’ color entries supported ● Separate palettes for Y, U, V planes ● Palette mode available only when block size >= 8x8: ● Palette size (N) and the N ‘base’ entries are encoded in the bitstream ● A palette index is encoded, using a context model, for each pixel in the block ● Pixels predicted in ‘wavefront’ order to allow parallel computation AV1 Intra Prediction Palette Predictor - For Screen Content
  • 20. ● Specify the index of a MV from a list of MVs computed from neighbors ○ Examine MVs at the 8x8 block level ● Rank the MVs in the list by: ○ Distance from current block ○ Amount of overlap with current block ● Encode the index of the selected MV mv4 mv5 mv5 mv0 mv0 mv1 mv1 mv2 mv5 mv2 mv6 mv3 mv5 mv1 AV1 Inter Prediction Dynamic Reference Motion Vector Prediction (REFMV) Category 1 Category 2 mv1, mv0, mv2, mv3 mv5, mv4, mv6
  • 21. AV1 Inter Prediction More Reference frames Display Order 1 2 5 4 6 8 7 9 3 ARF KF/ GF BRF LAST_ BIPRED LAST_ BIPRED LAST_ BIPRED BRF BRF LF ● Upto 6 reference frames per block (VP9 specified 3)
  • 22. AV1 Inter Prediction Overlapped Block Motion Compensation ● Use neighbors’ predictors to refine the prediction of the current block ● Potential benefits: ○ Better prediction near to the block boundary ○ Smoother predictions / residues ● For each neighboring inter block: ○ Build a predictor, p2 (x, y), using its MV ○ Combine p0 (x, y) and p2 (x, y): pob (x, y) = w0 (x, y)p0 (x, y) + w2 (x, y)p2 (x, y) ■ Where w0 and w2 are defined by a 1-D raised cosine function COMPLEX!
  • 23. AV1 Inter Prediction Global & Warped Motion ● Global Motion: ○ GM parameters computed for each reference frame using feature matching + RANSAC ○ Supported motion models (& DoF): ■ Translation (2), similarity (4), affine (6), homographic (8) ● Warping: ○ Implemented as 2 shears: ○ Using 8-tap filters ● Motion parameters coded in the bitstream Combined GM + WM + OBMC -> 2.0 - 6.0% gain (esp. Handheld camera)
  • 24. AV1 Inter Prediction Other Tools ● Guided restoration (filters and upscaling): ○ Downscale before encoding, upscale before updating reference ○ Parameters transmitted in the bitstream (learned Wiener filters) ○ Super-resolution = simple linear upscale + loop-restoration ○ 10-30% rate reduction for super-res ○ ~3% average gain for switchable restoration 7x7 filters ○ simple Encode Source Size Output Size EncodeSize To Decoder Decode To Refs To Refs Superres Downscale (non- normative) Resize (optional, non-normative) Linear Upscale (normat ive) Loop restore (normat ive) Linear Upscale (normat ive) Loop restore (normat ive) Deblocking Deblocking
  • 25. ● Dual filter: ○ Independent selection of horizontal and vertical interpolation filters ● Masked transform: ○ Only transform a portion of a square block AV1 Inter Prediction Other Tools offset offset direction
  • 26. 4 Transform types: DCT, ADST, Flipped (reverse) ADST, Identity Upto 16 transform combinations with independent horizontal & vertical selection Reduce available combinations based on block size (for efficiency) Identity transform useful for screen content coding Rectangular transforms added: 4x8, 8x4, 8x16, 16x8, 16x32, 32x16 Used only on rectangular partitions of corresponding size AV1 Transform Extended Transform Set
  • 27. Non-linear quantization matrix: Can also support a linear model Delta QP values can be signaled at the Superblock level (64x64) AV1 Quantization Non-Linear Quantization & Delta QP Signaling
  • 28. AV1 Hardware IP Google’s codec HW group developing a free AV1 HW decoder Target features Full hardware acceleration AV1 and/or VP9 decode support 8 and 10 bits per pixel 2160p 60 fps at 600 MHz / core 3 Mgates logic estimated silicon area
  • 29. AV1 Hardware IP Licensing Availability Early pre-releases available from Q4 / 2017 Final release at the time of AV1 bitstream freeze Licensing royalty free license Includes C and Verilog source code Modification rights