Are Video Codecs... Done?
derekb@vimeo.com / derek@videolan.org
@daemon404
Derek Buitenhuis
13 October 2022
San Francisco, USA
Who’s this?
• Senior Principal Video Engineer @ Vimeo
• Open source developer (FFmpeg, FFMS2, rav1e, obuparse, etc.)
• VideoLAN non-profit board member
• Involved in codecs, containers, DSP, etc. since 2004.
• Professional Twitter Sh*tposter
Context / Biases
• Vimeo is:
• Large but not FAANG-sized. Scale, but not FAANG-level cash.
• User Generated Content (Long Tail, Large Corpus, Bad Input, etc.)
• VOD, Live, OTT, etc.
• Primarily familiar with western companies.
• I am cognizant of the use of HEVC, etc. in e.g. China – but that’s not a sector I have much
connection to, or who publish cost numbers, at least in English.
• The patent situation there can be “interesting”.
• I would love to learn more and fill in gaps in my understanding about use in these
markets.
Part One:
The Practicalities of Deployment
First: A Confession
• 10 years ago, I was a HEVC True Believer.
• The H.264 good times were in full swing.
• Why wouldn’t the next MPEG codec be just as great of a success as H.264?
• Surely we’d have people chomping at the bit to write “the x264 of HEVC”.
• MPEG-LA definitely wouldn’t do something silly like mess with royalties.
• VP8 (and WebM) looked to be dead on arrival.
• Spec? Who needs that? Not Google.
• Only danger was Chrome announced in 2011 that they’d remove support for H.264.
• I think they’re getting close, probably just needs a final code review.
• Boy, was I wrong.
It’s 2022: Why have people actually deployed HEVC?
• Specifically non-megacorps.
• A few main reasons:
• Apple and Dolby.
• HDR:
• Many devices require HEVC for any HDR support at all, even types which are
not codec-specific (i.e. not Dolby Vision).
• Not all of us can be YouTube and get special codec support for HDR put in our
favorite OSes.
• Some cheaper devices decided not to get better H.264 chipsets, so simply required HEVC
for >1080p support (e.g. Roku).
• They worked on HEVC and their employer has patents on it.
• Notably not included here: lower bandwidth and storage costs.
Anecdotally: It is not saving money for those who have.
• In addition to HEVC being just generally more intensive to encode, due to the reasons on the
previous slide, this is also when many of us made the jump to 10-bit, making it even more costly.
• We also need to remember: We are now extremely spoiled with H.264 encoding speeds and
costs 10-15 years later, and many finance departments now take this for granted – slower codecs
are a hard sell.
• While I can’t share exact numbers, I can say we definitely aren’t saving money by using HEVC.
• It’s treated as a feature, not a cost savings thing.
• Still mainly only Apple devices are consuming it.
• Slower encodes, even chunked, are worse at using spot instances.
• Even ingesting HEVC (and HDR – hooray tone mapping) is slower!
• When the iPhone started doing HDR HEVC by default, we and industry colleagues noted we
could really see the cost increases tick up starting right after launch!
VP{8,9}: Eh…
• As before: VP8 is irrelevant to most of us, aside from GIF hosts.
• VP9 may not have had the patent issues of HEVC (more on that later), but saw, from my
observation, even less adoption.
• Not just a new codec, also required whole new tooling and infra for WebM.
• ‘WebM DASH’ was a gross hack that abused Matroska cues.
• libvpx was terrible… especially at rate control. Now it is merely bad.
• I am convinced Google just doesn’t get what non-Google orgs need to drive creation-side adoption.
• Either that or they don’t care, as long as it saves YouTube money.
• Was mostly only embraced by big corps like Google, and much later, Netflix.
• Netflix waited for MP4 support, I think?
• Google had to make its own hardware to make it truly economical.
So who do / don’t slower, better codecs work for?
• General: Patent liability / risk tolerance is too high for HEVC for SMB.
• Small corpus, high consumption:
• Netflix
• High bandwidth websites like fashion, where video is not the main focus.
• UGC – only at megacorp scale
• The long tail is only really economical with Google-level “we made our own hardware” investment.
• Off the shelf hardware encoders aren’t economical or scalable past SMB-level.
• Live:
• Unclear.
• Twitch trialed it but not widely deployed.
• Most YouTube livestreams are still H.264.
• Can be done, but either at a cost, or without sufficient bandwidth savings vs H.264.
The Future
• AV1:
• YouTube. Duh.
• Hot take: Real bandwidth returns on cost are diminishing vs compute, and much like audio codecs: only
telcos and the big bois care about the amount shaved off.
• Real use case – Cisco:
• Replaced H.264, not HEVC or VP9.
• Very small subset of tools.
• Could be a way forward for next gen codecs? No one seems to be targeting this.
• VVC:
• Hotter take: Nobody cares about VVC except people involved in creating it (MPEG companies).
• Patents. “Let’s just wait and see” approach.
• I’ve heard rumors some Indian vendors have deployed it to get The Smallest Files.
• I would love to know the cost and patent situation.
The Result
H.264 is the JPEG of video.
Part Two:
Development and Deep Understanding
Back in my day…
• When I started in video, MPEG-2, MPEG-4 ASP, and H.264 were all relevant.
• A sole person could learn every bit of these, and implement them themselves,
reasonably quickly.
• Reasonably-ish well documented. Sort of. Kind of… Well. OK.
• You could apply your newly gained knowledge to current problems.
• Tools in these codecs were less intertwined.
• Does anyone do this nowadays (besides Ronald)?
• All literature online (DIY, DSP intros, etc.) hasn’t really moved past these, even
in 2022.
Current Year Argument
• Houses built on top of other houses built on top of other houses. Turtles all the way down.
• Only the Old Gods (Yuriy, Gary, etc.) remember why things were done.
• At 32, I am considered “young” in the field.
• MPEG: In theory public, but good luck finding the confusingly named ZIP file with the right doc.
• AOM: Alliance for non-open documents.
• Not only that, but the original “how did we get to X” stuff was never well documented to begin with.
• KLT->DCT? Zigzag? YCbCr (luma illuminants, etc). 33 degrees YIQ rotation? Hope you like
scanned documents.
• This sort of deeper understanding is important – if we don’t understand why then how can we
effectively apply it?
• Only going to get worse with AI.
• The coding tools are too numerous to feasibly be implemented and understood by one person
in a sane amount of time.
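To make the "Zigzag?" item above concrete, here is a minimal sketch of the JPEG-style zigzag scan, which orders a block's transform coefficients roughly from low to high frequency so that trailing zeros cluster together. The function name `zigzag_order` is hypothetical, and later codecs use their own scan patterns, so treat this as an illustration rather than any particular codec's table.

```python
def zigzag_order(n: int = 8) -> list[tuple[int, int]]:
    """Return (row, col) positions of an n x n block in zigzag scan order."""
    order = []
    for s in range(2 * n - 1):  # s = row + col identifies one anti-diagonal
        rows = range(max(0, s - n + 1), min(s, n - 1) + 1)
        # Even diagonals are walked from bottom-left to top-right,
        # odd diagonals from top-right to bottom-left.
        if s % 2 == 0:
            rows = reversed(rows)
        order.extend((r, s - r) for r in rows)
    return order


if __name__ == "__main__":
    # Print the first few scan positions of an 8x8 block.
    print(zigzag_order()[:6])
```

Generating the order from the diagonal index, instead of hard-coding a 64-entry table, is only a readability choice for this sketch.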
Current Year Argument + 1
• Aside: Following WGs, it’s all very boring and iterative
• More angles, bigger / different shaped / better structured “blocks”, non-binary arithmetic coding.
• Have we reached the limits of Novel New Tools on hybrid block-based video codecs?
• This doesn’t even touch on things like RDO, quantization strategies (deadzones, trellis), etc.
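For readers unfamiliar with the "deadzones" mentioned above, a minimal sketch of a scalar quantizer with a dead zone follows. The names `quantize_deadzone` and `dequantize`, and the default rounding offset of 1/3, are assumptions for illustration; real encoders pick offsets per block type and often adapt them.

```python
def quantize_deadzone(coeff: float, step: float, rounding: float = 1 / 3) -> int:
    """Quantize one coefficient with a dead zone around zero.

    rounding=0.5 gives plain round-to-nearest; smaller values widen the
    interval that maps to zero, trading a little distortion for fewer bits.
    """
    sign = -1 if coeff < 0 else 1
    return sign * int(abs(coeff) / step + rounding)


def dequantize(level: int, step: float) -> float:
    """Reconstruct the coefficient value from its quantized level."""
    return level * step


if __name__ == "__main__":
    for c in (5.0, -6.0, -17.0):
        print(c, quantize_deadzone(c, 10.0, 0.5), quantize_deadzone(c, 10.0))
```

Trellis quantization goes further by choosing levels jointly across a block based on rate-distortion cost, which this sketch does not attempt.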
Result
• MASSIVE barrier to entry, let alone a deep understanding.
• A lot of mediocre multimedia engineers due to not having a background on why/how.
• A lot of repeating things from the 90s/80s/70s.
• Multi-symbol arithmetic coding… also known as “normal arithmetic coding”.
• Wavelets. Again. Still.
• A lot of smart people who are specialized in e.g. transforms, or color spaces, etc. but lack the full
picture view on how tools interact with other tools.
• None of this is taught well in uni or literature – we all learned it by smashing our heads on desks for
10 years. What is taught as “DSP” is only foundational, and poorly taught at that. Lecturers often lack
deep understanding.
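As a rough illustration of the "normal" multi-symbol arithmetic coding mentioned above, the sketch below narrows an interval once per symbol using exact fractions. It is a textbook toy under stated assumptions: a static frequency table, unbounded precision, and hypothetical helper names (`build_intervals`, `encode`, `decode`). Production coders use fixed-precision integers, renormalization, and adaptive probabilities.

```python
from fractions import Fraction


def build_intervals(freqs: dict[str, int]) -> dict[str, tuple[Fraction, Fraction]]:
    """Map each symbol to (start, size) of its slice of [0, 1)."""
    total = sum(freqs.values())
    intervals, cum = {}, 0
    for sym, f in freqs.items():
        intervals[sym] = (Fraction(cum, total), Fraction(f, total))
        cum += f
    return intervals


def encode(message: str, intervals: dict[str, tuple[Fraction, Fraction]]) -> Fraction:
    """Narrow [low, low + width) once per symbol; return the final low end."""
    low, width = Fraction(0), Fraction(1)
    for sym in message:
        start, size = intervals[sym]
        low += width * start
        width *= size
    return low  # any value in [low, low + width) identifies the message


def decode(value: Fraction, length: int,
           intervals: dict[str, tuple[Fraction, Fraction]]) -> str:
    """Recover `length` symbols by repeatedly locating and rescaling `value`."""
    out = []
    for _ in range(length):
        for sym, (start, size) in intervals.items():
            if start <= value < start + size:
                out.append(sym)
                value = (value - start) / size
                break
    return "".join(out)


if __name__ == "__main__":
    iv = build_intervals({"a": 5, "b": 2, "c": 1})
    code = encode("abacab", iv)
    print(code, decode(code, 6, iv))
```

A binary arithmetic coder is the special case with a two-symbol alphabet, which is why the multi-symbol form is the older, more general one.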
So how do we fix this?
Economics
• I don’t actually have a solution. If I said I did, I would be a snake oil salesman.
• We may have to make peace with less profits (omg) in order to invest in a better user experience.
• AI in encoders choosing tools? They’re all currently slower aside from a few. Jury’s out.
• Specialized video SIMD instructions (more scalable than ASICs – e.g. one per CPU QuickSync).
• YOU tell ME! I want to be wrong and dumb.
• Please don’t include a sales pitch.
Understanding / New Blood
• I would do unspeakable things for a large tome of All Historical Video Info written by the Old Gods.
• Current literature is fragmented, scanned from 1970s papers, lacking, missing, or undiscoverable.
• Better and deeper uni education? Seems unlikely.
• If codecs are “done”, does it matter? Will codecs become the new COBOL?
• Hey at least I can make a bomb consulting when I’m 70.
• We haven’t even solved the COBOL problem, though.
I want to hear from you after this talk!

More Related Content

Similar to Are Video Codecs... Done?

HD Video Essay (final)
HD Video  Essay (final)HD Video  Essay (final)
HD Video Essay (final)j.jones
 
Kevinjohn Gallagher's: Emperors new clothes (WordUp Glasgow 2012)
Kevinjohn Gallagher's: Emperors new clothes (WordUp Glasgow 2012)Kevinjohn Gallagher's: Emperors new clothes (WordUp Glasgow 2012)
Kevinjohn Gallagher's: Emperors new clothes (WordUp Glasgow 2012)kevinjohngallagher
 
Fredieu shall we have a future
Fredieu shall we have a futureFredieu shall we have a future
Fredieu shall we have a futureObsidian Software
 
IP Creators & Users Group Description
IP Creators & Users Group DescriptionIP Creators & Users Group Description
IP Creators & Users Group Descriptionedesigner25
 
CloudFoundry-summit-2015-a-look-back
CloudFoundry-summit-2015-a-look-backCloudFoundry-summit-2015-a-look-back
CloudFoundry-summit-2015-a-look-backKrishna-Kumar
 
Everyone wants (someone else) to do it: writing documentation for open source...
Everyone wants (someone else) to do it: writing documentation for open source...Everyone wants (someone else) to do it: writing documentation for open source...
Everyone wants (someone else) to do it: writing documentation for open source...Jody Garnett
 
Emperors new clothes_digitalbarn_output_snakk
Emperors new clothes_digitalbarn_output_snakkEmperors new clothes_digitalbarn_output_snakk
Emperors new clothes_digitalbarn_output_snakkkevinjohngallagher
 
Emperors new clothes - digitalbarn2012
Emperors new clothes - digitalbarn2012Emperors new clothes - digitalbarn2012
Emperors new clothes - digitalbarn2012kevinjohngallagher
 
Microservices to FastData in the Enterprise with Spring: John Davies at Sprin...
Microservices to FastData in the Enterprise with Spring: John Davies at Sprin...Microservices to FastData in the Enterprise with Spring: John Davies at Sprin...
Microservices to FastData in the Enterprise with Spring: John Davies at Sprin...C24 Technologies
 
Prepare yourself to switch computing to Open Hardware Power Architecture
Prepare yourself to switch computing to Open Hardware Power ArchitecturePrepare yourself to switch computing to Open Hardware Power Architecture
Prepare yourself to switch computing to Open Hardware Power ArchitectureRoberto Innocenti
 
FFMUC presents #ffmeet - #virtualUKNOF
FFMUC presents #ffmeet - #virtualUKNOFFFMUC presents #ffmeet - #virtualUKNOF
FFMUC presents #ffmeet - #virtualUKNOFAnnika Wickert
 
EOLE / OWF 12 - The affero gp lv3. why it exists and who it's for - bradley m...
EOLE / OWF 12 - The affero gp lv3. why it exists and who it's for - bradley m...EOLE / OWF 12 - The affero gp lv3. why it exists and who it's for - bradley m...
EOLE / OWF 12 - The affero gp lv3. why it exists and who it's for - bradley m...Paris Open Source Summit
 
Nitobi/PhoneGap at Bootup 2011
Nitobi/PhoneGap at Bootup 2011Nitobi/PhoneGap at Bootup 2011
Nitobi/PhoneGap at Bootup 2011Brian LeRoux
 

Similar to Are Video Codecs... Done? (20)

Feature satip4
Feature satip4Feature satip4
Feature satip4
 
HD Video Essay (final)
HD Video  Essay (final)HD Video  Essay (final)
HD Video Essay (final)
 
Feature satip4
Feature satip4Feature satip4
Feature satip4
 
Kevinjohn Gallagher's: Emperors new clothes (WordUp Glasgow 2012)
Kevinjohn Gallagher's: Emperors new clothes (WordUp Glasgow 2012)Kevinjohn Gallagher's: Emperors new clothes (WordUp Glasgow 2012)
Kevinjohn Gallagher's: Emperors new clothes (WordUp Glasgow 2012)
 
Fredieu shall we have a future
Fredieu shall we have a futureFredieu shall we have a future
Fredieu shall we have a future
 
IP Creators & Users Group Description
IP Creators & Users Group DescriptionIP Creators & Users Group Description
IP Creators & Users Group Description
 
CloudFoundry-summit-2015-a-look-back
CloudFoundry-summit-2015-a-look-backCloudFoundry-summit-2015-a-look-back
CloudFoundry-summit-2015-a-look-back
 
Everyone wants (someone else) to do it: writing documentation for open source...
Everyone wants (someone else) to do it: writing documentation for open source...Everyone wants (someone else) to do it: writing documentation for open source...
Everyone wants (someone else) to do it: writing documentation for open source...
 
Feature satip4
Feature satip4Feature satip4
Feature satip4
 
Articulo
ArticuloArticulo
Articulo
 
Feature satip4
Feature satip4Feature satip4
Feature satip4
 
Emperors new clothes_digitalbarn_output_snakk
Emperors new clothes_digitalbarn_output_snakkEmperors new clothes_digitalbarn_output_snakk
Emperors new clothes_digitalbarn_output_snakk
 
Emperors new clothes - digitalbarn2012
Emperors new clothes - digitalbarn2012Emperors new clothes - digitalbarn2012
Emperors new clothes - digitalbarn2012
 
Microservices to FastData in the Enterprise with Spring: John Davies at Sprin...
Microservices to FastData in the Enterprise with Spring: John Davies at Sprin...Microservices to FastData in the Enterprise with Spring: John Davies at Sprin...
Microservices to FastData in the Enterprise with Spring: John Davies at Sprin...
 
Prepare yourself to switch computing to Open Hardware Power Architecture
Prepare yourself to switch computing to Open Hardware Power ArchitecturePrepare yourself to switch computing to Open Hardware Power Architecture
Prepare yourself to switch computing to Open Hardware Power Architecture
 
UPNext
UPNextUPNext
UPNext
 
FFMUC presents #ffmeet - #virtualUKNOF
FFMUC presents #ffmeet - #virtualUKNOFFFMUC presents #ffmeet - #virtualUKNOF
FFMUC presents #ffmeet - #virtualUKNOF
 
EOLE / OWF 12 - The affero gp lv3. why it exists and who it's for - bradley m...
EOLE / OWF 12 - The affero gp lv3. why it exists and who it's for - bradley m...EOLE / OWF 12 - The affero gp lv3. why it exists and who it's for - bradley m...
EOLE / OWF 12 - The affero gp lv3. why it exists and who it's for - bradley m...
 
Feature satip4
Feature satip4Feature satip4
Feature satip4
 
Nitobi/PhoneGap at Bootup 2011
Nitobi/PhoneGap at Bootup 2011Nitobi/PhoneGap at Bootup 2011
Nitobi/PhoneGap at Bootup 2011
 

More from Derek Buitenhuis

Colorspace: Useful For More Than Just Color? - SF Video Tech Meetup - 27 May ...
Colorspace: Useful For More Than Just Color? - SF Video Tech Meetup - 27 May ...Colorspace: Useful For More Than Just Color? - SF Video Tech Meetup - 27 May ...
Colorspace: Useful For More Than Just Color? - SF Video Tech Meetup - 27 May ...Derek Buitenhuis
 
A Progressive Approach to the Past: Ensuring Backwards Compatability Through ...
A Progressive Approach to the Past: Ensuring Backwards Compatability Through ...A Progressive Approach to the Past: Ensuring Backwards Compatability Through ...
A Progressive Approach to the Past: Ensuring Backwards Compatability Through ...Derek Buitenhuis
 
I Wrote an FFV1 Decoder in Go for Fun: What I Learned Going from Spec to Impl...
I Wrote an FFV1 Decoder in Go for Fun: What I Learned Going from Spec to Impl...I Wrote an FFV1 Decoder in Go for Fun: What I Learned Going from Spec to Impl...
I Wrote an FFV1 Decoder in Go for Fun: What I Learned Going from Spec to Impl...Derek Buitenhuis
 
Let's Be HAV1ng You - London Video Tech October 2019
Let's Be HAV1ng You - London Video Tech October 2019Let's Be HAV1ng You - London Video Tech October 2019
Let's Be HAV1ng You - London Video Tech October 2019Derek Buitenhuis
 
Let's Write a JPEG Decoder (Vimeo Lunch Talks)
Let's Write a JPEG Decoder (Vimeo Lunch Talks)Let's Write a JPEG Decoder (Vimeo Lunch Talks)
Let's Write a JPEG Decoder (Vimeo Lunch Talks)Derek Buitenhuis
 
FFMS2: Indexing, Edge Cases, and Insanity
FFMS2: Indexing, Edge Cases, and InsanityFFMS2: Indexing, Edge Cases, and Insanity
FFMS2: Indexing, Edge Cases, and InsanityDerek Buitenhuis
 
Every Solution is Wrong: Normalizing Ambiguous, Broken, and Pants-on-Head Cra...
Every Solution is Wrong: Normalizing Ambiguous, Broken, and Pants-on-Head Cra...Every Solution is Wrong: Normalizing Ambiguous, Broken, and Pants-on-Head Cra...
Every Solution is Wrong: Normalizing Ambiguous, Broken, and Pants-on-Head Cra...Derek Buitenhuis
 
Multimedia Buzzword Bingo: Translating to English
 Multimedia Buzzword Bingo: Translating to English Multimedia Buzzword Bingo: Translating to English
Multimedia Buzzword Bingo: Translating to EnglishDerek Buitenhuis
 
Things Developers Believe About Video Files (Proven Wrong by User Uploads)
Things Developers Believe About Video Files (Proven Wrong by User Uploads)Things Developers Believe About Video Files (Proven Wrong by User Uploads)
Things Developers Believe About Video Files (Proven Wrong by User Uploads)Derek Buitenhuis
 

More from Derek Buitenhuis (10)

Colorspace: Useful For More Than Just Color? - SF Video Tech Meetup - 27 May ...
Colorspace: Useful For More Than Just Color? - SF Video Tech Meetup - 27 May ...Colorspace: Useful For More Than Just Color? - SF Video Tech Meetup - 27 May ...
Colorspace: Useful For More Than Just Color? - SF Video Tech Meetup - 27 May ...
 
A Progressive Approach to the Past: Ensuring Backwards Compatability Through ...
A Progressive Approach to the Past: Ensuring Backwards Compatability Through ...A Progressive Approach to the Past: Ensuring Backwards Compatability Through ...
A Progressive Approach to the Past: Ensuring Backwards Compatability Through ...
 
Opening up Open Source
Opening up Open SourceOpening up Open Source
Opening up Open Source
 
I Wrote an FFV1 Decoder in Go for Fun: What I Learned Going from Spec to Impl...
I Wrote an FFV1 Decoder in Go for Fun: What I Learned Going from Spec to Impl...I Wrote an FFV1 Decoder in Go for Fun: What I Learned Going from Spec to Impl...
I Wrote an FFV1 Decoder in Go for Fun: What I Learned Going from Spec to Impl...
 
Let's Be HAV1ng You - London Video Tech October 2019
Let's Be HAV1ng You - London Video Tech October 2019Let's Be HAV1ng You - London Video Tech October 2019
Let's Be HAV1ng You - London Video Tech October 2019
 
Let's Write a JPEG Decoder (Vimeo Lunch Talks)
Let's Write a JPEG Decoder (Vimeo Lunch Talks)Let's Write a JPEG Decoder (Vimeo Lunch Talks)
Let's Write a JPEG Decoder (Vimeo Lunch Talks)
 
FFMS2: Indexing, Edge Cases, and Insanity
FFMS2: Indexing, Edge Cases, and InsanityFFMS2: Indexing, Edge Cases, and Insanity
FFMS2: Indexing, Edge Cases, and Insanity
 
Every Solution is Wrong: Normalizing Ambiguous, Broken, and Pants-on-Head Cra...
Every Solution is Wrong: Normalizing Ambiguous, Broken, and Pants-on-Head Cra...Every Solution is Wrong: Normalizing Ambiguous, Broken, and Pants-on-Head Cra...
Every Solution is Wrong: Normalizing Ambiguous, Broken, and Pants-on-Head Cra...
 
Multimedia Buzzword Bingo: Translating to English
 Multimedia Buzzword Bingo: Translating to English Multimedia Buzzword Bingo: Translating to English
Multimedia Buzzword Bingo: Translating to English
 
Things Developers Believe About Video Files (Proven Wrong by User Uploads)
Things Developers Believe About Video Files (Proven Wrong by User Uploads)Things Developers Believe About Video Files (Proven Wrong by User Uploads)
Things Developers Believe About Video Files (Proven Wrong by User Uploads)
 

Recently uploaded

08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 

Recently uploaded (20)

08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 

Are Video Codecs... Done?

  • 1. Are Video Codecs... Done? derekb@vimeo.com / derek@videolan.org @daemon404 Derek Buitenhuis 13 October 2022 San Francisco, USA
  • 2. Who’s this? • Senior Principal Video Engineer @ Vimeo • Open source developer (FFmpeg, FFMS2, rav1e, obuparse, etc.) • VideoLAN non-profit board member • Involved in codecs, containers, DSP, etc. since 2004. • Professional Twitter Sh*tposter
  • 3. Context / Biases • Vimeo is: • Large but not FAANG-sized. Scale, but not FAANG-level cash. • User Generated Content (Long Tail, Large Corpus, Bad Input, etc.) • VOD, Live, OTT, etc. • Primarily familiar with western companies. • I am cognizant of the use of HEVC, etc. in e.g. China – but that’s not a sector I have much connection to, or who publish cost numbers, at least in English. • The patent situation there can be “interesting”. • I would love to learn more and fill in gaps in my understanding about use in these markets.
  • 4. Part One: The Practicalities of Deployment
  • 5. First: A Confession • 10 years ago, I was a HEVC True Believer. • The H.264 good times were in full swing. • Why wouldn’t the next MPEG codec be just as great of a success as H.264? • Surely we’d have people chomping at the bit to write “the x264 of HEVC”. • MPEG-LA definitely wouldn’t do something silly like mess with royalties. • VP8 (and WebM) looked to be dead on arrival. • Spec? Who needs that? Not Google. • Only danger was Chrome announced in 2011 that they’d remove support for H.264. • I think they’re getting close, probably just needs a final code review. • Boy, was I wrong.
  • 6. It’s 2022: Why have people actually deployed HEVC? • Specifically non-megacorps. • A few main reasons: • Apple and Dolby. • HDR: • Many devices require HEVC for any HDR support at all, even types which are not codec-specific (i.e. not Dolby Vision). • Not all of us can be YouTube and get special codec support for HDR put in our favorite OSes. • Some cheaper devices decided not to get better H.264 chipsets, so simply required HEVC for >1080p support (e.g. Roku). • They worked on HEVC and their employer has patents on it. • Notably not included here: lower bandwidth and storage costs.
  • 7. Anecdotally: It is not saving money for those who have. • In addition to HEVC being just generally more intensive to encode, due to the reasons on the previous slide, this is also when many of us made the jump to 10-bit, making it even more costly. • We also need to remember: We are now extremely spoiled with H.264 encoding speeds and costs 10-15 years later, and many finance departments now take this for granted – slower codecs are a hard sell. • While I can’t share exact numbers, I can say we definitely aren’t saving money by using HEVC. • It’s treated as a feature, not a cost savings thing. • Still mainly only Apple devices are consuming it. • Slower encodes, even chunked, are worse at using spot instances. • Even ingesting HEVC (and HDR – hooray tone mapping) is slower! • When the iPhone started doing HDR HEVC by default, we and industry colleagues noted we could really see the cost increases tick up starting right after launch!
  • 8. VP{8,9}: Eh… • As before: VP8 is irrelevant to most of us, aside from GIF hosts. • VP9 may not have had the patent issues of HEVC (more on that later), but saw, from my observation, even less adoption. • Not just a new codec, also required whole new tooling and infra for WebM. • ‘WebM DASH’ was a gross hack that abused Matroska cues. • libvpx was terrible… especially at rate control. Now it is merely bad. • I am convinced Google just doesn’t get what non-Google orgs need to drive creation-side adoption. • Either that or they don’t care, as long as it saves YouTube money. • Was mostly only embraced by big corps like Google, and much later, Netflix. • Netflix waited for MP4 support, I think? • Google had to make its own hardware to make it truly economical.
  • 9. So who do / don’t slower, better codecs work for? • General: Patent liability / risk tolerance is too high for HEVC for SMB. • Small corpus, high consumption: • Netflix • High bandwidth websites like fashion, where video is not the main focus. • UGC – only at megacorp scale • The long tail is only really economical with Google-level “we made our own hardware” investment. • Off the shelf hardware encoders aren’t economical or scalable past SMB-level. • Live: • Unclear. • Twitch trialed it but not widely deployed. • Most YouTube livestreams are still H.264. • Can be done, but either at a cost, or without sufficient bandwidth savings vs H.264.
  • 10. The Future • AV1: • YouTube. Duh. • Hot take: Real bandwidth returns on cost are diminishing vs compute, and much like audio codecs: only telcos and the big bois care about the amount shaved off. • Real use case – Cisco: • Replaced H.264, not HEVC or VP9. • Very small subset of tools. • Could be a way forward for next gen codecs? No one seems to be targeting this. • VVC: • Hotter take: Nobody cares about VVC except people involved in creating it (MPEG companies). • Patents. “Let’s just wait and see” approach. • I’ve heard rumors some Indian vendors have deployed it to get The Smallest Files. • I would love to know the cost and patent situation.
  • 11. The Result • H.264 is the JPEG of video.
  • 12. Part Two: Development and Deep Understanding
  • 13. Back in my day… • When I started in video, MPEG-2, MPEG-4 ASP, and H.264 were all relevant. • A sole person could learn every bit of these, and implement them themselves, reasonably quickly. • Reasonably-ish well documented. Sort of. Kind of… Well. OK. • You could apply your newly gained knowledge to current problems. • Tools in these codecs were less intertwined. • Does anyone do this nowadays (besides Ronald?) • All literature online (DIY, DSP intros, etc.) hasn’t really moved past these, even in 2022.
  • 14. Current Year Argument • Houses built on top of other houses built on top of other houses. Turtles all the way down. • Only the Old Gods (Yuriy, Gary, etc.) remember why things were done. • At 32, I am considered “young” in the field. • MPEG: In theory public, but good luck finding the confusingly named ZIP file with the right doc. • AOM: Alliance for non-open documents. • Not only that, but the original “how did we get to X” stuff was never well documented to begin with. • KLT->DCT? Zigzag? YCbCr (luma illuminants, etc). 33 degrees YIQ rotation? Hope you like scanned documents. • This sort of deeper understanding is important – if we don’t understand why then how can we effectively apply it? • Only going to get worse with AI. • The coding tools are too numerous to feasibly be implemented and understood by one person in a sane amount of time.
  • 15. Current Year Argument + 1 • Aside: Following WGs, it’s all very boring and iterative • More angles, bigger / different shaped / better structured “blocks”, non-binary arithmetic coding. • Have we reached the limits of Novel New Tools on hybrid block-based video codecs? • This doesn’t even touch on things like RDO, quantization strategies (deadzones, trellis), etc.
  • 16. Result • MASSIVE barrier to entry, let alone a deep understanding. • A lot of mediocre multimedia engineers due to not having a background on why/how. • A lot of repeating things from the 90s/80s/70s. • Multi-symbol arithmetic coding… also known as “normal arithmetic coding”. • Wavelets. Again. Still. • A lot of smart people who are specialized in e.g. transforms, or color spaces, etc. but lack the full picture view on how tools interact with other tools. • None of this is taught well in uni or literature – we all learned it by smashing our heads on desks for 10 years. What is taught as “DSP” is only foundational, and poorly taught at that. Lecturers often lack deep understanding.
  • 17. So how do we fix this?
  • 18. Economics • I don’t actually have a solution. If I said I did, I would be a snake oil salesman. • We may have to make peace with less profits (omg) in order to invest in a better user experience. • AI in encoders choosing tools? They’re all currently slower aside from a few. Jury’s out. • Specialized video SIMD instructions (more scalable than ASICs – e.g. one per CPU QuickSync). • YOU tell ME! I want to be wrong and dumb. • Please don’t include a sales pitch.
  • 19. Understanding / New Blood • I would do unspeakable things for a large tome of All Historical Video Info written by the Old Gods. • Current literature is fragmented, scanned from 1970s papers, lacking, missing, or undiscoverable. • Better and deeper uni education? Seems unlikely. • If codecs are “done”, does it matter? Will codecs become the new COBOL? • Hey at least I can make a bomb consulting when I’m 70. • We haven’t even solved the COBOL problem, though.
  • 20. I want to hear from you after this talk!