SlideShare a Scribd company logo
1 of 21
Download to read offline
Trialling AI to
automate captions
Development - Jacksen Kline
Video Architect - Jeremy Brown
Some content
is still supplied
uncaptioned
VOD content supply
Catch up VOD
✓ Captions supplied reliably, required for broadcast
Live Recorded VOD (L2V)
x Live captions in sent as file much later
Back Catalogue VOD (BCV)
x Content released before broadcast so not required yet
x Digital only content deals often don’t supply captions
● Back Catalogue VOD has a large audience
● BCV has year long shelf life, Live typically a week
● Time to review, clips supplied a week ahead
● Shows are global, differing formats and genres
BCV Back Catalogue
Why is Live special
● News & daily talk shows
● Over 12 hours daily supplied
● Words are enunciated cleary
● Minimum background sound
● Easy to add to existing automated workflow
● All Australian content, consistent accents
Live was determined our preferred
content for automated caption trial
Why we tried?
We determined traditional
captioning partners were
out of our (small digital)
budget and had slower
turn around than planned.
Caption process
POC solution was:
● Cheap, $0.03/min
● Completely
automated
● Fast, ~2x real time
Benefits
What about the others?
Similar costs but more complexities caused
moving large files from our existing host
Google was most accurate translating in testing
Results were accurate,
but did fail in bad ways
“Windsor”
Location names
“Quay”
Homophones
“how the fight”
The Australian accent?
“Turnbull”
The team praised
the technology
But, we had to consider
the likely user reaction
“AI generated captions are amazing!”
“Are they illiterate? This is embarrassing.”
Each translation is scored
“Windsor” - Tow (0.4940) wins (0.9490) a (0.9801) bridge (0.6395)
“Quay” - Circular (1.0000) key (0.3497)
“How the fight” - About (0.9996) hell (0.8830) the (0.9879) fuck (0.8509)
“Turnbull” - Alleged (0.9986) malcolm (0.9986) terrible (0.4550) also (1.0000)
POC Caption
Output
Publish direct if
highly confident
Manual review if
less confident
We are considering
reviewing low scores
How this efficient
review tool could look
Certainly viable, check your
legal obligations for accuracy
before considering a fully
automated workflow
Final thoughts
Speech translation is
improving quickly and
could be be less reliant
on manual intervention
Final thoughts
Thanks
Development - Jacksen Kline
Video Architect - Jeremy Brown

More Related Content

Similar to Trialling AI speech to automate VOD captions

NAB Show 2018 Notes
NAB Show 2018 NotesNAB Show 2018 Notes
NAB Show 2018 NotesYusuke Goto
 
Nitty Gritty of Adaptive Video Transmuxing in JS
Nitty Gritty of Adaptive Video Transmuxing in JSNitty Gritty of Adaptive Video Transmuxing in JS
Nitty Gritty of Adaptive Video Transmuxing in JSDonato Borrello
 
Encoding Video for Microsoft Silverlight
Encoding Video for Microsoft SilverlightEncoding Video for Microsoft Silverlight
Encoding Video for Microsoft Silverlightgoodfriday
 
Intro Audio Description
Intro Audio DescriptionIntro Audio Description
Intro Audio Description3Play Media
 
Digital Cinema Mastering 101
Digital Cinema Mastering 101Digital Cinema Mastering 101
Digital Cinema Mastering 101sundanceinstitute
 
Intro to Audio Description 06-28-2018
Intro to Audio Description 06-28-2018Intro to Audio Description 06-28-2018
Intro to Audio Description 06-28-20183Play Media
 
(SPOT209) Raising the Bar on Video Streaming Quality Using AWS
(SPOT209) Raising the Bar on Video Streaming Quality Using AWS(SPOT209) Raising the Bar on Video Streaming Quality Using AWS
(SPOT209) Raising the Bar on Video Streaming Quality Using AWSAmazon Web Services
 
Digital Cre8or Unit D Lesson 1
Digital Cre8or Unit D Lesson 1Digital Cre8or Unit D Lesson 1
Digital Cre8or Unit D Lesson 1Julian Prior
 
Orchestrating the execution of workflows for media streaming service and even...
Orchestrating the execution of workflows for media streaming service and even...Orchestrating the execution of workflows for media streaming service and even...
Orchestrating the execution of workflows for media streaming service and even...Shuen-Huei Guan
 
Serverless Media Workflow
Serverless Media WorkflowServerless Media Workflow
Serverless Media WorkflowMooYeol Lee
 
Intro to Audio Description 10-04-2018
Intro to Audio Description 10-04-2018Intro to Audio Description 10-04-2018
Intro to Audio Description 10-04-20183Play Media
 
AWS re:Invent 2016: Journeys to the Cloud: Different Experiences in Video (CT...
AWS re:Invent 2016: Journeys to the Cloud: Different Experiences in Video (CT...AWS re:Invent 2016: Journeys to the Cloud: Different Experiences in Video (CT...
AWS re:Invent 2016: Journeys to the Cloud: Different Experiences in Video (CT...Amazon Web Services
 
#NetflixEverywhere Global Architecture
#NetflixEverywhere Global Architecture#NetflixEverywhere Global Architecture
#NetflixEverywhere Global ArchitectureC4Media
 
Intro to Audio Description
Intro to Audio DescriptionIntro to Audio Description
Intro to Audio Description3Play Media
 
Architecting a Video Encoding Strategy Designed For Growth
Architecting a Video Encoding Strategy Designed For GrowthArchitecting a Video Encoding Strategy Designed For Growth
Architecting a Video Encoding Strategy Designed For GrowthZencoder
 
Radvision webinar: Making Real Time Video Work Over The Internet
Radvision webinar: Making Real Time Video Work Over The InternetRadvision webinar: Making Real Time Video Work Over The Internet
Radvision webinar: Making Real Time Video Work Over The InternetRADVISION Ltd.
 
Separate Pasts, Common Futures: Digital film preservation in a broadcast en...
Separate Pasts,  Common Futures: Digital film preservation in a  broadcast en...Separate Pasts,  Common Futures: Digital film preservation in a  broadcast en...
Separate Pasts, Common Futures: Digital film preservation in a broadcast en...Erwin Verbruggen
 
AWS re:Invent 2016: Accelerating the Transition to Broadcast and OTT Infrastr...
AWS re:Invent 2016: Accelerating the Transition to Broadcast and OTT Infrastr...AWS re:Invent 2016: Accelerating the Transition to Broadcast and OTT Infrastr...
AWS re:Invent 2016: Accelerating the Transition to Broadcast and OTT Infrastr...Amazon Web Services
 
[AWS LA Media & Entertainment Event 2015]: Raising the Bar on Video Streaming...
[AWS LA Media & Entertainment Event 2015]: Raising the Bar on Video Streaming...[AWS LA Media & Entertainment Event 2015]: Raising the Bar on Video Streaming...
[AWS LA Media & Entertainment Event 2015]: Raising the Bar on Video Streaming...Amazon Web Services
 

Similar to Trialling AI speech to automate VOD captions (20)

NAB Show 2018 Notes
NAB Show 2018 NotesNAB Show 2018 Notes
NAB Show 2018 Notes
 
Nitty Gritty of Adaptive Video Transmuxing in JS
Nitty Gritty of Adaptive Video Transmuxing in JSNitty Gritty of Adaptive Video Transmuxing in JS
Nitty Gritty of Adaptive Video Transmuxing in JS
 
Encoding Video for Microsoft Silverlight
Encoding Video for Microsoft SilverlightEncoding Video for Microsoft Silverlight
Encoding Video for Microsoft Silverlight
 
Intro Audio Description
Intro Audio DescriptionIntro Audio Description
Intro Audio Description
 
Digital Cinema Mastering 101
Digital Cinema Mastering 101Digital Cinema Mastering 101
Digital Cinema Mastering 101
 
Intro to Audio Description 06-28-2018
Intro to Audio Description 06-28-2018Intro to Audio Description 06-28-2018
Intro to Audio Description 06-28-2018
 
(SPOT209) Raising the Bar on Video Streaming Quality Using AWS
(SPOT209) Raising the Bar on Video Streaming Quality Using AWS(SPOT209) Raising the Bar on Video Streaming Quality Using AWS
(SPOT209) Raising the Bar on Video Streaming Quality Using AWS
 
Digital Cre8or Unit D Lesson 1
Digital Cre8or Unit D Lesson 1Digital Cre8or Unit D Lesson 1
Digital Cre8or Unit D Lesson 1
 
Orchestrating the execution of workflows for media streaming service and even...
Orchestrating the execution of workflows for media streaming service and even...Orchestrating the execution of workflows for media streaming service and even...
Orchestrating the execution of workflows for media streaming service and even...
 
Serverless Media Workflow
Serverless Media WorkflowServerless Media Workflow
Serverless Media Workflow
 
Intro to Audio Description 10-04-2018
Intro to Audio Description 10-04-2018Intro to Audio Description 10-04-2018
Intro to Audio Description 10-04-2018
 
AWS re:Invent 2016: Journeys to the Cloud: Different Experiences in Video (CT...
AWS re:Invent 2016: Journeys to the Cloud: Different Experiences in Video (CT...AWS re:Invent 2016: Journeys to the Cloud: Different Experiences in Video (CT...
AWS re:Invent 2016: Journeys to the Cloud: Different Experiences in Video (CT...
 
#NetflixEverywhere Global Architecture
#NetflixEverywhere Global Architecture#NetflixEverywhere Global Architecture
#NetflixEverywhere Global Architecture
 
Intro to Audio Description
Intro to Audio DescriptionIntro to Audio Description
Intro to Audio Description
 
Architecting a Video Encoding Strategy Designed For Growth
Architecting a Video Encoding Strategy Designed For GrowthArchitecting a Video Encoding Strategy Designed For Growth
Architecting a Video Encoding Strategy Designed For Growth
 
Radvision webinar: Making Real Time Video Work Over The Internet
Radvision webinar: Making Real Time Video Work Over The InternetRadvision webinar: Making Real Time Video Work Over The Internet
Radvision webinar: Making Real Time Video Work Over The Internet
 
Separate Pasts, Common Futures: Digital film preservation in a broadcast en...
Separate Pasts,  Common Futures: Digital film preservation in a  broadcast en...Separate Pasts,  Common Futures: Digital film preservation in a  broadcast en...
Separate Pasts, Common Futures: Digital film preservation in a broadcast en...
 
Database History From Codd to Brewer
Database History From Codd to BrewerDatabase History From Codd to Brewer
Database History From Codd to Brewer
 
AWS re:Invent 2016: Accelerating the Transition to Broadcast and OTT Infrastr...
AWS re:Invent 2016: Accelerating the Transition to Broadcast and OTT Infrastr...AWS re:Invent 2016: Accelerating the Transition to Broadcast and OTT Infrastr...
AWS re:Invent 2016: Accelerating the Transition to Broadcast and OTT Infrastr...
 
[AWS LA Media & Entertainment Event 2015]: Raising the Bar on Video Streaming...
[AWS LA Media & Entertainment Event 2015]: Raising the Bar on Video Streaming...[AWS LA Media & Entertainment Event 2015]: Raising the Bar on Video Streaming...
[AWS LA Media & Entertainment Event 2015]: Raising the Bar on Video Streaming...
 

More from Jeremy Brown

What if a video dev redesigned a serverless API?
What if a video dev redesigned a serverless API?What if a video dev redesigned a serverless API?
What if a video dev redesigned a serverless API?Jeremy Brown
 
Demuxed 2019 Lightning Talk - Multi lingual captions
Demuxed 2019 Lightning Talk - Multi lingual captions Demuxed 2019 Lightning Talk - Multi lingual captions
Demuxed 2019 Lightning Talk - Multi lingual captions Jeremy Brown
 
Three times ml enhanced video workflows for 7plus
Three times ml enhanced video workflows for 7plusThree times ml enhanced video workflows for 7plus
Three times ml enhanced video workflows for 7plusJeremy Brown
 
Optimising video delivery - Brightcove PLAY 2019
Optimising video delivery - Brightcove PLAY 2019Optimising video delivery - Brightcove PLAY 2019
Optimising video delivery - Brightcove PLAY 2019Jeremy Brown
 
Live streaming remote events
Live streaming remote eventsLive streaming remote events
Live streaming remote eventsJeremy Brown
 
Mitigating Ad Loudness - Jeremy Brown
Mitigating Ad Loudness - Jeremy BrownMitigating Ad Loudness - Jeremy Brown
Mitigating Ad Loudness - Jeremy BrownJeremy Brown
 
Better images for video - Jeremy Brown
Better images for video - Jeremy BrownBetter images for video - Jeremy Brown
Better images for video - Jeremy BrownJeremy Brown
 

More from Jeremy Brown (7)

What if a video dev redesigned a serverless API?
What if a video dev redesigned a serverless API?What if a video dev redesigned a serverless API?
What if a video dev redesigned a serverless API?
 
Demuxed 2019 Lightning Talk - Multi lingual captions
Demuxed 2019 Lightning Talk - Multi lingual captions Demuxed 2019 Lightning Talk - Multi lingual captions
Demuxed 2019 Lightning Talk - Multi lingual captions
 
Three times ml enhanced video workflows for 7plus
Three times ml enhanced video workflows for 7plusThree times ml enhanced video workflows for 7plus
Three times ml enhanced video workflows for 7plus
 
Optimising video delivery - Brightcove PLAY 2019
Optimising video delivery - Brightcove PLAY 2019Optimising video delivery - Brightcove PLAY 2019
Optimising video delivery - Brightcove PLAY 2019
 
Live streaming remote events
Live streaming remote eventsLive streaming remote events
Live streaming remote events
 
Mitigating Ad Loudness - Jeremy Brown
Mitigating Ad Loudness - Jeremy BrownMitigating Ad Loudness - Jeremy Brown
Mitigating Ad Loudness - Jeremy Brown
 
Better images for video - Jeremy Brown
Better images for video - Jeremy BrownBetter images for video - Jeremy Brown
Better images for video - Jeremy Brown
 

Recently uploaded

The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideChristina Lin
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number SystemsJheuzeDellosa
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptkotipi9215
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - InfographicHr365.us smith
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfPower Karaoke
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfkalichargn70th171
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationkaushalgiri8080
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackVICTOR MAESTRE RAMIREZ
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningVitsRangannavar
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 

Recently uploaded (20)

The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number Systems
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.ppt
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - Infographic
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdf
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
 
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanation
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learning
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 

Trialling AI speech to automate VOD captions

  • 1. Trialling AI to automate captions Development - Jacksen Kline Video Architect - Jeremy Brown
  • 2. Some content is still supplied uncaptioned
  • 3. VOD content supply Catch up VOD ✓ Captions supplied reliably, required for broadcast Live Recorded VOD (L2V) x Live captions in sent as file much later Back Catalogue VOD (BCV) x Content released before broadcast so not required yet x Digital only content deals often don’t supply captions
  • 4. ● Back Catalogue VOD has a large audience ● BCV has year long shelf life, Live typically a week ● Time to review, clips supplied a week ahead ● Shows are global, differing formats and genres BCV Back Catalogue
  • 5. Why is Live special ● News & daily talk shows ● Over 12 hours daily supplied ● Words are enunciated cleary ● Minimum background sound ● Easy to add to existing automated workflow ● All Australian content, consistent accents Live was determined our preferred content for automated caption trial
  • 6. Why we tried? We determined traditional captioning partners were out of our (small digital) budget and had slower turn around than planned.
  • 8. POC solution was: ● Cheap, $0.03/min ● Completely automated ● Fast, ~2x real time Benefits
  • 9. What about the others? Similar costs but more complexities caused moving large files from our existing host Google was most accurate translating in testing
  • 10. Results were accurate, but did fail in bad ways
  • 13. “how the fight” The Australian accent?
  • 15. The team praised the technology But, we had to consider the likely user reaction “AI generated captions are amazing!” “Are they illiterate? This is embarrassing.”
  • 16. Each translation is scored “Windsor” - Tow (0.4940) wins (0.9490) a (0.9801) bridge (0.6395) “Quay” - Circular (1.0000) key (0.3497) “How the fight” - About (0.9996) hell (0.8830) the (0.9879) fuck (0.8509) “Turnbull” - Alleged (0.9986) malcolm (0.9986) terrible (0.4550) also (1.0000)
  • 17. POC Caption Output Publish direct if highly confident Manual review if less confident We are considering reviewing low scores
  • 18. How this efficient review tool could look
  • 19. Certainly viable, check your legal obligations for accuracy before considering a fully automated workflow Final thoughts
  • 20. Speech translation is improving quickly and could be be less reliant on manual intervention Final thoughts
  • 21. Thanks Development - Jacksen Kline Video Architect - Jeremy Brown