Video Summarization Framework for Newscasts and Reports – Work in Progress
Mikołaj Leszczuk
Michał Grega
Arian Koźbiał
Jarosław Gliwski
Krzysztof Wasieczko
Kamel Smaïli
Introduction
• 300 h of video uploaded to YouTube every min
• Average video length: 4 min & 20 s
• “We live in 140 characters era”
• How to assimilate main ideas carried by video?
• Best way: summarizing information
2017-11-17 2
Database of Video Sequences
• Focusing on summarization of newscasts & reports in our research
• Major news channels, like Euronews, France24, BBC, Russia Today & Al Jazeera
• Data on trending topics (based on Twitter)
• Topics:
  • “Syria”
  • “Real Madrid – FC Barcelona”
  • “Animal rights”
  • “Women’s rights”
  • “Homosexual marriage”
  • “Drug liberalization”
  • “Death sentence”
  • “Occupied territories”
  • “Trump”
Custom Developed Toolkit to Download Necessary Videos
Some Statistics
Video Summarisation Framework
2017-11-17 6
Metadata Extraction Algorithms
• Shot transition detection
• Speech recognition
• Detection of “talking head” shots
• Detection of day and night shots
• Video quality indicators
Shot Transition Detection
• Automated detection of transitions between shots in digital video
• Purpose: temporal segmentation of videos
• Based on PySceneDetect
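A minimal sketch of temporal segmentation, assuming hard cuts can be found by thresholding the mean absolute difference between consecutive grayscale frames. This is not the API of the tool named on the slide; the function names, the frame representation (nested lists of 0-255 values) and the threshold of 30 are illustrative assumptions.

```python
from typing import List, Sequence, Tuple

# Assumption: a grayscale frame is a list of rows of 0-255 pixel values.
Frame = Sequence[Sequence[int]]


def mean_abs_diff(a: Frame, b: Frame) -> float:
    """Mean absolute pixel difference between two equally sized frames."""
    total = 0
    count = 0
    for row_a, row_b in zip(a, b):
        for pa, pb in zip(row_a, row_b):
            total += abs(pa - pb)
            count += 1
    return total / count if count else 0.0


def detect_cuts(frames: List[Frame], threshold: float = 30.0) -> List[int]:
    """Return every index i at which a new shot is assumed to start."""
    return [
        i
        for i in range(1, len(frames))
        if mean_abs_diff(frames[i - 1], frames[i]) > threshold
    ]


def split_into_shots(n_frames: int, cuts: List[int]) -> List[Tuple[int, int]]:
    """Turn cut positions into (first_frame, end_frame_exclusive) shots."""
    bounds = [0] + list(cuts) + [n_frames]
    return [
        (bounds[k], bounds[k + 1])
        for k in range(len(bounds) - 1)
        if bounds[k] < bounds[k + 1]
    ]
```

Gradual transitions (fades, dissolves) would need a different criterion; this sketch only reacts to abrupt changes.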
Speech Recognition
• Automatic recognition & translation of spoken language into text (by computer)
• Each video frame is linked to its transcription and vice versa
• Languages: English, French, Arabic
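The two-way link between frames and transcription can be sketched as follows, assuming the recognizer returns time-stamped text segments and the video has a constant frame rate. The `Segment` type and both function names are hypothetical and are not taken from the slides.

```python
import math
from bisect import bisect_right
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Segment:
    start: float  # seconds, inclusive
    end: float    # seconds, exclusive
    text: str


def segment_for_frame(
    frame_index: int, fps: float, segments: List[Segment]
) -> Optional[Segment]:
    """Frame -> transcription. Segments must be sorted by start time."""
    t = frame_index / fps
    starts = [s.start for s in segments]
    k = bisect_right(starts, t) - 1  # last segment starting at or before t
    if k >= 0 and t < segments[k].end:
        return segments[k]
    return None  # no speech at this frame


def frames_for_segment(segment: Segment, fps: float) -> range:
    """Transcription -> frames whose timestamps fall inside the segment."""
    first = math.ceil(segment.start * fps)
    end_exclusive = math.ceil(segment.end * fps)
    return range(first, end_exclusive)
```

With such a mapping, a shot selected for the summary can carry its own transcript, and a sentence in the transcript can be traced back to the frames in which it is spoken.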
Detection of “Talking Head” Shots (1/2)
• Shot split into frames: Frame 1, Frame 2, …, Frame n
• Per-frame processing: To Grayscale → Histogram Equalization → Face Detection → Mouth Analysis
• Features computed per shot:
  • Percentage of the number of frames with a face relative to the number of frames without a face
  • Percentage of the area of the face relative to the area of the frame
  • Percentage of frames with more than one face
  • Percentage of frames with an open mouth relative to frames with a closed mouth
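The four per-shot features can be aggregated from per-frame results roughly as below. This is a sketch under stated assumptions: `FrameObservation` is a hypothetical container for the output of the face detection and mouth analysis steps, the face area is averaged over frames that contain a face (using the largest face in each), and a ratio with a zero denominator is reported as infinity. None of these choices is given on the slide.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class FrameObservation:
    face_areas: List[float]  # pixel area of each detected face (empty if none)
    mouth_open: bool         # mouth analysis result; ignored if no face


@dataclass
class ShotFeatures:
    face_to_noface: float    # frames with a face / frames without a face
    face_area_share: float   # mean face area / frame area
    multi_face_share: float  # frames with >1 face / all frames in the shot
    open_to_closed: float    # open-mouth frames / closed-mouth frames


def _ratio(num: int, den: int) -> float:
    """Ratio that stays defined when the denominator is zero."""
    if den == 0:
        return float("inf") if num > 0 else 0.0
    return num / den


def shot_features(frames: List[FrameObservation], frame_area: float) -> ShotFeatures:
    with_face = [f for f in frames if f.face_areas]
    n_face = len(with_face)
    n_multi = sum(1 for f in with_face if len(f.face_areas) > 1)
    n_open = sum(1 for f in with_face if f.mouth_open)
    mean_area = (
        sum(max(f.face_areas) for f in with_face) / n_face if n_face else 0.0
    )
    return ShotFeatures(
        face_to_noface=_ratio(n_face, len(frames) - n_face),
        face_area_share=mean_area / frame_area,
        multi_face_share=_ratio(n_multi, len(frames)),
        open_to_closed=_ratio(n_open, n_face - n_open),
    )
```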
Detection of “Talking Head” Shots (2/2)
Start
• Is the ratio of # of frames with face to # of frames w/o face >20%? (T/F)
• Is the ratio of the area of face to the area of frame >3%? (T/F)
• Is the ratio of # of frames with more than 1 face to # of frames in shot <10%? AND Is the ratio of # of frames with open mouth to # of frames with closed mouth >20%? (T/F)
Outcomes: Talking Head, Not Talking Head
Sensitivity: 88%
Specificity: 100%
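The decision flow can be written as a single predicate, assuming that any failed check leads to the Not Talking Head outcome (the extracted flowchart does not show the branch targets, so this is an assumption). The thresholds are the ones on the slide, expressed as fractions; the function names are illustrative. A small helper shows how sensitivity and specificity figures of this kind are computed from labelled shots.

```python
from typing import List, Tuple


def is_talking_head(
    face_to_noface: float,
    face_area_share: float,
    multi_face_share: float,
    open_to_closed: float,
) -> bool:
    """Apply the four threshold checks; all must pass (assumed)."""
    return (
        face_to_noface > 0.20        # frames with face vs. without face
        and face_area_share > 0.03   # face area vs. frame area
        and multi_face_share < 0.10  # frames with more than one face
        and open_to_closed > 0.20    # open-mouth vs. closed-mouth frames
    )


def sensitivity_specificity(
    predicted: List[bool], actual: List[bool]
) -> Tuple[float, float]:
    """Sensitivity = TP / (TP + FN), specificity = TN / (TN + FP)."""
    tp = sum(1 for p, a in zip(predicted, actual) if p and a)
    fn = sum(1 for p, a in zip(predicted, actual) if not p and a)
    tn = sum(1 for p, a in zip(predicted, actual) if not p and not a)
    fp = sum(1 for p, a in zip(predicted, actual) if p and not a)
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return sensitivity, specificity
```

A specificity of 100% with a sensitivity of 88% means the rule produced no false positives on the test material while missing some true talking-head shots.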
Detection of Day and Night Shots
• Based on a neural network
• Tested on >2000 photos
• Accuracy >90%
Video Characteristics Indicators
• Exposure
• Freezing
• Spatial activity
• Temporal activity
• Letter-boxing
• Pillar-boxing
• Blockiness
• Noise
• Slicing
• Block-loss
• Blur
• Contrast
• Flickering
• Blackout
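The slides list the indicators without defining them. As an illustration only, the sketch below flags two of them, freezing and blackout, using simple definitions that are assumptions here: a frame is "frozen" when its mean absolute difference from the previous frame is below a small epsilon, and "black" when its mean luminance is below a threshold. The names and default thresholds are hypothetical.

```python
from typing import List, Sequence, Tuple

# Assumption: a non-empty grayscale frame as rows of 0-255 luminance values.
Frame = Sequence[Sequence[int]]


def mean_luma(frame: Frame) -> float:
    """Average luminance of one frame."""
    pixels = [p for row in frame for p in row]
    return sum(pixels) / len(pixels)


def temporal_activity(prev: Frame, cur: Frame) -> float:
    """Mean absolute pixel difference between consecutive frames."""
    diffs = [abs(a - b) for ra, rb in zip(prev, cur) for a, b in zip(ra, rb)]
    return sum(diffs) / len(diffs)


def flag_frames(
    frames: List[Frame], freeze_eps: float = 0.5, blackout_luma: float = 16.0
) -> List[Tuple[bool, bool]]:
    """Return (frozen, black) flags for every frame."""
    flags = []
    for i, frame in enumerate(frames):
        # The first frame has no predecessor, so it is never marked frozen.
        frozen = i > 0 and temporal_activity(frames[i - 1], frame) < freeze_eps
        black = mean_luma(frame) < blackout_luma
        flags.append((frozen, black))
    return flags
```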
Summarising Videos
Summarisation Recipe
• Shot transition detection
• Speech recognition
• Detection of “talking head” shots
• Detection of day and night shots
• Video quality indicators
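The slides do not say how the recipe weighs these inputs. Purely as an illustration, the sketch below treats a recipe as a user-supplied predicate over per-shot metadata and keeps matching shots in chronological order until a duration budget is used up. `ShotMetadata`, `summarise` and the example recipe are hypothetical and should not be read as the authors' method.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class ShotMetadata:
    start: float         # seconds
    end: float           # seconds
    has_speech: bool     # from speech recognition
    talking_head: bool   # from talking-head detection
    is_day: bool         # from day/night detection
    quality_ok: bool     # from the video quality indicators


# A recipe decides whether a shot is eligible for the summary.
Recipe = Callable[[ShotMetadata], bool]


def summarise(
    shots: List[ShotMetadata], recipe: Recipe, max_duration: float
) -> List[ShotMetadata]:
    """Select eligible shots in time order without exceeding the budget."""
    selected: List[ShotMetadata] = []
    total = 0.0
    for shot in sorted(shots, key=lambda s: s.start):
        length = shot.end - shot.start
        if recipe(shot) and total + length <= max_duration:
            selected.append(shot)
            total += length
    return selected


def speech_with_good_quality(shot: ShotMetadata) -> bool:
    """Example recipe (assumed): spoken content in acceptable quality."""
    return shot.has_speech and shot.quality_ok
```

Keeping the recipe separate from the selection loop makes it possible to try different combinations of the metadata without touching the rest of the pipeline.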
Thank You!
But There’s One More Thing… Summarised Video…
