Unsupervised video summarization framework using keyframe extraction and video skimming

•Download as PPTX, PDF•

0 likes•32 views

Video is one of the robust sources of information and the consumption of online and offline videos has reached an unprecedented level in the last few years. A fundamental challenge of extracting information from videos is a viewer has to go through the complete video to understand the context, as opposed to an image where the viewer can extract information from a single frame. Apart from context understanding, it almost impossible to create a universal summarized video for everyone, as everyone has their own bias of keyframe, e.g; In a soccer game, a coach person might consider those frames which consist of information on player placement, techniques, etc; however, a person with less knowledge about a soccer game, will focus more on frames which consist of goals and score-board.

Technology

2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020)
[Shruti Jadon] [IEEE Member] [FD5][144]
Unsupervised video summarization framework using
keyframe extraction and video skimming

2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020)
[Shruti Jadon] [IEEE Member] [FD5][144]
Video Summarization
Video is one of the robust sources of information and the consumption of online and offline videos has
reached an unprecedented level in the last few years.
Apart from context understanding, it almost impossible to create a universal summarized video for
everyone, as everyone has their own bias of keyframe.
Example: In a soccer game, a coach person might consider those frames which consist of information on
player placement, techniques, etc; however, a person with less knowledge about a soccer game, will focus
more on frames which consist of goals and score-board.

2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020)
[Shruti Jadon] [IEEE Member] [FD5][144]
Our Approach & Contributions:
● We attempt to solve video summarization through unsupervised learning by employing traditional vision-
based algorithmic methodologies for accurate feature extraction from video frames.
● We have also proposed a deep learning-based feature extraction followed by multiple clustering methods
to find an effective way of summarizing a video by interesting keyframe extraction.
● We have compared the performance of these approaches on the SumMe dataset and showcased that using
deep learning-based feature extraction has been proven to perform better in case of dynamic viewpoint
videos.

2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020)
[Shruti Jadon] [IEEE Member] [FD5][144]
Our Approach mainly have 2 key parts:
1. Keyframe Extraction via analysing features/interest points from different video frames.
1. Video Skimming: Add +2 and -2 seconds of points around interesting frame.

2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020)
[Shruti Jadon] [IEEE Member] [FD5][144]

2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020)
[Shruti Jadon] [IEEE Member] [FD5][144]
Methods for KeyFrame Extraction
1. Uniform Sampling
1. Image Histogram
1. SIFT(Scale Invariant Feature Transform)
1. VSUMM
1. ResNet16 pre-trained on ImageNet.

2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020)
[Shruti Jadon] [IEEE Member] [FD5][144]
EXPERIMENTS AND RESULTS

2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020)
[Shruti Jadon] [IEEE Member] [FD5][144]
Dataset
SumMe dataset which was created to be used as a benchmark for video summarization. The dataset contains
25 videos with the length ranging from one to six minutes. Each of these videos are annotated by at least 15
humans with a total of 390 human summaries. The annotations were collected by crowd sourcing.
The length of all the human generated summaries are restricted to be within 15% of the original video.

2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020)
[Shruti Jadon] [IEEE Member] [FD5][144]
Evaluation Method
The SumMe dataset provides individual scores to each annotated frames. We evaluate our method by
measuring the F-score from the set of frames that have been selected by our method.

2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020)
[Shruti Jadon] [IEEE Member] [FD5][144]
Results

2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020)
[Shruti Jadon] [IEEE Member] [FD5][144]
Conclusion
We can conclude that
1. Gaussian Clustering along with Convolutional Networks can give better performance than other methods
with moving point camera videos.
2. Even Uniform Sampling is giving better result for videos which have stable camera view point and very
less motion.
We can conclude that one single algorithm can’t be solution of video summarization, it is dependent of the
type of video, the motion inside video.

2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020)
[Shruti Jadon] [IEEE Member] [FD5][144]
Thank you!!!
For further queries you can email me at sjadon@umass.edu

Recently uploaded

Quantum Leap in Next-Generation Computing

WSO2

Dubai, known for its towering skyscrapers, luxurious lifestyle, and relentless pursuit of innovation, often finds itself in the global spotlight. However, amidst the glitz and glamour, the emirate faces its own set of challenges, including the occasional threat of flooding. In recent years, Dubai has experienced sporadic but significant floods, disrupting normalcy and posing unique challenges to its infrastructure. Among the critical nodes in this bustling metropolis is the Dubai International Airport, a vital hub connecting the world. This article delves into the intersection of Dubai flood events and the resilience demonstrated by the Dubai International Airport in the face of such challenges.

Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf

Orbitshub

The presentation was made in “Web3 Fusion: Embracing AI and Beyond” is more than a conference; it's a journey into the heart of digital transformation. The conference a provided a platform where the future of technology meets practical application. This three-day hybrid event, set in the heart of innovation, served as a gateway to the latest trends and transformative discussions in AI, Blockchain, IoT, AR/VR, and their collective impact on the information space.

AI in Action: Real World Use Cases by Anitaraj

AnitaRaj43

Exploring Multimodal Embeddings with Milvus

Zilliz

Architecting Cloud Native Applications

WSO2

How to Check CNIC Information Online with Pakdata cf

danishmna97

Sidekick Solutions uses Bonterra Impact Management (fka Social Solutions Apricot) and automation solutions to integrate data for business workflows. We believe integration and automation are essential to user experience and the promise of efficient work through technology. Automation is the critical ingredient to realizing that full vision. We develop integration products and services for Bonterra Case Management software to support the deployment of automations for a variety of use cases. This video focuses on the deployment of external web forms using Jotform for Bonterra Impact Management. This solution can be customized to your organization’s needs and deployed to support the common use cases below: - Intake and consent - Assessments - Surveys - Applications - Program registration Interested in deploying web form automations for Bonterra Impact Management? Contact us at sales@sidekicksolutionsllc.com to discuss next steps.

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...

Jeffrey Haguewood

Key topics covered: - Real-world examples of Choreo's comprehensive coverage from application design and deployment, security, scaling, and monitoring - Running different types of workloads, such as web applications, APIs, microservices, integrations, and tasks at scale, and wire them together to deliver seamless omnichannel digital experiences - How Choreo improves the developer experience by eliminating repetition, silos, and redundancy through enhanced discoverability and self-serviceability

Choreo: Empowering the Future of Enterprise Software Engineering

WSO2

JohnPollard-hybrid-app-RailsConf2024.pptx

JohnPollard37

Tracing the root cause of a performance issue requires a lot of patience, experience, and focus. It’s so hard that we sometimes attempt to guess by trying out tentative fixes, but that usually results in frustration, messy code, and a considerable waste of time and money. This talk explains how to correctly zoom in on a performance bottleneck using three levels of profiling: distributed tracing, metrics, and method profiling. After we learn to read the JVM profiler output as a flame graph, we explore a series of bottlenecks typical for backend systems, like connection/thread pool starvation, invisible aspects, blocking code, hot CPU methods, lock contention, and Virtual Thread pinning, and we learn to trace them even if they occur in library code you are not familiar with. Attend this talk and prepare for the performance issues that will eventually hit any successful system. About authorWith two decades of experience, Victor is a Java Champion working as a trainer for top companies in Europe. Five thousands developers in 120 companies attended his workshops, so he gets to debate every week the challenges that various projects struggle with. In return, Victor summarizes key points from these workshops in conference talks and online meetups for the European Software Crafters, the world’s largest developer community around architecture, refactoring, and testing. Discover how Victor can help you on victorrentea.ro : company training catalog, consultancy and YouTube playlists.

Finding Java's Hidden Performance Traps @ DevoxxUK 2024

Victor Rentea

MINDCTI Revenue Release Quarter One 2024

MIND CTI

Webinar Recording: https://www.panagenda.com/webinars/why-teams-call-analytics-is-critical-to-your-entire-business Nothing is as frustrating and noticeable as being in an important call and being unable to see or hear the other person. Not surprising then, that issues with Teams calls are among the most common problems users call their helpdesk for. Having in depth insight into everything relevant going on at the user’s device, local network, ISP and Microsoft itself during the call is crucial for good Microsoft Teams Call quality support. To ensure a quick and adequate solution and to ensure your users get the most out of their Microsoft 365. But did you know that ‘bad calls’ are also an excellent indicator of other problems arising? Precisely because it is so noticeable!? Like the canary in the mine, bad calls can be early indicators of problems. Problems that might otherwise not have been noticed for a while but can have a big impact on productivity and satisfaction. Join this session by Christoph Adler to learn how true Microsoft Teams call quality analytics helped other organizations troubleshoot bad calls and identify and fix problems that impacted Teams calls or the use of Microsoft365 in general. See what it can do to keep your users happy and productive! In this session we will cover - Why CQD data alone is not enough to troubleshoot call problems - The importance of attributing call problems to the right call participant - What call quality analytics can do to help you quickly find, fix-, and prevent problems - Why having retrospective detailed insights matters - Real life examples of how others have used Microsoft Teams call quality monitoring to problem shoot problems with their ISP, network, device health and more.

Why Teams call analytics are critical to your entire business

panagenda

AWS Community Day CPH - Three problems of Terraform

Andrey Devyatkin

The microservices honeymoon is over. When starting a new project or revamping a legacy monolith, teams started looking for alternatives to microservices. The Modular Monolith, or 'Modulith', is an architecture that reaps the benefits of (vertical) functional decoupling without the high costs associated with separate deployments. This talk will delve into the advantages and challenges of this progressive architecture, beginning with exploring the concept of a 'module', its internal structure, public API, and inter-module communication patterns. Supported by spring-modulith, the talk provides practical guidance on addressing the main challenges of a Modultith Architecture: finding and guarding module boundaries, data decoupling, and integration module-testing. You should not miss this talk if you are a software architect or tech lead seeking practical, scalable solutions. About the author With two decades of experience, Victor is a Java Champion working as a trainer for top companies in Europe. Five thousands developers in 120 companies attended his workshops, so he gets to debate every week the challenges that various projects struggle with. In return, Victor summarizes key points from these workshops in conference talks and online meetups for the European Software Crafters, the world’s largest developer community around architecture, refactoring, and testing. Discover how Victor can help you on victorrentea.ro : company training catalog, consultancy and YouTube playlists.

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024

Victor Rentea

Introduction to use of FHIR Documents in ABDM

Kumar Satyam

Dubai, often portrayed as a shimmering oasis in the desert, faces its own set of challenges, including the occasional threat of flooding. Despite its reputation for opulence and modernity, the emirate is not immune to the forces of nature. In recent years, Dubai has experienced sporadic but significant floods, testing the resilience of its infrastructure and communities. Among the critical lifelines in this bustling metropolis is the Dubai International Airport, a bustling hub that connects the city to the world. This article explores the intersection of Dubai flood events and the resilience demonstrated by the Dubai International Airport in the face of such challenges.

Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...

Orbitshub

In this session, we will discuss the journey of API governance from its initial, ungoverned state to the development of sophisticated models that tackle contemporary challenges. We'll explore how APIs have become essential in the intersection of business and technology, adapting to advancements and evolving needs. We'll focus on how organizations have moved from launching to monetizing APIs, using models like pay-per-use and subscriptions, and finding the right balance between technical implementation and business strategy. We'll also highlight the impact of governance on monetization strategies, especially how data security, compliance, and service quality influence pricing. Real-world examples will demonstrate the effective integration of governance with monetization, including AI's role in dynamic pricing. Looking ahead, we'll share insights into future trends in API governance and monetization, emphasizing the importance of adaptability, continuous learning, and innovation.

API Governance and Monetization - The evolution of API governance

WSO2

Corporate and higher education. Two industries that, in the past, have had a clear divide with very little crossover. The difference in goals, learning styles and objectives paved the way for differing learning technologies platforms to evolve. Now, those stark lines are blurring as both sides are discovering they have content that’s relevant to the other. Join Tammy Rutherford as she walks through the pros and cons of corporate and higher ed collaborating. And the challenges of these different technology platforms working together for a brighter future.

Corporate and higher education May webinar.pptx

Rustici Software

The integration landscape is changing rapidly with the introduction of technologies like GraphQL, gRPC, stream processing, iPaaS, and platformless. However, not all existing applications and industries can keep up with these new technologies. Certain industries, like manufacturing, logistics, and finance, still rely on well-established EDI-based message formats. Some applications use XML or CSV with file-based communications, while others have strict on premises deployment requirements. This talk focuses on how Ballerina's built-in integration capabilities can bridge the gap between "old" and "new" technologies, modernizing enterprise applications without disrupting business operations.

Modernizing Legacy Systems Using Ballerina

WSO2

TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....

rightmanforbloodline

Recently uploaded (20)

Quantum Leap in Next-Generation Computing

Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf

AI in Action: Real World Use Cases by Anitaraj

Exploring Multimodal Embeddings with Milvus

Architecting Cloud Native Applications

How to Check CNIC Information Online with Pakdata cf

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...

Choreo: Empowering the Future of Enterprise Software Engineering

JohnPollard-hybrid-app-RailsConf2024.pptx

Finding Java's Hidden Performance Traps @ DevoxxUK 2024

MINDCTI Revenue Release Quarter One 2024

Why Teams call analytics are critical to your entire business

AWS Community Day CPH - Three problems of Terraform

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024

Introduction to use of FHIR Documents in ABDM

Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...

API Governance and Monetization - The evolution of API governance

Corporate and higher education May webinar.pptx

Modernizing Legacy Systems Using Ballerina

TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....

Unsupervised video summarization framework using keyframe extraction and video skimming

1. 2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020) [Shruti Jadon] [IEEE Member] [FD5][144] Unsupervised video summarization framework using keyframe extraction and video skimming

2. 2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020) [Shruti Jadon] [IEEE Member] [FD5][144] Video Summarization Video is one of the robust sources of information and the consumption of online and offline videos has reached an unprecedented level in the last few years. Apart from context understanding, it almost impossible to create a universal summarized video for everyone, as everyone has their own bias of keyframe. Example: In a soccer game, a coach person might consider those frames which consist of information on player placement, techniques, etc; however, a person with less knowledge about a soccer game, will focus more on frames which consist of goals and score-board.

3. 2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020) [Shruti Jadon] [IEEE Member] [FD5][144] Video Summarization problems A Summarized video must have the following properties: • It must contain the high priority entities and events from the video. • The summary should be free of repetition and redundancy. Failure to exclude these components might lead to misinterpretation of the video from its summarized version. A summary of video also varies from person to person, so a supervised video summarization can be seen as personalized recommendation approach, but it requires extensive amount of data labeling by a person and yet the trained model isn’t a generalized model among mass.

4. 2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020) [Shruti Jadon] [IEEE Member] [FD5][144] Our Approach & Contributions: ● We attempt to solve video summarization through unsupervised learning by employing traditional vision- based algorithmic methodologies for accurate feature extraction from video frames. ● We have also proposed a deep learning-based feature extraction followed by multiple clustering methods to find an effective way of summarizing a video by interesting keyframe extraction. ● We have compared the performance of these approaches on the SumMe dataset and showcased that using deep learning-based feature extraction has been proven to perform better in case of dynamic viewpoint videos.

5. 2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020) [Shruti Jadon] [IEEE Member] [FD5][144] Our Approach mainly have 2 key parts: 1. Keyframe Extraction via analysing features/interest points from different video frames. 1. Video Skimming: Add +2 and -2 seconds of points around interesting frame.

6. 2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020) [Shruti Jadon] [IEEE Member] [FD5][144]

7. 2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020) [Shruti Jadon] [IEEE Member] [FD5][144] Methods for KeyFrame Extraction 1. Uniform Sampling 1. Image Histogram 1. SIFT(Scale Invariant Feature Transform) 1. VSUMM 1. ResNet16 pre-trained on ImageNet.

8. 2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020) [Shruti Jadon] [IEEE Member] [FD5][144] EXPERIMENTS AND RESULTS

9. 2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020) [Shruti Jadon] [IEEE Member] [FD5][144] Dataset SumMe dataset which was created to be used as a benchmark for video summarization. The dataset contains 25 videos with the length ranging from one to six minutes. Each of these videos are annotated by at least 15 humans with a total of 390 human summaries. The annotations were collected by crowd sourcing. The length of all the human generated summaries are restricted to be within 15% of the original video.

10. 2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020) [Shruti Jadon] [IEEE Member] [FD5][144] Evaluation Method The SumMe dataset provides individual scores to each annotated frames. We evaluate our method by measuring the F-score from the set of frames that have been selected by our method.

11. 2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020) [Shruti Jadon] [IEEE Member] [FD5][144] Results

12. 2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020) [Shruti Jadon] [IEEE Member] [FD5][144] Conclusion We can conclude that 1. Gaussian Clustering along with Convolutional Networks can give better performance than other methods with moving point camera videos. 2. Even Uniform Sampling is giving better result for videos which have stable camera view point and very less motion. We can conclude that one single algorithm can’t be solution of video summarization, it is dependent of the type of video, the motion inside video.

13. 2020 5th IEEE International Conference on Computing, Communication and Automation (ICCCA 2020) [Shruti Jadon] [IEEE Member] [FD5][144] Thank you!!! For further queries you can email me at sjadon@umass.edu

Unsupervised video summarization framework using keyframe extraction and video skimming

Recommended

Recommended

More Related Content

Similar to Unsupervised video summarization framework using keyframe extraction and video skimming

Similar to Unsupervised video summarization framework using keyframe extraction and video skimming (20)

Recently uploaded

Recently uploaded (20)

Unsupervised video summarization framework using keyframe extraction and video skimming