SEPTEMBER 26th, 2017
Neeraj Baghel
M.Tech, 178150005
Under the Supervision of
Prof. Charul Bhatnagar
Professor, Deptt. of CEA
GLA University, Mathura
1/20
FIRST PROGRESS PRESENTATION
ON
VIDEO SUMMARIZATION
OUTLINE
 Video Summarization
 Types of Video Summarization
 Applications
 Issues & Challenges
 Tools & Datasets
 Journals & Conferences
 Researchers & Groups
 References
2/20
Video
• Video data is a great asset
for information extraction
and knowledge discovery.
• Due to its size and variability,
it is extremely hard for
users to monitor.[4]
Video Summarization
• Intelligent video
summarization algorithms
allow us to quickly browse a
lengthy video by capturing
the essence and removing
redundant information.[4]
3/20
Video Summarization
[4] Sharghi, Aidean, Jacob S. Laurel, and Boqing Gong. "Query-focused video summarization: Dataset, evaluation,
and a memory network based approach." The IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2017.
[9] https://www.slideshare.net/MikolajLeszczuk/results-on-video-summarization (D.L.V 01/09/18)
Fig 1: Video Summarization Work Flow [9]
A video can be summarized in two different ways, which are as
follows.
4/20
Types of Video summarization
Fig 2: Video Summarization Technique Classification [7]
[7] Mundur, Padmavathi, Yong Rao, and Yelena Yesha. "Keyframe-based video summarization using Delaunay
clustering." International Journal on Digital Libraries 6.2 (2006): 219-232. (D.L.V 20/08/18)
Key Frame Extraction
Fig 3: Key Frame Extraction [8]
5/20
[8] Souza, Celso L. de, et al. "A unified approach to content-based indexing and retrieval of digital videos from
television archives." (2014). (D.L.V 05/09/18)
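A minimal sketch of key frame extraction, assuming a simple frame-difference rule: a frame is kept whenever it differs enough from the last kept frame. This is an illustration only, not the method of [7] or [8]. Frames are represented as flattened grayscale lists, and the names `mean_abs_diff`, `select_key_frames`, and `threshold` are hypothetical.

```python
from typing import List, Sequence


def mean_abs_diff(a: Sequence[float], b: Sequence[float]) -> float:
    """Mean absolute pixel difference between two equally sized frames."""
    if len(a) != len(b):
        raise ValueError("frames must have the same number of pixels")
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def select_key_frames(frames: Sequence[Sequence[float]], threshold: float) -> List[int]:
    """Return indices of frames that differ from the last key frame by more than threshold."""
    if not frames:
        return []
    key_indices = [0]  # the first frame always opens the summary
    for i in range(1, len(frames)):
        if mean_abs_diff(frames[i], frames[key_indices[-1]]) > threshold:
            key_indices.append(i)
    return key_indices


# Toy video (hypothetical data): three "scenes" of flat 4-pixel frames.
toy_video = [
    [10, 10, 10, 10],
    [11, 10, 10, 11],
    [200, 200, 200, 200],
    [201, 199, 200, 200],
    [90, 90, 90, 90],
]
key_frames = select_key_frames(toy_video, threshold=20.0)
print(key_frames)
```

With real video, the frames would typically be decoded by a library such as OpenCV, and the pixel difference is often replaced by a histogram or feature distance.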
Video Skims
• A video skim is also called a moving-image abstract, moving
storyboard, or summary sequence.
• The original video is segmented into parts, each of which is a
video clip of shorter duration.
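The segmentation idea behind a video skim can be sketched as follows, assuming fixed-length segments and a per-segment importance score that is already available. The scores, the segment length, and the function names are all hypothetical; real systems usually segment at shot boundaries rather than at fixed intervals.

```python
from typing import List, Sequence, Tuple

Segment = Tuple[int, int]  # (start_frame, end_frame), end exclusive


def split_into_segments(num_frames: int, segment_len: int) -> List[Segment]:
    """Cut the frame range [0, num_frames) into consecutive fixed-length segments."""
    if segment_len <= 0:
        raise ValueError("segment_len must be positive")
    return [(s, min(s + segment_len, num_frames))
            for s in range(0, num_frames, segment_len)]


def build_skim(segments: Sequence[Segment], scores: Sequence[float], keep: int) -> List[Segment]:
    """Keep the `keep` highest-scoring segments and return them in temporal order."""
    ranked = sorted(range(len(segments)), key=lambda i: scores[i], reverse=True)[:keep]
    return [segments[i] for i in sorted(ranked)]


segments = split_into_segments(num_frames=100, segment_len=30)
# Hypothetical importance scores, one per segment (e.g. from motion or audio cues).
skim = build_skim(segments, scores=[0.2, 0.9, 0.1, 0.7], keep=2)
print(segments)
print(skim)
```

Returning the selected segments in temporal order keeps the skim watchable as a shortened version of the original video.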
6/20
[11] https://www.cs.cmu.edu/~msmith/skim_homepage.html
Fig 4: Automated Video Skimming Informedia Digital Video Library Project [11]
Applications
The applications of video summarization can be divided into three
main categories:
1) Consumer Video Applications
 Browsing the recorded content
 View the interesting parts quickly
7/20
Fig 4: View The Interesting Parts Quickly [12]
[12] https://www.youtube.com/watch?v=OHAWwaYu2H0&t=46s (D.L.V 20/09/18)
Cont…
2) Image-Video Databases Management
 Video search engine
 Digital video library
 Object indexing and retrieval
 Automatic object labeling
8/20
Fig 5: Digital video library [13]
[13] https://www.searchenginejournal.com/deep-learning-powers-video-seo/175145/ (D.L.V 21/09/18)
Cont…
3) Surveillance
 Outdoor Perimeter Security
 Internet Security Systems
 Parking Lots
 Traffic Monitoring
Fig 6: Traffic Monitoring [14]
Fig 7: Outdoor Perimeter Security [14]
9/20
[14] https://www.framos.com/en/solutions/mobility/ (D.L.V 21/09/18)
Issues and Challenges
Some general issues and challenges related to video
summarization:
 Loss of information
 Computationally expensive
 Difficulty of evaluating the performance of a video summarizer
 No single video summarizer fits all users
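To make the evaluation issue concrete, one commonly used measure is an F-score over the frames that a machine summary and a user summary have in common. The sketch below assumes both summaries are given as sets of selected frame indices; the function name and the example data are hypothetical.

```python
from typing import Set


def f_score(predicted: Set[int], reference: Set[int]) -> float:
    """Harmonic mean of precision and recall over selected frame indices."""
    overlap = len(predicted & reference)
    if overlap == 0:
        return 0.0  # also covers empty summaries, avoiding division by zero
    precision = overlap / len(predicted)
    recall = overlap / len(reference)
    return 2 * precision * recall / (precision + recall)


machine_summary = set(range(0, 40))   # frames 0..39 (hypothetical)
user_summary = set(range(20, 60))     # frames 20..59 (hypothetical)
score = f_score(machine_summary, user_summary)
print(round(score, 3))
```

Because different users produce different reference summaries, the same machine summary can receive quite different scores, which is one reason no single summarizer fits all users.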
10/20
Tools
 Matlab
MATLAB is a commercial product that is widely used in the image and
video processing community. It also has an adequate image processing
toolbox, and toolboxes for Kalman filters, neural networks,
genetic algorithms, and so on. It runs on most Unix variants, including Linux,
and on Windows. For people who are researching vision algorithms,
the lack of source code is a serious drawback.
 OpenCV
OpenCV is a library of programming functions aimed mainly at real-time
computer vision. It was originally developed by Intel. The library is
cross-platform and free for use under an open-source license (BSD for
releases up to 4.4, Apache 2.0 from 4.5.0 onward).
11/20
Datasets
 UT Egocentric (UTE)
The dataset contains 4 videos from head-mounted cameras, each about
3-5 hours long. (Size: 1.4 GB)
 SumMe
The dataset consists of 25 videos which are single-shot and range in length
from 1-6 minutes. The dataset contains summaries created by 15 to 18
users with the constraint in length being that the summaries should be 5%
to 15% of the original video. (Size: 2.2 GB)
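The length constraint above can be checked with a few lines of arithmetic. This is a minimal sketch, assuming durations are given in seconds; the function name and default bounds simply restate the 5% to 15% range and are not part of any SumMe tooling.

```python
def within_length_budget(summary_sec: float, video_sec: float,
                         low: float = 0.05, high: float = 0.15) -> bool:
    """True if the summary length is between `low` and `high` of the video length."""
    if video_sec <= 0:
        raise ValueError("video_sec must be positive")
    ratio = summary_sec / video_sec
    return low <= ratio <= high


# A 3-minute video (180 s): a 20 s summary is about 11% of the original.
ok_summary = within_length_budget(20, 180)
too_long = within_length_budget(45, 180)   # 25% of the original
too_short = within_length_budget(5, 180)   # under 3% of the original
print(ok_summary, too_long, too_short)
```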
12/20
Datasets Cont…
Dataset
 YouTube-8M
YouTube-8M is a large-scale labeled video dataset that consists of millions of
YouTube video IDs and associated labels from a diverse vocabulary of 4700+
visual entities
• Each video must be public and have at least 1000 views
• Each video must be between 120 and 500 seconds long
• Each video must be associated with at least one entity from our target
vocabulary
• Adult & sensitive content is removed (as determined by automated classifiers)
May 2018 version (current): 6.1M videos, 3862 classes, 3.0 labels/video, 2.6B
audio-visual features
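The selection criteria listed above can be expressed as a simple filter. This is only a sketch of the stated rules, not the actual YouTube-8M pipeline: the `VideoMeta` record, its field names, and `is_eligible` are hypothetical, and the sensitive-content flag is assumed to come from some external classifier.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class VideoMeta:
    """Hypothetical metadata record for one video."""
    is_public: bool
    views: int
    duration_sec: int
    entities: List[str] = field(default_factory=list)
    flagged_sensitive: bool = False  # assumed output of an automated classifier


def is_eligible(video: VideoMeta) -> bool:
    """Apply the four criteria from the slide to one video."""
    return (
        video.is_public
        and video.views >= 1000
        and 120 <= video.duration_sec <= 500
        and len(video.entities) >= 1
        and not video.flagged_sensitive
    )


candidates = [
    VideoMeta(True, 5000, 300, ["Guitar"]),          # satisfies every rule
    VideoMeta(True, 800, 300, ["Guitar"]),           # too few views
    VideoMeta(True, 5000, 90, ["Guitar"]),           # too short
    VideoMeta(True, 5000, 300, []),                  # no entity label
    VideoMeta(True, 5000, 300, ["Guitar"], True),    # flagged as sensitive
]
eligible = [is_eligible(v) for v in candidates]
print(eligible)
```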
13/20
Datasets Cont…
Dataset
 MED Summaries
The "MED Summaries" dataset is intended for the evaluation of dynamic video
summaries. It contains annotations of 160 videos: a validation set of 60
videos and a test set of 100 videos. There are 10 event categories in the
test set. The currently available dataset is from 235 users; all images are in
bitmap (*.bmp) format, with a resolution of 800 × 600 pixels.
(Size: 12 GB)
14/20
Journals
 IEEE Transactions on Pattern Analysis and Machine Intelligence
 IEEE Transactions on Image Processing
 SPINGER-IPSJ Transactions on Computer Vision and
Applications (CVA)
 ELSEVIER- Computer Vision and Image Understanding
 ELSEVIER-Pattern Recognition
 IJCV - International Journal of Computer Vision
 IJIPA- International Journal of Image Processing and Applications
 IET- The Institution of Engineering and Technology
15/20
Conferences
 IEEE/CVF Conference on Computer Vision and Pattern Recognition
(CVPR)
 IEEE International Conference on Image Processing (ICIP)
 IEEE/CVF International Conference on Computer Vision (ICCV)
 IEEE Winter Conference on Applications of Computer Vision (WACV)
 ACCV - Asian Conference on Computer Vision
 ECCV - European Conference on Computer Vision
 CVIP- International Conference on Computer Vision and Image
Processing , India
 NCVPRIPG -National Conference on Computer Vision, Pattern
Recognition, Image Processing and Graphics , India
16/20
Research Group
17/20
Fei-Fei Li
Professor Director, Stanford AI Lab
Computer Science Department
Feifeili@cs.stanford.edu
Stanford Computer Vision Lab
Animesh Garg
Professor, Stanford AI Lab
Computer Science Department
garg@cs.standford.edu
Research Group
18/20
Aidean Sharghi
Center for Research in Computer Vision,
University of Central Florida
aidean.sharghi@gmail.com
Boqing Gong
Assistant Professor
Center for Research in Computer Vision
Department of Computer Science
University of Central Florida
boqingGo@outlook.com
Center for Research in Computer Vision,
University of Central Florida
Research Group
19/20
Abhishek Sarkar
Senior Research Scientist
International Institute of Information Technology
Hyderabad, INDIA
Abhishek.sarkar@iiit.ac.in
Dr. C. V. Jawahar
Researcher,
International Institute of Information Technology
Hyderabad, INDIA
jawahar@iiit.ac.in
International Institute of Information Technology
References
[1] Song, Yale, et al. "TVSum: Summarizing web videos using
titles." Proceedings of the IEEE Conference on Computer Vision and Pattern
Recognition. 2015.
[2] Zhuang, Yueting, Ruogui Xiao, and Fei Wu. "Key issues in video summarization and
its application." Proceedings of the 2003 Joint Conference of the Fourth
International Conference on Information, Communications and Signal Processing
and Fourth Pacific Rim Conference on Multimedia. Vol. 1. IEEE, 2003.
[3] Kansagara, Ravi, Darshak Thakore, and Mahasweta Joshi. "A study on video
summarization techniques." International Journal of Innovative Research in
Computer and Communication Engineering 2 (2014).
[4] Sharghi, Aidean, Jacob S. Laurel, and Boqing Gong. "Query-focused video
summarization: Dataset, evaluation, and a memory network based
approach." The IEEE Conference on Computer Vision and Pattern Recognition
(CVPR). 2017.
[5] Ramesh, Animesh, et al. "Video Summarization: An Overview of Techniques."
20/20
References
[6] Sabbar, W.; Chergui, A.; Bekkhoucha, A., "Video summarization using shot
segmentation and local motion estimation," Innovative Computing Technology
(INTECH), 2012 Second International Conference on, pp. 190-193, 18-20
Sept. 2012.
[7] Mundur, Padmavathi, Yong Rao, and Yelena Yesha. "Keyframe-based video
summarization using Delaunay clustering." International Journal on Digital
Libraries 6.2 (2006): 219-232.
[8] Souza, Celso L. de, et al. "A unified approach to content-based indexing and
retrieval of digital videos from television archives." (2014).
[9] https://www.slideshare.net/MikolajLeszczuk/results-on-video-summarization
[10] Landy, Michael S., Yoav Cohen, and George Sperling. "HIPS: A Unix-based image
processing system." Computer Vision, Graphics, and Image Processing 25.3
(1984): 331-347.
21/20
References
[11] https://www.cs.cmu.edu/~msmith/skim_homepage.html
[12] https://www.youtube.com/watch?v=OHAWwaYu2H0&t=46s
[13] https://www.searchenginejournal.com/deep-learning-powers-video-seo/175145/
[14] https://www.framos.com/en/solutions/mobility/
22/20
23

More Related Content

What's hot

4.3 multimedia datamining
4.3 multimedia datamining4.3 multimedia datamining
4.3 multimedia dataminingKrish_ver2
 
Ensemble learning
Ensemble learningEnsemble learning
Ensemble learningHaris Jamil
 
A Study on Credit Card Fraud Detection using Machine Learning
A Study on Credit Card Fraud Detection using Machine LearningA Study on Credit Card Fraud Detection using Machine Learning
A Study on Credit Card Fraud Detection using Machine Learningijtsrd
 
Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier Dev Sahu
 
Mtech Fourth progress presentation
Mtech Fourth progress presentationMtech Fourth progress presentation
Mtech Fourth progress presentationNEERAJ BAGHEL
 
Movie Recommender System Using Artificial Intelligence
Movie Recommender System Using Artificial Intelligence Movie Recommender System Using Artificial Intelligence
Movie Recommender System Using Artificial Intelligence Shrutika Oswal
 
Graphical Password Authentication using Cued click point technique with zero ...
Graphical Password Authentication using Cued click point technique with zero ...Graphical Password Authentication using Cued click point technique with zero ...
Graphical Password Authentication using Cued click point technique with zero ...NurrulHafizza
 
Association Analysis in Data Mining
Association Analysis in Data MiningAssociation Analysis in Data Mining
Association Analysis in Data MiningKamal Acharya
 
Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ...
Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ...Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ...
Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ...Simplilearn
 
Weapon detection using artificial intelligence and deep learning for security...
Weapon detection using artificial intelligence and deep learning for security...Weapon detection using artificial intelligence and deep learning for security...
Weapon detection using artificial intelligence and deep learning for security...Venkat Projects
 
Movie lens movie recommendation system
Movie lens movie recommendation systemMovie lens movie recommendation system
Movie lens movie recommendation systemGaurav Sawant
 
Emotion detection using cnn.pptx
Emotion detection using cnn.pptxEmotion detection using cnn.pptx
Emotion detection using cnn.pptxRADO7900
 
Active Learning in Collaborative Filtering Recommender Systems : a Survey
Active Learning in Collaborative Filtering Recommender Systems : a SurveyActive Learning in Collaborative Filtering Recommender Systems : a Survey
Active Learning in Collaborative Filtering Recommender Systems : a SurveyUniversity of Bergen
 

What's hot (20)

Bayesian network
Bayesian networkBayesian network
Bayesian network
 
Unit 4
Unit 4Unit 4
Unit 4
 
KNN
KNNKNN
KNN
 
4.3 multimedia datamining
4.3 multimedia datamining4.3 multimedia datamining
4.3 multimedia datamining
 
Ensemble learning
Ensemble learningEnsemble learning
Ensemble learning
 
A Study on Credit Card Fraud Detection using Machine Learning
A Study on Credit Card Fraud Detection using Machine LearningA Study on Credit Card Fraud Detection using Machine Learning
A Study on Credit Card Fraud Detection using Machine Learning
 
Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier
 
Entity2rec recsys
Entity2rec recsysEntity2rec recsys
Entity2rec recsys
 
Mtech Fourth progress presentation
Mtech Fourth progress presentationMtech Fourth progress presentation
Mtech Fourth progress presentation
 
Movie Recommender System Using Artificial Intelligence
Movie Recommender System Using Artificial Intelligence Movie Recommender System Using Artificial Intelligence
Movie Recommender System Using Artificial Intelligence
 
Graphical Password Authentication using Cued click point technique with zero ...
Graphical Password Authentication using Cued click point technique with zero ...Graphical Password Authentication using Cued click point technique with zero ...
Graphical Password Authentication using Cued click point technique with zero ...
 
Association Analysis in Data Mining
Association Analysis in Data MiningAssociation Analysis in Data Mining
Association Analysis in Data Mining
 
SPADE -
SPADE - SPADE -
SPADE -
 
Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ...
Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ...Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ...
Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ...
 
Weapon detection using artificial intelligence and deep learning for security...
Weapon detection using artificial intelligence and deep learning for security...Weapon detection using artificial intelligence and deep learning for security...
Weapon detection using artificial intelligence and deep learning for security...
 
Movie lens movie recommendation system
Movie lens movie recommendation systemMovie lens movie recommendation system
Movie lens movie recommendation system
 
Emotion detection using cnn.pptx
Emotion detection using cnn.pptxEmotion detection using cnn.pptx
Emotion detection using cnn.pptx
 
A Comparison of Block-Matching Motion Estimation Algorithms
A Comparison of Block-Matching Motion Estimation AlgorithmsA Comparison of Block-Matching Motion Estimation Algorithms
A Comparison of Block-Matching Motion Estimation Algorithms
 
Active Learning in Collaborative Filtering Recommender Systems : a Survey
Active Learning in Collaborative Filtering Recommender Systems : a SurveyActive Learning in Collaborative Filtering Recommender Systems : a Survey
Active Learning in Collaborative Filtering Recommender Systems : a Survey
 
MetaCDN
MetaCDNMetaCDN
MetaCDN
 

Similar to Mtech First progress PRESENTATION ON VIDEO SUMMARIZATION

M.tech Third progress Presentation
M.tech Third progress PresentationM.tech Third progress Presentation
M.tech Third progress PresentationNEERAJ BAGHEL
 
ROAD POTHOLE DETECTION USING YOLOV4 DARKNET
ROAD POTHOLE DETECTION USING YOLOV4 DARKNETROAD POTHOLE DETECTION USING YOLOV4 DARKNET
ROAD POTHOLE DETECTION USING YOLOV4 DARKNETIRJET Journal
 
IRJET - NETRA: Android Application for Visually Challenged People to Dete...
IRJET -  	  NETRA: Android Application for Visually Challenged People to Dete...IRJET -  	  NETRA: Android Application for Visually Challenged People to Dete...
IRJET - NETRA: Android Application for Visually Challenged People to Dete...IRJET Journal
 
Motion capture for Animation
Motion capture for AnimationMotion capture for Animation
Motion capture for AnimationIRJET Journal
 
IRJET - Applications of Image and Video Deduplication: A Survey
IRJET -  	  Applications of Image and Video Deduplication: A SurveyIRJET -  	  Applications of Image and Video Deduplication: A Survey
IRJET - Applications of Image and Video Deduplication: A SurveyIRJET Journal
 
Voice Enable Blind Assistance System -Real time Object Detection
Voice Enable Blind Assistance System -Real time Object DetectionVoice Enable Blind Assistance System -Real time Object Detection
Voice Enable Blind Assistance System -Real time Object DetectionIRJET Journal
 
User centric machine learning for cyber security operation center
User centric machine learning for cyber security operation centerUser centric machine learning for cyber security operation center
User centric machine learning for cyber security operation centerSai Chandra Chittuluri
 
Video Data Visualization System : Semantic Classification and Personalization
Video Data Visualization System : Semantic Classification and Personalization  Video Data Visualization System : Semantic Classification and Personalization
Video Data Visualization System : Semantic Classification and Personalization ijcga
 
Video Data Visualization System : Semantic Classification and Personalization
Video Data Visualization System : Semantic Classification and Personalization  Video Data Visualization System : Semantic Classification and Personalization
Video Data Visualization System : Semantic Classification and Personalization ijcga
 
IRJET- Object Detection and Recognition for Blind Assistance
IRJET- Object Detection and Recognition for Blind AssistanceIRJET- Object Detection and Recognition for Blind Assistance
IRJET- Object Detection and Recognition for Blind AssistanceIRJET Journal
 
Keynote WFIoT2019 - Data Graph, Knowledge Graphs Ontologies, Internet of Thin...
Keynote WFIoT2019 - Data Graph, Knowledge Graphs Ontologies, Internet of Thin...Keynote WFIoT2019 - Data Graph, Knowledge Graphs Ontologies, Internet of Thin...
Keynote WFIoT2019 - Data Graph, Knowledge Graphs Ontologies, Internet of Thin...Amélie Gyrard
 
Video Liveness Verification
Video Liveness VerificationVideo Liveness Verification
Video Liveness Verificationijtsrd
 
IRJET- Review on Human Action Detection in Stored Videos using Support Vector...
IRJET- Review on Human Action Detection in Stored Videos using Support Vector...IRJET- Review on Human Action Detection in Stored Videos using Support Vector...
IRJET- Review on Human Action Detection in Stored Videos using Support Vector...IRJET Journal
 
Real Time Moving Object Detection for Day-Night Surveillance using AI
Real Time Moving Object Detection for Day-Night Surveillance using AIReal Time Moving Object Detection for Day-Night Surveillance using AI
Real Time Moving Object Detection for Day-Night Surveillance using AIIRJET Journal
 
Unsupervised video summarization framework using keyframe extraction and vide...
Unsupervised video summarization framework using keyframe extraction and vide...Unsupervised video summarization framework using keyframe extraction and vide...
Unsupervised video summarization framework using keyframe extraction and vide...Shruti Jadon
 
Image processing research proposal
Image processing research proposalImage processing research proposal
Image processing research proposalIftikhar Ahmad
 
Multimedia Content Understanding: Bringing Context to Content
Multimedia Content Understanding: Bringing Context to ContentMultimedia Content Understanding: Bringing Context to Content
Multimedia Content Understanding: Bringing Context to ContentBenoit HUET
 
SUMMARY GENERATION FOR LECTURING VIDEOS
SUMMARY GENERATION FOR LECTURING VIDEOSSUMMARY GENERATION FOR LECTURING VIDEOS
SUMMARY GENERATION FOR LECTURING VIDEOSIRJET Journal
 
Precaution for Covid-19 based on Mask detection and sensor
Precaution for Covid-19 based on Mask detection and sensorPrecaution for Covid-19 based on Mask detection and sensor
Precaution for Covid-19 based on Mask detection and sensorIRJET Journal
 
Real Time Head Generation for Video Conferencing
Real Time Head Generation for Video ConferencingReal Time Head Generation for Video Conferencing
Real Time Head Generation for Video ConferencingIRJET Journal
 

Similar to Mtech First progress PRESENTATION ON VIDEO SUMMARIZATION (20)

M.tech Third progress Presentation
M.tech Third progress PresentationM.tech Third progress Presentation
M.tech Third progress Presentation
 
ROAD POTHOLE DETECTION USING YOLOV4 DARKNET
ROAD POTHOLE DETECTION USING YOLOV4 DARKNETROAD POTHOLE DETECTION USING YOLOV4 DARKNET
ROAD POTHOLE DETECTION USING YOLOV4 DARKNET
 
IRJET - NETRA: Android Application for Visually Challenged People to Dete...
IRJET -  	  NETRA: Android Application for Visually Challenged People to Dete...IRJET -  	  NETRA: Android Application for Visually Challenged People to Dete...
IRJET - NETRA: Android Application for Visually Challenged People to Dete...
 
Motion capture for Animation
Motion capture for AnimationMotion capture for Animation
Motion capture for Animation
 
IRJET - Applications of Image and Video Deduplication: A Survey
IRJET -  	  Applications of Image and Video Deduplication: A SurveyIRJET -  	  Applications of Image and Video Deduplication: A Survey
IRJET - Applications of Image and Video Deduplication: A Survey
 
Voice Enable Blind Assistance System -Real time Object Detection
Voice Enable Blind Assistance System -Real time Object DetectionVoice Enable Blind Assistance System -Real time Object Detection
Voice Enable Blind Assistance System -Real time Object Detection
 
User centric machine learning for cyber security operation center
User centric machine learning for cyber security operation centerUser centric machine learning for cyber security operation center
User centric machine learning for cyber security operation center
 
Video Data Visualization System : Semantic Classification and Personalization
Video Data Visualization System : Semantic Classification and Personalization  Video Data Visualization System : Semantic Classification and Personalization
Video Data Visualization System : Semantic Classification and Personalization
 
Video Data Visualization System : Semantic Classification and Personalization
Video Data Visualization System : Semantic Classification and Personalization  Video Data Visualization System : Semantic Classification and Personalization
Video Data Visualization System : Semantic Classification and Personalization
 
IRJET- Object Detection and Recognition for Blind Assistance
IRJET- Object Detection and Recognition for Blind AssistanceIRJET- Object Detection and Recognition for Blind Assistance
IRJET- Object Detection and Recognition for Blind Assistance
 
Keynote WFIoT2019 - Data Graph, Knowledge Graphs Ontologies, Internet of Thin...
Keynote WFIoT2019 - Data Graph, Knowledge Graphs Ontologies, Internet of Thin...Keynote WFIoT2019 - Data Graph, Knowledge Graphs Ontologies, Internet of Thin...
Keynote WFIoT2019 - Data Graph, Knowledge Graphs Ontologies, Internet of Thin...
 
Video Liveness Verification
Video Liveness VerificationVideo Liveness Verification
Video Liveness Verification
 
IRJET- Review on Human Action Detection in Stored Videos using Support Vector...
IRJET- Review on Human Action Detection in Stored Videos using Support Vector...IRJET- Review on Human Action Detection in Stored Videos using Support Vector...
IRJET- Review on Human Action Detection in Stored Videos using Support Vector...
 
Real Time Moving Object Detection for Day-Night Surveillance using AI
Real Time Moving Object Detection for Day-Night Surveillance using AIReal Time Moving Object Detection for Day-Night Surveillance using AI
Real Time Moving Object Detection for Day-Night Surveillance using AI
 
Unsupervised video summarization framework using keyframe extraction and vide...
Unsupervised video summarization framework using keyframe extraction and vide...Unsupervised video summarization framework using keyframe extraction and vide...
Unsupervised video summarization framework using keyframe extraction and vide...
 
Image processing research proposal
Image processing research proposalImage processing research proposal
Image processing research proposal
 
Multimedia Content Understanding: Bringing Context to Content
Multimedia Content Understanding: Bringing Context to ContentMultimedia Content Understanding: Bringing Context to Content
Multimedia Content Understanding: Bringing Context to Content
 
SUMMARY GENERATION FOR LECTURING VIDEOS
SUMMARY GENERATION FOR LECTURING VIDEOSSUMMARY GENERATION FOR LECTURING VIDEOS
SUMMARY GENERATION FOR LECTURING VIDEOS
 
Precaution for Covid-19 based on Mask detection and sensor
Precaution for Covid-19 based on Mask detection and sensorPrecaution for Covid-19 based on Mask detection and sensor
Precaution for Covid-19 based on Mask detection and sensor
 
Real Time Head Generation for Video Conferencing
Real Time Head Generation for Video ConferencingReal Time Head Generation for Video Conferencing
Real Time Head Generation for Video Conferencing
 

More from NEERAJ BAGHEL

Generating super resolution images using transformers
Generating super resolution images using transformersGenerating super resolution images using transformers
Generating super resolution images using transformersNEERAJ BAGHEL
 
Hierarchical structure adaptive
Hierarchical structure adaptiveHierarchical structure adaptive
Hierarchical structure adaptiveNEERAJ BAGHEL
 
Unsupervised object-level video summarization with online motion auto-encoder
Unsupervised object-level video summarization with online motion auto-encoderUnsupervised object-level video summarization with online motion auto-encoder
Unsupervised object-level video summarization with online motion auto-encoderNEERAJ BAGHEL
 
Host rank:Exploiting the Hierarchical Structure for Link Analysis
Host rank:Exploiting the Hierarchical Structure for Link AnalysisHost rank:Exploiting the Hierarchical Structure for Link Analysis
Host rank:Exploiting the Hierarchical Structure for Link AnalysisNEERAJ BAGHEL
 
TVSum: Summarizing Web Videos Using Titles
TVSum: Summarizing Web Videos Using TitlesTVSum: Summarizing Web Videos Using Titles
TVSum: Summarizing Web Videos Using TitlesNEERAJ BAGHEL
 
Query focused video summarization
Query focused video summarizationQuery focused video summarization
Query focused video summarizationNEERAJ BAGHEL
 
Traffic behavior of local area network based on
Traffic behavior of local area network based onTraffic behavior of local area network based on
Traffic behavior of local area network based onNEERAJ BAGHEL
 
A Framework For Dynamic Hand Gesture Recognition Using Key Frames Extraction
A Framework For Dynamic Hand Gesture Recognition Using Key Frames ExtractionA Framework For Dynamic Hand Gesture Recognition Using Key Frames Extraction
A Framework For Dynamic Hand Gesture Recognition Using Key Frames ExtractionNEERAJ BAGHEL
 
Fingerprint recognition
Fingerprint recognitionFingerprint recognition
Fingerprint recognitionNEERAJ BAGHEL
 
SMOWSER (A VOICE BASED BROWSER)
SMOWSER (A VOICE BASED BROWSER)SMOWSER (A VOICE BASED BROWSER)
SMOWSER (A VOICE BASED BROWSER)NEERAJ BAGHEL
 

More from NEERAJ BAGHEL (13)

Generating super resolution images using transformers
Generating super resolution images using transformersGenerating super resolution images using transformers
Generating super resolution images using transformers
 
Latex intro
Latex introLatex intro
Latex intro
 
Hierarchical structure adaptive
Hierarchical structure adaptiveHierarchical structure adaptive
Hierarchical structure adaptive
 
Unsupervised object-level video summarization with online motion auto-encoder
Unsupervised object-level video summarization with online motion auto-encoderUnsupervised object-level video summarization with online motion auto-encoder
Unsupervised object-level video summarization with online motion auto-encoder
 
Host rank:Exploiting the Hierarchical Structure for Link Analysis
Host rank:Exploiting the Hierarchical Structure for Link AnalysisHost rank:Exploiting the Hierarchical Structure for Link Analysis
Host rank:Exploiting the Hierarchical Structure for Link Analysis
 
TVSum: Summarizing Web Videos Using Titles
TVSum: Summarizing Web Videos Using TitlesTVSum: Summarizing Web Videos Using Titles
TVSum: Summarizing Web Videos Using Titles
 
Query focused video summarization
Query focused video summarizationQuery focused video summarization
Query focused video summarization
 
Traffic behavior of local area network based on
Traffic behavior of local area network based onTraffic behavior of local area network based on
Traffic behavior of local area network based on
 
A Framework For Dynamic Hand Gesture Recognition Using Key Frames Extraction
A Framework For Dynamic Hand Gesture Recognition Using Key Frames ExtractionA Framework For Dynamic Hand Gesture Recognition Using Key Frames Extraction
A Framework For Dynamic Hand Gesture Recognition Using Key Frames Extraction
 
Fingerprint recognition
Fingerprint recognitionFingerprint recognition
Fingerprint recognition
 
Disk scheduling
Disk schedulingDisk scheduling
Disk scheduling
 
SMOWSER (A VOICE BASED BROWSER)
SMOWSER (A VOICE BASED BROWSER)SMOWSER (A VOICE BASED BROWSER)
SMOWSER (A VOICE BASED BROWSER)
 
Itvv project ppt
Itvv project pptItvv project ppt
Itvv project ppt
 

Recently uploaded

Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptDineshKumar4165
 
Work-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptxWork-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptxJuliansyahHarahap1
 
notes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptnotes on Evolution Of Analytic Scalability.ppt


M.Tech First Progress Presentation on Video Summarization

  • 1. First Progress Presentation on Video Summarization. Neeraj Baghel, M.Tech, 178150005. Under the supervision of Prof. Charul Bhatnagar, Professor, Dept. of CEA, GLA University, Mathura. September 26th, 2017.
  • 2. Outline: Video Summarization; Types of Video Summarization; Applications; Issues & Challenges; Tools & Datasets; Journals & Conferences; Researchers & Groups; References.
  • 3. Video Summarization. Video: Video data is a great asset for information extraction and knowledge discovery, but due to its size and variability it is extremely hard for users to monitor. [4] Video summarization: Intelligent video summarization algorithms allow us to quickly browse a lengthy video by capturing its essence and removing redundant information. [4] Fig 1: Video Summarization Work Flow [9]. [4] Sharghi, Aidean, Jacob S. Laurel, and Boqing Gong. "Query-focused video summarization: Dataset, evaluation, and a memory network based approach." The IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2017. [9] https://www.slideshare.net/MikolajLeszczuk/results-on-video-summarization (D.L.V 01/09/18)
  • 4. Types of Video Summarization. A video can be summarized in two different ways: key frame extraction and video skims. Fig 2: Video Summarization Technique Classification [7]. [7] Mundur, Padmavathi, Yong Rao, and Yelena Yesha. "Keyframe-based video summarization using Delaunay clustering." International Journal on Digital Libraries 6.2 (2006): 219-232. (D.L.V 20/08/18)
  • 5. Key Frame Extraction. Fig 3: Key Frame Extraction [8]. [8] Souza, Celso L. de, et al. "A unified approach to content-based indexing and retrieval of digital videos from television archives." (2014). (D.L.V 05/09/18)
  • 6. Video Skims. A video skim is also called a moving-image abstract, moving storyboard, or summary sequence. The original video is segmented into parts, each of which is a shorter video clip. Fig 4: Automated Video Skimming, Informedia Digital Video Library Project [11]. [11] https://www.cs.cmu.edu/~msmith/skim_homepage.html
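The two summary types on slides 5 and 6 can be illustrated with a minimal sketch. It is not the method of any cited paper: it assumes each frame is already described by a small feature vector (for example a normalized color histogram), starts a new shot wherever consecutive frames differ by more than a hand-picked threshold, and then takes either the middle frame of each shot (key frames) or its first few frames (a skim). The function names, the L1 distance, and the threshold value are all assumptions made for illustration.

```python
from typing import List, Sequence, Tuple


def l1_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Sum of absolute differences between two equal-length feature vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def split_into_shots(
    features: List[Sequence[float]], threshold: float
) -> List[Tuple[int, int]]:
    """Return shots as (start, end) frame indices, end exclusive.

    A new shot starts wherever two consecutive frames differ by more than
    `threshold` (an assumed, hand-tuned value).
    """
    if not features:
        return []
    shots = []
    start = 0
    for i in range(1, len(features)):
        if l1_distance(features[i - 1], features[i]) > threshold:
            shots.append((start, i))
            start = i
    shots.append((start, len(features)))
    return shots


def key_frames(shots: List[Tuple[int, int]]) -> List[int]:
    """Static summary: index of the middle frame of every shot."""
    return [(start + end - 1) // 2 for start, end in shots]


def skim(shots: List[Tuple[int, int]], max_len: int) -> List[Tuple[int, int]]:
    """Dynamic summary: keep at most `max_len` leading frames of every shot."""
    return [(start, min(end, start + max_len)) for start, end in shots]


# Toy input: three visually distinct runs of frames.
frames = [[1.0, 0.0]] * 4 + [[0.0, 1.0]] * 3 + [[0.5, 0.5]] * 5
shots = split_into_shots(frames, threshold=0.5)
print(shots)              # shot boundaries
print(key_frames(shots))  # one representative frame per shot
print(skim(shots, 2))     # short clip per shot
```

A key-frame summary is a set of still images, while a skim keeps short clips and so preserves motion; both start from the same shot segmentation in this sketch.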
  • 7. Applications. The applications of video summarization can be divided into three main categories. 1) Consumer Video Applications: browsing the recorded content; viewing the interesting parts quickly. Fig 4: View The Interesting Parts Quickly [12]. [12] https://www.youtube.com/watch?v=OHAWwaYu2H0&t=46s (D.L.V 20/09/18)
  • 8. Applications (cont.) 2) Image and Video Database Management: video search engine; digital video library; object indexing and retrieval; automatic object labeling. Fig 5: Digital video library [13]. [13] https://www.searchenginejournal.com/deep-learning-powers-video-seo/175145/ (D.L.V 21/09/18)
  • 9. Applications (cont.) 3) Surveillance: outdoor perimeter security; Internet security systems; parking lots; traffic monitoring. Fig 6: Traffic Monitoring [14]. Fig 7: Outdoor Perimeter Security [14]. [14] https://www.framos.com/en/solutions/mobility/ (D.L.V 21/09/18)
  • 10. Issues and Challenges. Some general issues and challenges related to video summarization: loss of information; high computational cost; difficulty of evaluating the performance of a video summarizer; no single video summarizer fits all users.
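One common way to approach the evaluation issue is to compare the frames an algorithm selects against summaries created by human annotators and to average the agreement over several annotators, since no single reference fits all users. The sketch below is an assumption about how such a score could be computed, not a protocol taken from the slides: it uses the standard F-score (harmonic mean of precision and recall) over sets of selected frame indices, and the function names are hypothetical.

```python
from typing import Iterable, List


def f_score(predicted: Iterable[int], reference: Iterable[int]) -> float:
    """F-score between two summaries given as collections of frame indices."""
    pred, ref = set(predicted), set(reference)
    overlap = len(pred & ref)
    if overlap == 0:
        # Covers empty summaries and summaries with no frame in common.
        return 0.0
    precision = overlap / len(pred)  # share of selected frames that are wanted
    recall = overlap / len(ref)      # share of wanted frames that were selected
    return 2 * precision * recall / (precision + recall)


def mean_f_score(predicted: Iterable[int], references: List[Iterable[int]]) -> float:
    """Average F-score against several user summaries (list must be non-empty)."""
    pred = list(predicted)
    return sum(f_score(pred, ref) for ref in references) / len(references)


# One machine summary scored against two hypothetical user summaries.
machine = range(0, 10)
users = [range(5, 15), range(0, 10)]
print(mean_f_score(machine, users))
```

Averaging over annotators rewards summaries that many users agree with; taking the maximum instead would reward matching any one user closely.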
  • 11. Tools. MATLAB: a commercial product that is widely used in the image and video processing community. It has an adequate image processing toolbox, and toolboxes for things like Kalman filters, neural networks, genetic algorithms, and so on. It runs on most Unix variants, including Linux, and on Windows 95/NT. For people researching vision algorithms, the lack of source code is a serious drawback. OpenCV: a library of programming functions mainly aimed at real-time computer vision, originally developed by Intel. The library is cross-platform and free for use under the open-source BSD license.
  • 12. Datasets. UT Egocentric (UTE): the dataset contains 4 videos from head-mounted cameras, each about 3-5 hours long. (Size: 1.4 GB) SumMe: the dataset consists of 25 single-shot videos ranging in length from 1 to 6 minutes. It contains summaries created by 15 to 18 users, with the length constraint that each summary should be 5% to 15% of the original video. (Size: 2.2 GB)
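The 5% to 15% length constraint quoted for SumMe translates directly into a frame budget. As a small worked example (the 25 fps frame rate and the function names are assumptions, not part of the dataset description), a 4-minute video at 25 fps has 6000 frames, so a conforming summary would contain between 300 and 900 frames:

```python
from typing import Tuple


def summary_budget(n_frames: int, low_pct: int = 5, high_pct: int = 15) -> Tuple[int, int]:
    """Smallest and largest summary length, in frames, allowed by the constraint.

    Integer arithmetic avoids floating-point rounding: the lower bound is
    rounded up and the upper bound is rounded down.
    """
    lowest = -(-n_frames * low_pct // 100)  # ceiling division
    highest = n_frames * high_pct // 100    # floor division
    return lowest, highest


def within_budget(summary_len: int, n_frames: int) -> bool:
    """True if a summary of `summary_len` frames satisfies the 5-15% constraint."""
    lowest, highest = summary_budget(n_frames)
    return lowest <= summary_len <= highest


n_frames = 4 * 60 * 25  # 4 minutes at an assumed 25 frames per second
print(summary_budget(n_frames))
print(within_budget(600, n_frames), within_budget(1000, n_frames))
```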
  • 13. Datasets (cont.) YouTube-8M: a large-scale labeled video dataset that consists of millions of YouTube video IDs and associated labels from a diverse vocabulary of 4700+ visual entities. Selection criteria: each video must be public and have at least 1000 views; each video must be between 120 and 500 seconds long; each video must be associated with at least one entity from the target vocabulary; adult and sensitive content is removed (as determined by automated classifiers). May 2018 version (current): 6.1M videos, 3862 classes, 3.0 labels/video, 2.6B audio-visual features.
  • 14. Datasets (cont.) MED Summaries: a dataset for the evaluation of dynamic video summaries. It contains annotations of 160 videos: a validation set of 60 videos and a test set of 100 videos. There are 10 event categories in the test set. The currently available dataset is from 235 users; all images are in bitmap (*.bmp) format at a resolution of 800 x 600 pixels. (Size: 12 GB)
  • 15. Journals: IEEE Transactions on Pattern Analysis and Machine Intelligence; IEEE Transactions on Image Processing; Springer: IPSJ Transactions on Computer Vision and Applications (CVA); Elsevier: Computer Vision and Image Understanding; Elsevier: Pattern Recognition; IJCV: International Journal of Computer Vision; IJIPA: International Journal of Image Processing and Applications; IET: The Institution of Engineering and Technology.
  • 16. Conferences: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); IEEE International Conference on Image Processing (ICIP); IEEE/CVF International Conference on Computer Vision (ICCV); IEEE Winter Conference on Applications of Computer Vision (WACV); ACCV: Asian Conference on Computer Vision; ECCV: European Conference on Computer Vision; CVIP: International Conference on Computer Vision and Image Processing, India; NCVPRIPG: National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics, India.
  • 17. Research Group: Stanford Computer Vision Lab. Fei-Fei Li, Professor; Director, Stanford AI Lab, Computer Science Department, Feifeili@cs.stanford.edu. Animesh Garg, Professor, Stanford AI Lab, Computer Science Department, garg@cs.standford.edu.
  • 18. Research Group: Center for Research in Computer Vision, University of Central Florida. Aidean Sharghi, Center for Research in Computer Vision, University of Central Florida, aidean.sharghi@gmail.com. Boqing Gong, Assistant Professor, Center for Research in Computer Vision, Department of Computer Science, University of Central Florida, boqingGo@outlook.com.
  • 19. Research Group: International Institute of Information Technology. Abhishek Sarkar, Senior Research Scientist, International Institute of Information Technology, Hyderabad, India, Abhishek.sarkar@iiit.ac.in. Dr. C. V. Jawahar, Researcher, International Institute of Information Technology, Hyderabad, India, jawahar@iiit.ac.in.
  • 20. References
      [1] Song, Yale, et al. "TVSum: Summarizing web videos using titles." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2015.
      [2] Zhuang, Yueting, Ruogui Xiao, and Fei Wu. "Key issues in video summarization and its application." Information, Communications and Signal Processing, 2003 and Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint Conference of the Fourth International Conference on. Vol. 1. IEEE, 2003.
      [3] Kansagara, Ravi, Darshak Thakore, and Mahasweta Joshi. "A study on video summarization techniques." International Journal of Innovative Research in Computer and Communication Engineering 2 (2014).
      [4] Sharghi, Aidean, Jacob S. Laurel, and Boqing Gong. "Query-focused video summarization: Dataset, evaluation, and a memory network based approach." The IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2017.
      [5] Ramesh, Animesh, et al. "Video Summarization: An Overview of Techniques."
  • 21. References (cont.)
      [6] Sabbar, W., A. Chergui, and A. Bekkhoucha. "Video summarization using shot segmentation and local motion estimation." Innovative Computing Technology (INTECH), 2012 Second International Conference on, pp. 190-193, 18-20 Sept. 2012.
      [7] Mundur, Padmavathi, Yong Rao, and Yelena Yesha. "Keyframe-based video summarization using Delaunay clustering." International Journal on Digital Libraries 6.2 (2006): 219-232.
      [8] Souza, Celso L. de, et al. "A unified approach to content-based indexing and retrieval of digital videos from television archives." (2014).
      [9] https://www.slideshare.net/MikolajLeszczuk/results-on-video-summarization
      [10] Landy, Michael S., Yoav Cohen, and George Sperling. "HIPS: A Unix-based image processing system." Computer Vision, Graphics, and Image Processing 25.3 (1984): 331-347.
  • 22. References (cont.)
      [11] https://www.cs.cmu.edu/~msmith/skim_homepage.html
      [12] https://www.youtube.com/watch?v=OHAWwaYu2H0&t=46s
      [13] https://www.searchenginejournal.com/deep-learning-powers-video-seo/175145/
      [14] https://www.framos.com/en/solutions/mobility/