SlideShare a Scribd company logo
1 
Generating a time shrunk 
lecture video by event 
detection 
Presented by: Mona Ragheb 
Yara Ali 
Supervised by: Dr. Aliaa Youssif
2 
Agenda 
 Introduction 
 Generating lecture video using virtual camera work 
 Event detection steps 
 Evaluation 
 Results 
 Conclusion 
 References
3 
Introduction 
 E-learning has become a popular method used in higher 
education. 
 However, video recording 
by a cameraman and video 
editing takes a long time 
and costs a great deal. 
 To solve this problem, a 
system has been developed 
to generate dynamic lecture 
video using virtual camerawork from the high resolution 
images recorded by a HDV (high-definition video) camcorder
4 
How the system works? 
 The system generates a lecture video: 
1. Using virtual camerawork based on shooting 
techniques of broadcast cameramen 
2. By cropping from the high resolution image to track 
the region of interest (ROI) such as the instructor. 
3. Generate time shrunk video using event detection 
 Camera motion analysis is used to detect scene 
changes.
5 
Shooting techniques 
 People invariably make the 
same sets of mistakes when 
they first start shooting video: 
1. Trees or telephone poles sticking out of the back 
of someone's head 
2. Interview subjects who are just darkened blurs 
because there was bright light in the background 
3. Boring shots of buildings with no action
6 
Event Detection 
 Two kinds of events were detected: 
1. A speech period 
2. Chalkboard writing period
7 
Kinds of event detection 
1. Speech period: 
It’s detected by voice activity detection with LPC 
cepstrum and classified into speech or non-speech 
using Mahalanobis distance
8 
LPC 
 Linear predictive coding (LPC) is a tool used mostly in 
audio signal processing and speech processing for 
representing the spectral envelope of a digital signal of 
speech in compressed form, using the information of a 
linear predictive model. 
 one of the most useful methods for encoding good quality 
speech at a low bit rate and provides extremely accurate 
estimates of speech parameters. 
 A spectral envelope is a curve in the frequency-amplitude 
plane, derived from a Fourier magnitude spectrum. It 
describes one point in time (one window, to be precise).
9 
How LPC works? 
 LPC analyzes the speech signal by estimating the 
formants, removing their effects from the speech signal, 
and estimating the intensity and frequency of the 
remaining buzz. 
 The process of removing the formants is called inverse 
filtering, and the remaining signal after the subtraction of 
the filtered modeled signal is called the residue. 
 use the buzz parameters and the residue to create a 
source signal, use the formants to create a filter (which 
represents the tube), and run the source through the 
filter, resulting in speech.
10 
Formants
11 
Kinds of event detection (Cont.) 
2. Chalkboard writing period 
 It’s detected by using a graph cuts technique to 
segment a precise region of interests such as an 
instructor. 
 By deleting content-free period, i.e, period without 
the events of speech and writing, and fast-forwarding 
writing periods, our method can 
generate a time shrunk lecture video 
automatically.
12 
Generating lecture video using virtual 
camera work 
 A HDV camcorder is located at the back of the 
classroom to videotape images with high resolution 
(1,400 × 810 pixels), which contain the whole area 
of the chalkboard, so that students can read the 
handwritten characters on the chalkboard. 
 Problem: it is impossible to display the high 
resolution image on the small screen of a general 
notebook PC.
13 
Generating lecture video using virtual 
camera work (Cont.)
14 
Solution? 
1. The system detects a moving object by 
temporal differencing 
2. The timing for virtual camerawork is 
detected using bilateral filtering and zero 
crossing.
Bilateral Filter 
 The bilateral filter was introduced by 
Tomasi et al. as a non-iterative means of 
smoothing images while retaining edge 
detail. 
 It involves a weighted convolution in which 
the weight for each pixel depends not only 
on its distance from the center pixel, but 
also its relative intensity. 
15
16 
Bilateral Filter 
• (a) and (b) show the potential of bilateral filtering for the 
removal of texture. The picture "simplification" illustrated 
by figure 2 (b) can be useful for data reduction without loss 
of overall shape features in applications such as image 
transmission, picture editing and manipulation, image 
description for retrieval.
17 
Generating lecture video using virtual 
camerawork (Cont.) 
 If the ROI has a large movement, this period of the video is classified into 
panning, and if the ROI has no motion but voice activity, this period is 
classified into zooming. 
 Panning is used to show motion and speed. It is a technique that requires 
practice since it has to be done in one smooth, continuous motion.
18 
Event detection steps 
1. Voice activity detection 
2. Chalkboard writing detection 
1. Object detection and segmentation 
2. Generation of current chalkboard image 
3. Chalkboard writing detection 
3. Generating a time shrunk video
19 
1- Voice activity detection
1- Voice activity detection (Cont.) 
 Whenever you do a finite Fourier transform, you're 
implicitly applying it to an infinitely repeating signal. 
So, for instance, if the start and end of your finite 
sample don't match then that will look just like a 
discontinuity in the signal, and show up as lots of 
high-frequency nonsense in the Fourier transform, 
which you don't really want. 
20
1- Voice activity detection (Cont.) 
 If your sample happens to be a beautiful sinusoid 
but an integer number of periods don't happen to fit 
exactly into the finite sample, your FT will show 
appreciable energy in all sorts of places nowhere 
near the real frequency. You don't want any of that. 
 Windowing the data makes sure that the ends 
match up while keeping everything reasonably 
smooth; this greatly reduces the sort of "spectral 
leakage" described in the previous paragraph 
21
2-Chalkboard writing detection 
1-Object detection and segmentation 
 Extracting a precise object region is needed for 
detecting periods of writing characters on 
chalkboard. 
 Temporal differencing ( object detection ) is robust 
to lighting change. 
 Temporal differencing can not extract all foreground 
pixels of moving objects so another technique is 
used to support ( Graph cuts technique )
23 
2-Chalkboard writing detection 
1-Object detection and segmentation
2- Chalkboard writing detection 
2-1-Object detection and segmentation (Cont.) 
24
Image Segmentation 
 Image segmentation is an important problem 
in computer vision and medical image 
analysis. 
 The objective of image segmentation is to 
provide a visually meaningful partition of the 
image domain. Although it is usually an easy 
task for human to separate background and 
different objects for a given image. 
25
26 
2- Chalkboard writing detection 
2-2- Generation of current chalkboard image
27 
2- Chalkboard writing detection 
2-3- Chalkboard writing detection
28 
2-3-Chalkboard writing detection 
(Cont.)
29 
3- Generating a time shrunk video
30 
3- Generating a time shrunk video
31 
Evaluation 
 We videotaped 3 lectures (each video is 90 min 
long) by HDV camcorder. 
 In this evaluation, we use “recall” and “precision” for 
determining effectiveness of detection result.
Precision Vs Recall 
• Precision (also called positive predictive value) 
• Recall (also known as sensitivity) both calculate no of 
correct returned in in total frames 
 In this figure the relevant items are to the left of the straight 
line while the retrieved items are within the oval. The red 
regions represent errors. On the 
left these are the relevant items 
not retrieved (false negatives), 
while on the right they are the 
retrieved items that are not 
relevant (false positives).
33 
Results 
 There are 200 false positive frames in a 90 
min video because of the students’ voices.
34 
Results
35 
Results
36 
Conclusion 
 This paper presents a novel approach for generating a time 
shrunk lecture video using event detection. 
 Our method detects speech periods by voice activity detection 
and chalk board writing periods by a combination of object 
detection and segmentation techniques. 
 By deleting the content-free periods and fast-forwarding the 
chalkboard writing periods, our method can generate a time 
shrunk lecture video automatically. 
 The resulting generated video is about 20%∼30% shorter than 
the original video in time. This is almost the same as the results 
of manual editing by a human operator.
37 
References 
1. http://www.vision.cs.chubu.ac.jp/04/pdf/e-learning08.pdf 
2. http://research.cs.tamu.edu/prism/lectures/sp/l9.pdf 
3. http://www.princeton.edu/~achaney/tmve/wiki100k/docs/4. http://www.ee.columbia.edu/~dpwe/e4896/lectures 
/E4896-L06.pdf
38 
ANY QUESTIONS?
39 
THANK YOU !

More Related Content

What's hot

IJSRDV3I40293
IJSRDV3I40293IJSRDV3I40293
IJSRDV3I40293
Christal Jebi
 
PhD_ppt_2012
PhD_ppt_2012PhD_ppt_2012
PhD_ppt_2012
Diego Cheda
 
Digital Image Processing
Digital Image ProcessingDigital Image Processing
Digital Image Processing
Sahil Biswas
 
Image Processing (General Topic)
Image Processing (General Topic)Image Processing (General Topic)
Image Processing (General Topic)
mcc.jeppiaar
 
Image Sensing & Acquisition
Image Sensing & AcquisitionImage Sensing & Acquisition
Image Sensing & Acquisition
O.V.S Shashank Ram
 
Digital Image Forensics: camera fingerprint and its robustness
Digital Image Forensics: camera fingerprint and its robustness Digital Image Forensics: camera fingerprint and its robustness
Digital Image Forensics: camera fingerprint and its robustness
Francesco Forestieri
 
IMAGE COMPRESSION AND DECOMPRESSION SYSTEM
IMAGE COMPRESSION AND DECOMPRESSION SYSTEMIMAGE COMPRESSION AND DECOMPRESSION SYSTEM
IMAGE COMPRESSION AND DECOMPRESSION SYSTEM
Vishesh Banga
 
Digital Image Processing: An Introduction
Digital Image Processing: An IntroductionDigital Image Processing: An Introduction
Digital Image Processing: An Introduction
Mostafa G. M. Mostafa
 
“Person Re-Identification and Tracking at the Edge: Challenges and Techniques...
“Person Re-Identification and Tracking at the Edge: Challenges and Techniques...“Person Re-Identification and Tracking at the Edge: Challenges and Techniques...
“Person Re-Identification and Tracking at the Edge: Challenges and Techniques...
Edge AI and Vision Alliance
 
Hardware Implementation of Genetic Algorithm Based Digital Colour Image Water...
Hardware Implementation of Genetic Algorithm Based Digital Colour Image Water...Hardware Implementation of Genetic Algorithm Based Digital Colour Image Water...
Hardware Implementation of Genetic Algorithm Based Digital Colour Image Water...
IDES Editor
 
Adaptive denoising technique for colour images
Adaptive denoising technique for colour imagesAdaptive denoising technique for colour images
Adaptive denoising technique for colour images
eSAT Journals
 
G0523444
G0523444G0523444
G0523444
IOSR Journals
 
פוסטר דר פרידמן
פוסטר דר פרידמןפוסטר דר פרידמן
Performance Analysis of Digital Watermarking Of Video in the Spatial Domain
Performance Analysis of Digital Watermarking Of Video in the Spatial DomainPerformance Analysis of Digital Watermarking Of Video in the Spatial Domain
Performance Analysis of Digital Watermarking Of Video in the Spatial Domain
paperpublications3
 
robust image watermarking
robust image watermarkingrobust image watermarking
robust image watermarking
sravanthi mandala
 
Fractal Image Compression Using Quadtree Decomposition
Fractal Image Compression Using Quadtree DecompositionFractal Image Compression Using Quadtree Decomposition
Fractal Image Compression Using Quadtree Decomposition
Harshit Varshney
 
CATWALKGRADER: A CATWALK ANALYSIS AND CORRECTION SYSTEM USING MACHINE LEARNIN...
CATWALKGRADER: A CATWALK ANALYSIS AND CORRECTION SYSTEM USING MACHINE LEARNIN...CATWALKGRADER: A CATWALK ANALYSIS AND CORRECTION SYSTEM USING MACHINE LEARNIN...
CATWALKGRADER: A CATWALK ANALYSIS AND CORRECTION SYSTEM USING MACHINE LEARNIN...
mlaij
 
Design of Shadow Detection and Removal System
Design of Shadow Detection and Removal SystemDesign of Shadow Detection and Removal System
Design of Shadow Detection and Removal System
ijsrd.com
 
Blur Detection Methods for Digital Images-A Survey
Blur Detection Methods for Digital Images-A SurveyBlur Detection Methods for Digital Images-A Survey
Blur Detection Methods for Digital Images-A Survey
Editor IJCATR
 
Dip sdit 7
Dip sdit 7Dip sdit 7
Dip sdit 7
Karan Joshi
 

What's hot (20)

IJSRDV3I40293
IJSRDV3I40293IJSRDV3I40293
IJSRDV3I40293
 
PhD_ppt_2012
PhD_ppt_2012PhD_ppt_2012
PhD_ppt_2012
 
Digital Image Processing
Digital Image ProcessingDigital Image Processing
Digital Image Processing
 
Image Processing (General Topic)
Image Processing (General Topic)Image Processing (General Topic)
Image Processing (General Topic)
 
Image Sensing & Acquisition
Image Sensing & AcquisitionImage Sensing & Acquisition
Image Sensing & Acquisition
 
Digital Image Forensics: camera fingerprint and its robustness
Digital Image Forensics: camera fingerprint and its robustness Digital Image Forensics: camera fingerprint and its robustness
Digital Image Forensics: camera fingerprint and its robustness
 
IMAGE COMPRESSION AND DECOMPRESSION SYSTEM
IMAGE COMPRESSION AND DECOMPRESSION SYSTEMIMAGE COMPRESSION AND DECOMPRESSION SYSTEM
IMAGE COMPRESSION AND DECOMPRESSION SYSTEM
 
Digital Image Processing: An Introduction
Digital Image Processing: An IntroductionDigital Image Processing: An Introduction
Digital Image Processing: An Introduction
 
“Person Re-Identification and Tracking at the Edge: Challenges and Techniques...
“Person Re-Identification and Tracking at the Edge: Challenges and Techniques...“Person Re-Identification and Tracking at the Edge: Challenges and Techniques...
“Person Re-Identification and Tracking at the Edge: Challenges and Techniques...
 
Hardware Implementation of Genetic Algorithm Based Digital Colour Image Water...
Hardware Implementation of Genetic Algorithm Based Digital Colour Image Water...Hardware Implementation of Genetic Algorithm Based Digital Colour Image Water...
Hardware Implementation of Genetic Algorithm Based Digital Colour Image Water...
 
Adaptive denoising technique for colour images
Adaptive denoising technique for colour imagesAdaptive denoising technique for colour images
Adaptive denoising technique for colour images
 
G0523444
G0523444G0523444
G0523444
 
פוסטר דר פרידמן
פוסטר דר פרידמןפוסטר דר פרידמן
פוסטר דר פרידמן
 
Performance Analysis of Digital Watermarking Of Video in the Spatial Domain
Performance Analysis of Digital Watermarking Of Video in the Spatial DomainPerformance Analysis of Digital Watermarking Of Video in the Spatial Domain
Performance Analysis of Digital Watermarking Of Video in the Spatial Domain
 
robust image watermarking
robust image watermarkingrobust image watermarking
robust image watermarking
 
Fractal Image Compression Using Quadtree Decomposition
Fractal Image Compression Using Quadtree DecompositionFractal Image Compression Using Quadtree Decomposition
Fractal Image Compression Using Quadtree Decomposition
 
CATWALKGRADER: A CATWALK ANALYSIS AND CORRECTION SYSTEM USING MACHINE LEARNIN...
CATWALKGRADER: A CATWALK ANALYSIS AND CORRECTION SYSTEM USING MACHINE LEARNIN...CATWALKGRADER: A CATWALK ANALYSIS AND CORRECTION SYSTEM USING MACHINE LEARNIN...
CATWALKGRADER: A CATWALK ANALYSIS AND CORRECTION SYSTEM USING MACHINE LEARNIN...
 
Design of Shadow Detection and Removal System
Design of Shadow Detection and Removal SystemDesign of Shadow Detection and Removal System
Design of Shadow Detection and Removal System
 
Blur Detection Methods for Digital Images-A Survey
Blur Detection Methods for Digital Images-A SurveyBlur Detection Methods for Digital Images-A Survey
Blur Detection Methods for Digital Images-A Survey
 
Dip sdit 7
Dip sdit 7Dip sdit 7
Dip sdit 7
 

Viewers also liked

Localization with mobile anchor points in wireless sensor networks
Localization with mobile anchor points in wireless sensor networksLocalization with mobile anchor points in wireless sensor networks
Localization with mobile anchor points in wireless sensor networks
Habibur Rahman
 
A Survey on Localization of Wireless Sensors
A Survey on Localization of Wireless SensorsA Survey on Localization of Wireless Sensors
A Survey on Localization of Wireless Sensors
Karthik Mohan
 
Sensor Localization presentation1&2
Sensor Localization  presentation1&2Sensor Localization  presentation1&2
Sensor Localization presentation1&2
gamalsallam1989
 
Localization in WSN
Localization in WSNLocalization in WSN
Localization in WSN
Yara Ali
 
Basics of Wireless sensor networks
Basics of Wireless sensor networksBasics of Wireless sensor networks
Basics of Wireless sensor networks
Rushin Shah
 
Wireless Sensor Networks
Wireless Sensor NetworksWireless Sensor Networks
Wireless Sensor Networks
Karthik
 
Wireless Sensor Networks
Wireless Sensor NetworksWireless Sensor Networks
Wireless Sensor Networks
rajatmal4
 
wireless sensor network my seminar ppt
wireless sensor network my seminar pptwireless sensor network my seminar ppt
wireless sensor network my seminar ppt
Eisha Madhwal
 

Viewers also liked (8)

Localization with mobile anchor points in wireless sensor networks
Localization with mobile anchor points in wireless sensor networksLocalization with mobile anchor points in wireless sensor networks
Localization with mobile anchor points in wireless sensor networks
 
A Survey on Localization of Wireless Sensors
A Survey on Localization of Wireless SensorsA Survey on Localization of Wireless Sensors
A Survey on Localization of Wireless Sensors
 
Sensor Localization presentation1&2
Sensor Localization  presentation1&2Sensor Localization  presentation1&2
Sensor Localization presentation1&2
 
Localization in WSN
Localization in WSNLocalization in WSN
Localization in WSN
 
Basics of Wireless sensor networks
Basics of Wireless sensor networksBasics of Wireless sensor networks
Basics of Wireless sensor networks
 
Wireless Sensor Networks
Wireless Sensor NetworksWireless Sensor Networks
Wireless Sensor Networks
 
Wireless Sensor Networks
Wireless Sensor NetworksWireless Sensor Networks
Wireless Sensor Networks
 
wireless sensor network my seminar ppt
wireless sensor network my seminar pptwireless sensor network my seminar ppt
wireless sensor network my seminar ppt
 

Similar to Generating a time shrunk lecture video by event

Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon TransformHuman Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
Fadwa Fouad
 
Multimodel Operation for Visually1.docx
Multimodel Operation for Visually1.docxMultimodel Operation for Visually1.docx
Multimodel Operation for Visually1.docx
AROCKIAJAYAIECW
 
VIDEO BASED SIGN LANGUAGE RECOGNITION USING CNN-LSTM
VIDEO BASED SIGN LANGUAGE RECOGNITION USING CNN-LSTMVIDEO BASED SIGN LANGUAGE RECOGNITION USING CNN-LSTM
VIDEO BASED SIGN LANGUAGE RECOGNITION USING CNN-LSTM
IRJET Journal
 
FACE COUNTING USING OPEN CV & PYTHON FOR ANALYZING UNUSUAL EVENTS IN CROWDS
FACE COUNTING USING OPEN CV & PYTHON FOR ANALYZING UNUSUAL EVENTS IN CROWDSFACE COUNTING USING OPEN CV & PYTHON FOR ANALYZING UNUSUAL EVENTS IN CROWDS
FACE COUNTING USING OPEN CV & PYTHON FOR ANALYZING UNUSUAL EVENTS IN CROWDS
IRJET Journal
 
SVM Based Saliency Map Technique for Reducing Time Complexity in HEVC
SVM Based Saliency Map Technique for Reducing Time Complexity in HEVCSVM Based Saliency Map Technique for Reducing Time Complexity in HEVC
SVM Based Saliency Map Technique for Reducing Time Complexity in HEVC
IRJET Journal
 
Video Hyperlinking Tutorial (Part B)
Video Hyperlinking Tutorial (Part B)Video Hyperlinking Tutorial (Part B)
Video Hyperlinking Tutorial (Part B)
LinkedTV
 
Surveillance using Video Analytics
Surveillance using Video AnalyticsSurveillance using Video Analytics
Surveillance using Video Analytics
idescitation
 
Design and Analysis of Quantization Based Low Bit Rate Encoding System
Design and Analysis of Quantization Based Low Bit Rate Encoding SystemDesign and Analysis of Quantization Based Low Bit Rate Encoding System
Design and Analysis of Quantization Based Low Bit Rate Encoding System
ijtsrd
 
Parameterized Image Filtering Using fuzzy Logic
Parameterized Image Filtering Using fuzzy LogicParameterized Image Filtering Using fuzzy Logic
Parameterized Image Filtering Using fuzzy Logic
Editor IJCATR
 
AN ENHANCEMENT FOR THE CONSISTENT DEPTH ESTIMATION OF MONOCULAR VIDEOS USING ...
AN ENHANCEMENT FOR THE CONSISTENT DEPTH ESTIMATION OF MONOCULAR VIDEOS USING ...AN ENHANCEMENT FOR THE CONSISTENT DEPTH ESTIMATION OF MONOCULAR VIDEOS USING ...
AN ENHANCEMENT FOR THE CONSISTENT DEPTH ESTIMATION OF MONOCULAR VIDEOS USING ...
mlaij
 
Motion detection system
Motion detection systemMotion detection system
Motion detection system
WritingHubUK
 
Machine vision Application
Machine vision ApplicationMachine vision Application
Machine vision Application
Abhishek Sainkar
 
Machine Vision On Embedded Platform
Machine Vision On Embedded Platform Machine Vision On Embedded Platform
Machine Vision On Embedded Platform
Omkar Rane
 
Object Tracking with Instance Matching and Online Learning
Object Tracking with Instance Matching and Online LearningObject Tracking with Instance Matching and Online Learning
Object Tracking with Instance Matching and Online Learning
Jui-Hsin (Larry) Lai
 
Poster cs543
Poster cs543Poster cs543
Poster cs543
Ramin Anushiravani
 
final_project
final_projectfinal_project
final_project
Inderpreet Kaur
 
LANE DETECTION USING IMAGE PROCESSING IN PYTHON
LANE DETECTION USING IMAGE PROCESSING IN PYTHONLANE DETECTION USING IMAGE PROCESSING IN PYTHON
LANE DETECTION USING IMAGE PROCESSING IN PYTHON
IRJET Journal
 
Review of Pose Recognition Systems
Review of Pose Recognition SystemsReview of Pose Recognition Systems
Review of Pose Recognition Systems
vivatechijri
 
How to prepare a perfect video abstract for your research paper – Pubrica.pptx
How to prepare a perfect video abstract for your research paper – Pubrica.pptxHow to prepare a perfect video abstract for your research paper – Pubrica.pptx
How to prepare a perfect video abstract for your research paper – Pubrica.pptx
Pubrica
 
IRJET- Object Detection and Recognition for Blind Assistance
IRJET- Object Detection and Recognition for Blind AssistanceIRJET- Object Detection and Recognition for Blind Assistance
IRJET- Object Detection and Recognition for Blind Assistance
IRJET Journal
 

Similar to Generating a time shrunk lecture video by event (20)

Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon TransformHuman Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
 
Multimodel Operation for Visually1.docx
Multimodel Operation for Visually1.docxMultimodel Operation for Visually1.docx
Multimodel Operation for Visually1.docx
 
VIDEO BASED SIGN LANGUAGE RECOGNITION USING CNN-LSTM
VIDEO BASED SIGN LANGUAGE RECOGNITION USING CNN-LSTMVIDEO BASED SIGN LANGUAGE RECOGNITION USING CNN-LSTM
VIDEO BASED SIGN LANGUAGE RECOGNITION USING CNN-LSTM
 
FACE COUNTING USING OPEN CV & PYTHON FOR ANALYZING UNUSUAL EVENTS IN CROWDS
FACE COUNTING USING OPEN CV & PYTHON FOR ANALYZING UNUSUAL EVENTS IN CROWDSFACE COUNTING USING OPEN CV & PYTHON FOR ANALYZING UNUSUAL EVENTS IN CROWDS
FACE COUNTING USING OPEN CV & PYTHON FOR ANALYZING UNUSUAL EVENTS IN CROWDS
 
SVM Based Saliency Map Technique for Reducing Time Complexity in HEVC
SVM Based Saliency Map Technique for Reducing Time Complexity in HEVCSVM Based Saliency Map Technique for Reducing Time Complexity in HEVC
SVM Based Saliency Map Technique for Reducing Time Complexity in HEVC
 
Video Hyperlinking Tutorial (Part B)
Video Hyperlinking Tutorial (Part B)Video Hyperlinking Tutorial (Part B)
Video Hyperlinking Tutorial (Part B)
 
Surveillance using Video Analytics
Surveillance using Video AnalyticsSurveillance using Video Analytics
Surveillance using Video Analytics
 
Design and Analysis of Quantization Based Low Bit Rate Encoding System
Design and Analysis of Quantization Based Low Bit Rate Encoding SystemDesign and Analysis of Quantization Based Low Bit Rate Encoding System
Design and Analysis of Quantization Based Low Bit Rate Encoding System
 
Parameterized Image Filtering Using fuzzy Logic
Parameterized Image Filtering Using fuzzy LogicParameterized Image Filtering Using fuzzy Logic
Parameterized Image Filtering Using fuzzy Logic
 
AN ENHANCEMENT FOR THE CONSISTENT DEPTH ESTIMATION OF MONOCULAR VIDEOS USING ...
AN ENHANCEMENT FOR THE CONSISTENT DEPTH ESTIMATION OF MONOCULAR VIDEOS USING ...AN ENHANCEMENT FOR THE CONSISTENT DEPTH ESTIMATION OF MONOCULAR VIDEOS USING ...
AN ENHANCEMENT FOR THE CONSISTENT DEPTH ESTIMATION OF MONOCULAR VIDEOS USING ...
 
Motion detection system
Motion detection systemMotion detection system
Motion detection system
 
Machine vision Application
Machine vision ApplicationMachine vision Application
Machine vision Application
 
Machine Vision On Embedded Platform
Machine Vision On Embedded Platform Machine Vision On Embedded Platform
Machine Vision On Embedded Platform
 
Object Tracking with Instance Matching and Online Learning
Object Tracking with Instance Matching and Online LearningObject Tracking with Instance Matching and Online Learning
Object Tracking with Instance Matching and Online Learning
 
Poster cs543
Poster cs543Poster cs543
Poster cs543
 
final_project
final_projectfinal_project
final_project
 
LANE DETECTION USING IMAGE PROCESSING IN PYTHON
LANE DETECTION USING IMAGE PROCESSING IN PYTHONLANE DETECTION USING IMAGE PROCESSING IN PYTHON
LANE DETECTION USING IMAGE PROCESSING IN PYTHON
 
Review of Pose Recognition Systems
Review of Pose Recognition SystemsReview of Pose Recognition Systems
Review of Pose Recognition Systems
 
How to prepare a perfect video abstract for your research paper – Pubrica.pptx
How to prepare a perfect video abstract for your research paper – Pubrica.pptxHow to prepare a perfect video abstract for your research paper – Pubrica.pptx
How to prepare a perfect video abstract for your research paper – Pubrica.pptx
 
IRJET- Object Detection and Recognition for Blind Assistance
IRJET- Object Detection and Recognition for Blind AssistanceIRJET- Object Detection and Recognition for Blind Assistance
IRJET- Object Detection and Recognition for Blind Assistance
 

Recently uploaded

Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
shadow0702a
 
People as resource Grade IX.pdf minimala
People as resource Grade IX.pdf minimalaPeople as resource Grade IX.pdf minimala
People as resource Grade IX.pdf minimala
riddhimaagrawal986
 
integral complex analysis chapter 06 .pdf
integral complex analysis chapter 06 .pdfintegral complex analysis chapter 06 .pdf
integral complex analysis chapter 06 .pdf
gaafergoudaay7aga
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
MDSABBIROJJAMANPAYEL
 
Generative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of contentGenerative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of content
Hitesh Mohapatra
 
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURSCompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
RamonNovais6
 
An improved modulation technique suitable for a three level flying capacitor ...
An improved modulation technique suitable for a three level flying capacitor ...An improved modulation technique suitable for a three level flying capacitor ...
An improved modulation technique suitable for a three level flying capacitor ...
IJECEIAES
 
BRAIN TUMOR DETECTION for seminar ppt.pdf
BRAIN TUMOR DETECTION for seminar ppt.pdfBRAIN TUMOR DETECTION for seminar ppt.pdf
BRAIN TUMOR DETECTION for seminar ppt.pdf
LAXMAREDDY22
 
22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt
KrishnaveniKrishnara1
 
Material for memory and display system h
Material for memory and display system hMaterial for memory and display system h
Material for memory and display system h
gowrishankartb2005
 
官方认证美国密歇根州立大学毕业证学位证书原版一模一样
官方认证美国密歇根州立大学毕业证学位证书原版一模一样官方认证美国密歇根州立大学毕业证学位证书原版一模一样
官方认证美国密歇根州立大学毕业证学位证书原版一模一样
171ticu
 
Unit-III-ELECTROCHEMICAL STORAGE DEVICES.ppt
Unit-III-ELECTROCHEMICAL STORAGE DEVICES.pptUnit-III-ELECTROCHEMICAL STORAGE DEVICES.ppt
Unit-III-ELECTROCHEMICAL STORAGE DEVICES.ppt
KrishnaveniKrishnara1
 
cnn.pptx Convolutional neural network used for image classication
cnn.pptx Convolutional neural network used for image classicationcnn.pptx Convolutional neural network used for image classication
cnn.pptx Convolutional neural network used for image classication
SakkaravarthiShanmug
 
ITSM Integration with MuleSoft.pptx
ITSM  Integration with MuleSoft.pptxITSM  Integration with MuleSoft.pptx
ITSM Integration with MuleSoft.pptx
VANDANAMOHANGOUDA
 
Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...
IJECEIAES
 
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
insn4465
 
Rainfall intensity duration frequency curve statistical analysis and modeling...
Rainfall intensity duration frequency curve statistical analysis and modeling...Rainfall intensity duration frequency curve statistical analysis and modeling...
Rainfall intensity duration frequency curve statistical analysis and modeling...
bijceesjournal
 
Welding Metallurgy Ferrous Materials.pdf
Welding Metallurgy Ferrous Materials.pdfWelding Metallurgy Ferrous Materials.pdf
Welding Metallurgy Ferrous Materials.pdf
AjmalKhan50578
 
Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...
bijceesjournal
 
132/33KV substation case study Presentation
132/33KV substation case study Presentation132/33KV substation case study Presentation
132/33KV substation case study Presentation
kandramariana6
 

Recently uploaded (20)

Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
 
People as resource Grade IX.pdf minimala
People as resource Grade IX.pdf minimalaPeople as resource Grade IX.pdf minimala
People as resource Grade IX.pdf minimala
 
integral complex analysis chapter 06 .pdf
integral complex analysis chapter 06 .pdfintegral complex analysis chapter 06 .pdf
integral complex analysis chapter 06 .pdf
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
 
Generative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of contentGenerative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of content
 
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURSCompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
 
An improved modulation technique suitable for a three level flying capacitor ...
An improved modulation technique suitable for a three level flying capacitor ...An improved modulation technique suitable for a three level flying capacitor ...
An improved modulation technique suitable for a three level flying capacitor ...
 
BRAIN TUMOR DETECTION for seminar ppt.pdf
BRAIN TUMOR DETECTION for seminar ppt.pdfBRAIN TUMOR DETECTION for seminar ppt.pdf
BRAIN TUMOR DETECTION for seminar ppt.pdf
 
22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt
 
Material for memory and display system h
Material for memory and display system hMaterial for memory and display system h
Material for memory and display system h
 
官方认证美国密歇根州立大学毕业证学位证书原版一模一样
官方认证美国密歇根州立大学毕业证学位证书原版一模一样官方认证美国密歇根州立大学毕业证学位证书原版一模一样
官方认证美国密歇根州立大学毕业证学位证书原版一模一样
 
Unit-III-ELECTROCHEMICAL STORAGE DEVICES.ppt
Unit-III-ELECTROCHEMICAL STORAGE DEVICES.pptUnit-III-ELECTROCHEMICAL STORAGE DEVICES.ppt
Unit-III-ELECTROCHEMICAL STORAGE DEVICES.ppt
 
cnn.pptx Convolutional neural network used for image classication
cnn.pptx Convolutional neural network used for image classicationcnn.pptx Convolutional neural network used for image classication
cnn.pptx Convolutional neural network used for image classication
 
ITSM Integration with MuleSoft.pptx
ITSM  Integration with MuleSoft.pptxITSM  Integration with MuleSoft.pptx
ITSM Integration with MuleSoft.pptx
 
Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...
 
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
 
Rainfall intensity duration frequency curve statistical analysis and modeling...
Rainfall intensity duration frequency curve statistical analysis and modeling...Rainfall intensity duration frequency curve statistical analysis and modeling...
Rainfall intensity duration frequency curve statistical analysis and modeling...
 
Welding Metallurgy Ferrous Materials.pdf
Welding Metallurgy Ferrous Materials.pdfWelding Metallurgy Ferrous Materials.pdf
Welding Metallurgy Ferrous Materials.pdf
 
Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...
 
132/33KV substation case study Presentation
132/33KV substation case study Presentation132/33KV substation case study Presentation
132/33KV substation case study Presentation
 

Generating a time shrunk lecture video by event

  • 1. 1 Generating a time shrunk lecture video by event detection Presented by: Mona Ragheb Yara Ali Supervised by: Dr. Aliaa Youssif
  • 2. 2 Agenda  Introduction  Generating lecture video using virtual camera work  Event detection steps  Evaluation  Results  Conclusion  References
  • 3. 3 Introduction  E-learning has become a popular method used in higher education.  However, video recording by a cameraman and video editing takes a long time and costs a great deal.  To solve this problem, a system has been developed to generate dynamic lecture video using virtual camerawork from the high resolution images recorded by a HDV (high-definition video) camcorder
  • 4. 4 How the system works?  The system generates a lecture video: 1. Using virtual camerawork based on shooting techniques of broadcast cameramen 2. By cropping from the high resolution image to track the region of interest (ROI) such as the instructor. 3. Generate time shrunk video using event detection  Camera motion analysis is used to detect scene changes.
  • 5. 5 Shooting techniques  People invariably make the same sets of mistakes when they first start shooting video: 1. Trees or telephone poles sticking out of the back of someone's head 2. Interview subjects who are just darkened blurs because there was bright light in the background 3. Boring shots of buildings with no action
  • 6. 6 Event Detection  Two kinds of events were detected: 1. A speech period 2. Chalkboard writing period
  • 7. 7 Kinds of event detection 1. Speech period: It’s detected by voice activity detection with LPC cepstrum and classified into speech or non-speech using Mahalanobis distance
  • 8. 8 LPC  Linear predictive coding (LPC) is a tool used mostly in audio signal processing and speech processing for representing the spectral envelope of a digital signal of speech in compressed form, using the information of a linear predictive model.  one of the most useful methods for encoding good quality speech at a low bit rate and provides extremely accurate estimates of speech parameters.  A spectral envelope is a curve in the frequency-amplitude plane, derived from a Fourier magnitude spectrum. It describes one point in time (one window, to be precise).
  • 9. 9 How LPC works?  LPC analyzes the speech signal by estimating the formants, removing their effects from the speech signal, and estimating the intensity and frequency of the remaining buzz.  The process of removing the formants is called inverse filtering, and the remaining signal after the subtraction of the filtered modeled signal is called the residue.  use the buzz parameters and the residue to create a source signal, use the formants to create a filter (which represents the tube), and run the source through the filter, resulting in speech.
  • 11. 11 Kinds of event detection (Cont.) 2. Chalkboard writing period  It’s detected by using a graph cuts technique to segment a precise region of interests such as an instructor.  By deleting content-free period, i.e, period without the events of speech and writing, and fast-forwarding writing periods, our method can generate a time shrunk lecture video automatically.
  • 12. 12 Generating lecture video using virtual camera work  A HDV camcorder is located at the back of the classroom to videotape images with high resolution (1,400 × 810 pixels), which contain the whole area of the chalkboard, so that students can read the handwritten characters on the chalkboard.  Problem: it is impossible to display the high resolution image on the small screen of a general notebook PC.
  • 13. 13 Generating lecture video using virtual camera work (Cont.)
  • 14. 14 Solution? 1. The system detects a moving object by temporal differencing 2. The timing for virtual camerawork is detected using bilateral filtering and zero crossing.
  • 15. Bilateral Filter  The bilateral filter was introduced by Tomasi et al. as a non-iterative means of smoothing images while retaining edge detail.  It involves a weighted convolution in which the weight for each pixel depends not only on its distance from the center pixel, but also its relative intensity. 15
  • 16. 16 Bilateral Filter • (a) and (b) show the potential of bilateral filtering for the removal of texture. The picture "simplification" illustrated by figure 2 (b) can be useful for data reduction without loss of overall shape features in applications such as image transmission, picture editing and manipulation, image description for retrieval.
  • 17. 17 Generating lecture video using virtual camerawork (Cont.)  If the ROI has a large movement, this period of the video is classified into panning, and if the ROI has no motion but voice activity, this period is classified into zooming.  Panning is used to show motion and speed. It is a technique that requires practice since it has to be done in one smooth, continuous motion.
  • 18. 18 Event detection steps 1. Voice activity detection 2. Chalkboard writing detection 1. Object detection and segmentation 2. Generation of current chalkboard image 3. Chalkboard writing detection 3. Generating a time shrunk video
  • 19. 19 1- Voice activity detection
  • 20. 1- Voice activity detection (Cont.)  Whenever you do a finite Fourier transform, you're implicitly applying it to an infinitely repeating signal. So, for instance, if the start and end of your finite sample don't match then that will look just like a discontinuity in the signal, and show up as lots of high-frequency nonsense in the Fourier transform, which you don't really want. 20
  • 21. 1- Voice activity detection (Cont.)  If your sample happens to be a beautiful sinusoid but an integer number of periods don't happen to fit exactly into the finite sample, your FT will show appreciable energy in all sorts of places nowhere near the real frequency. You don't want any of that.  Windowing the data makes sure that the ends match up while keeping everything reasonably smooth; this greatly reduces the sort of "spectral leakage" described in the previous paragraph 21
  • 22. 2-Chalkboard writing detection 1-Object detection and segmentation  Extracting a precise object region is needed for detecting periods of writing characters on chalkboard.  Temporal differencing ( object detection ) is robust to lighting change.  Temporal differencing can not extract all foreground pixels of moving objects so another technique is used to support ( Graph cuts technique )
  • 23. 23 2-Chalkboard writing detection 1-Object detection and segmentation
  • 24. 2- Chalkboard writing detection 2-1-Object detection and segmentation (Cont.) 24
  • 25. Image Segmentation  Image segmentation is an important problem in computer vision and medical image analysis.  The objective of image segmentation is to provide a visually meaningful partition of the image domain. Although it is usually an easy task for human to separate background and different objects for a given image. 25
  • 26. 26 2- Chalkboard writing detection 2-2- Generation of current chalkboard image
  • 27. 27 2- Chalkboard writing detection 2-3- Chalkboard writing detection
  • 28. 28 2-3-Chalkboard writing detection (Cont.)
  • 29. 29 3- Generating a time shrunk video
  • 30. 30 3- Generating a time shrunk video
  • 31. 31 Evaluation  We videotaped 3 lectures (each video is 90 min long) by HDV camcorder.  In this evaluation, we use “recall” and “precision” for determining effectiveness of detection result.
  • 32. Precision Vs Recall • Precision (also called positive predictive value) • Recall (also known as sensitivity) both calculate no of correct returned in in total frames  In this figure the relevant items are to the left of the straight line while the retrieved items are within the oval. The red regions represent errors. On the left these are the relevant items not retrieved (false negatives), while on the right they are the retrieved items that are not relevant (false positives).
  • 33. 33 Results  There are 200 false positive frames in a 90 min video because of the students’ voices.
  • 36. 36 Conclusion  This paper presents a novel approach for generating a time shrunk lecture video using event detection.  Our method detects speech periods by voice activity detection and chalk board writing periods by a combination of object detection and segmentation techniques.  By deleting the content-free periods and fast-forwarding the chalkboard writing periods, our method can generate a time shrunk lecture video automatically.  The resulting generated video is about 20%∼30% shorter than the original video in time. This is almost the same as the results of manual editing by a human operator.
  • 37. 37 References 1. http://www.vision.cs.chubu.ac.jp/04/pdf/e-learning08.pdf 2. http://research.cs.tamu.edu/prism/lectures/sp/l9.pdf 3. http://www.princeton.edu/~achaney/tmve/wiki100k/docs/4. http://www.ee.columbia.edu/~dpwe/e4896/lectures /E4896-L06.pdf

Editor's Notes

  1. هو تنسيق لتسجيل الفيديو عالي الوضوح على DV كاسيت الشريط . [1] وضعت أصلا تنسيق من قبل JVC وبدعم من سوني ، كانون و شارب . [2] إن الشركات الأربع شكلت كونسورتيوم HDV في سبتمبر 2003.
  2. We propose a method for generating a dynamic lecture video from the high resolution images recorded by a HDV camcorder. The lecture images are cropped to track the region of the instructor. Our approach uses bilateral filtering to avoid jittery motion caused by temporal differencing, and pseudo camera motion (panning) based on shooting technique of broadcast cameraman is generated. We evaluated our method, and showed that the effectiveness of our algorithm was verified through subjective experiments.
  3. bilateral filtering method to avoid jitter motion. Bilateral Filtering smoothes images while
  4. هي أداة تستخدم في الغالب في معالجة الإشارات السمعية و معالجة الكلام عن تمثيل المغلف الطيفية ل الرقمية إشارة من الكلام في مضغوط شكل، وذلك باستخدام معلومات ذات التنبؤي الخطي نموذج. [1] وهو واحد من تقنيات تحليل الخطاب أقوى، وإحدى الطرق الأكثر فائدة لترميز الكلام نوعية جيدة بسعر منخفض قليلا، ويقدم تقديرات دقيقة للغاية من المعلمات الكلام.
  5. Linear pre diction is a mathematical operation where future values of a discrete-time signal are estimated as a linear function of previous samples التنبؤ الخطي هو عملية حسابية حيث يتم تقدير القيمة المستقبلية للإشارة متقطعة الزمن كدالة خطية من العينات السابقة
  6. Panning is used to show motion and speed. It is a technique that requires practice since it has to be done in one smooth, continuous motion.