SlideShare a Scribd company logo
CMU-Informedia @ TRECVID 2011
Surveillance Event Detection
Speaker: Lu Jiang
Longfei Zhang , Lu Jiang , Lei Bao, Shohei Takahashi, Yuanpeng Li,
Alexander Hauptmann
Carnegie Mellon University
SED11 Team
 Team members:
Longfei Lu Lei Shohei Yuanpeng
Alex
Outline
 Framework
 MoSIFT based Action Recognition
 MoSIFT feature
 Spatial Bag of Word
 Tackling highly imbalanced datasets
 Experiment Results
Framework
Video Person
Detection
Cascade
SVM
Filtering
Spatio-
Temporal
Feature
Detection
Background
Subtraction
Spatial Bag-
of-Word
Slidingwindow
Random
Forest
Visual vocabulary
K-means
(k = 3000)
Hot Region
detection
Classification
• Augmented Boosted Cascade
Framework
Video Person
Detection
Cascade
SVM
Filtering
Spatio-
Temporal
Feature
Detection
Background
Subtraction
Spatial Bag-
of-Word
Slidingwindow
Random
Forest
Visual vocabulary
K-means
(k = 3000)
Hot Region
detection
Classification
• Augmented Boosted Cascade
MoSIFT
• Given pairs of video frames, detect spatio-temporal interest points
at multiple scales.
• SIFT point detection with sufficient optical flow.
• Describing SIFT points through SIFT descriptor and optical flow.
Spatial Bag of Words
• Each frame is divided into a set of non-overlapping rectangular tiles.
• The resulting BoW features are derived by concatenating the BoW
features captured in each tile.
• Encode the spatial (tile) information in BoW.
Hot Region Detection
• Person Detection: Person detection based on
Histogram of Oriented Gradient (HOG) features.
• Background subtraction.
Spatial Bag of Features
• Each frame is divided into a set of rectangular tiles or grids.
• The resulting Bow features are derived by concatenating the BoW
features captured in each grid.
• Encode the adjusted spatial information in BoW.
1×1 2×2 1×3
720
576
720 720
173
173
230
Spatial Bag of Features
• Each frame is divided into a set of rectangular tiles or grids.
• The resulting Bow features are derived by concatenating the BoW
features captured in each grid.
• Encode the adjusted spatial information in BoW.
Tackling the highly imbalanced data
• Augmented Cascade SVM.
• Bagging classification method except it adopts
probabilistic sampling to select negative samples in
a sequential manner.
Training
Dataset
Sub Dataset 1
Classifier 1
Positive Samples
Negative Samples
Tackling the highly imbalanced data
• Augmented Cascade SVM.
• Bagging classification method except it adopts
probabilistic sampling to select negative samples in
a sequential manner.
Training
Dataset
Sub Dataset 1
Classifier 1
Tackling the highly imbalanced data
• Augmented Cascade SVM.
• Bagging classification method except it adopts
probabilistic sampling to select negative samples in
a sequential manner.
Training
Dataset
Sub Dataset 1
Classifier 1
0.8
0.7
0.3
0.9
0.1
0.1
Tackling the highly imbalanced data
• Augmented Cascade SVM.
• Bagging classification method except it adopts
probabilistic sampling to select negative samples in
a sequential manner.
Training
Dataset
Sub Dataset 1
Classifier 1
0.8
0.2
0.3
0.9
0.1
0.1
Sub Dataset 2
Classifier 2
… …
Tackling the highly imbalanced data
• Augmented Cascade SVM.
• Bagging classification method except it adopts
probabilistic sampling to select negative samples in
a sequential manner. N = 10 layers.
Training
Dataset
Sub Dataset 1
Classifier 1
Sub Dataset 2
Classifier 2
Sub Dataset N
Classifier N
… …
Tackling highly imbalanced data
Bagging Ensemble of Random Forests
• Random Forest is a forest of decision trees.
• Two parameters:
– n is the number of trees in the forest.
– m the number of features in each decision tree.
• Build each decision tree by randomly selecting m
features and use C4.5.
• Each tree is grown without pruning.
Tackling highly imbalanced data
Bagging Random Forest: Ensemble of Random Forests
• Random Forest is a forest of decision trees.
• Two parameters:
– n is the number of trees in the forest.
– m the number of features in each decision tree.
• Build each decision tree by randomly selecting m features
• Each tree is grown without pruning.
Cascade SVM vs. Bagging Random Forest
Cascade SVM
(chi2 kernel)
Bagging Random Forest
Effectiveness Most Effective Usually 3-8% less in Average Precision
Efficiency Time consuming Usually tens to hundreds of times faster
Sensitive to
Parameter
settings
Sensitive Relatively insensitive
Results
• 8 Submissions:
• The first 6 runs use cascade SVM with different
sliding window sizes and parameter sets.
• Last 2 runs use bagging random forest method.
Results
• Results for Primary run:
Inputs Actual DCR Minimum DCR
#Targ #NTarg #Sys #CorDet #CorDet #FA #Miss DCR DCR
CellToEar 194 127 128 1 0 127 193 1.0365 1.0003
Embrace 175 657 715 58 0 657 117 0.8840 0.8658
ObjectPut 621 57 58 1 0 57 620 1.0171 1.0003
PeopleMeet 449 336 381 45 0 336 404 1.0100 0.9724
PeopleSplitUp 187 115 118 3 0 115 184 1.0217 1.0003
PersonRuns 107 413 439 26 0 413 81 0.8924 0.8370
Pointing 1063 1960 2092 132 0 1960 931 1.5186 1.0001
Results
Compared with our primary run with those of other teams.
We have the best Min DCR in 3 out of 6 events.
Results
Compared with our primary run with those of other teams.
We have the best Actual DCR in 3 out of 7 events.
Results
Compared with our last year’s result, we get improvement in
terms of MIN DCR in 5 events “Embrace”, “People Meet”,
“People Slit up”, “Person Runs” and “Pointing”.
• Best event results over all CMU runs
Min DCR Cell To
Ear
Embrace Object
Put
People
Meet
People
Split Up
Person
Runs Pointing
2010 CMU 1.0003 0.9838 1.0003 0.9793 0.9889 0.9477 1.0003
2010 Overall
Best Event
1 0.9663 0.9971 0.9787 0.9889 0.6818 0.996
2011 CMU 1.0003 0.8658 1.0003 0.9684 0.7838 0.837 0.9996
Min DCR Cell To
Ear
Embrace Object
Put
People
Meet
People
Split Up
Person
Runs Pointing
2010 CMU 1.0003 0.9838 1.0003 0.9793 0.9889 0.9477 1.0003
2010 Overall
Best Event
1 0.9663 0.9971 0.9787 0.9889 0.6818 0.996
2011 CMU 1.0003 0.8658 1.0003 0.9684 0.7838 0.837 0.9996
Results
Compared with the best event results in TRECVID 2010, for event
“Embrace”, “PeopleMeet” and “People Split Up” ours are the
best system.
Cascade SVM vs. Random Forest
• Comparison between Run 1 (Cascade SVM) and Run
7 (Random Forest) in terms of Min DCR.
Threshold Search
• Searching for Min DCR using cross validation.
• Actual DCR provides reasonable estimates of Min
DCR on all runs.
Primary Run
Impact of sliding window size
• Results for all events with sliding window size 25 frames (Run 3).
Impact of sliding window size
• Results for all events with sliding window size 60 (Run 5).
Event-specific sliding window size
• For PersonRuns, CellToEar, Embrace and Pointing a good sliding window is small.
• For Embrace, ObjectPut and PeopleMeet a good sliding window size is larger.
Conclusions
 Observations:
 MoSIFT feature captures salient motions in videos.
 Spatial Bag of Words can boost the performance over last
year’s result.
 Event-specific sliding window size impacts the final result.
 Both cascade SVM and bagging random forest can handle
highly imbalanced data sets. Random forest is much faster.
CMU Trecvid sed11

More Related Content

Similar to CMU Trecvid sed11

[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
KAIST
 
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
KAIST
 
01-pengantar.pdf
01-pengantar.pdf01-pengantar.pdf
01-pengantar.pdf
ssuseradaf5f
 
Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...
Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...
Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...
Sergey Karayev
 
Battista Biggio @ AISec 2013 - Is Data Clustering in Adversarial Settings Sec...
Battista Biggio @ AISec 2013 - Is Data Clustering in Adversarial Settings Sec...Battista Biggio @ AISec 2013 - Is Data Clustering in Adversarial Settings Sec...
Battista Biggio @ AISec 2013 - Is Data Clustering in Adversarial Settings Sec...
Pluribus One
 
Threat Detection in Surveillance Videos
Threat Detection in Surveillance VideosThreat Detection in Surveillance Videos
Threat Detection in Surveillance Videos
Databricks
 
Machine learning algorithm for classification of activity of daily life’s
Machine learning algorithm for classification of activity of daily life’sMachine learning algorithm for classification of activity of daily life’s
Machine learning algorithm for classification of activity of daily life’s
Siddharth Chakravarty
 
Sparse representation based human action recognition using an action region-a...
Sparse representation based human action recognition using an action region-a...Sparse representation based human action recognition using an action region-a...
Sparse representation based human action recognition using an action region-a...
Wesley De Neve
 
20141030 Feature-Value DSAA 2014 V6_FINAL
20141030 Feature-Value DSAA 2014 V6_FINAL20141030 Feature-Value DSAA 2014 V6_FINAL
20141030 Feature-Value DSAA 2014 V6_FINAL
Gunarto Sindoro
 
Extend Your Journey: Introducing Signal Strength into Location-based Applicat...
Extend Your Journey: Introducing Signal Strength into Location-based Applicat...Extend Your Journey: Introducing Signal Strength into Location-based Applicat...
Extend Your Journey: Introducing Signal Strength into Location-based Applicat...
Chih-Chuan Cheng
 
Network Intrusion Detection System Using Machine Learning and Deep Learning F...
Network Intrusion Detection System Using Machine Learning and Deep Learning F...Network Intrusion Detection System Using Machine Learning and Deep Learning F...
Network Intrusion Detection System Using Machine Learning and Deep Learning F...
Leaving A Legacy
 
Adam Ashenfelter - Finding the Oddballs
Adam Ashenfelter - Finding the OddballsAdam Ashenfelter - Finding the Oddballs
Adam Ashenfelter - Finding the Oddballs
Machine Learning Prague
 
AI Class Topic 6: Easy Way to Learn Deep Learning AI Technologies
AI Class Topic 6: Easy Way to Learn Deep Learning AI TechnologiesAI Class Topic 6: Easy Way to Learn Deep Learning AI Technologies
AI Class Topic 6: Easy Way to Learn Deep Learning AI Technologies
Value Amplify Consulting
 
Declarative data analysis
Declarative data analysisDeclarative data analysis
Declarative data analysis
South West Data Meetup
 
Deep Learning in Computer Vision
Deep Learning in Computer VisionDeep Learning in Computer Vision
Deep Learning in Computer Vision
Sungjoon Choi
 
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHESIMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
Vikash Kumar
 
Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)
DonghyunKang12
 
Regression and Classification: An Artificial Neural Network Approach
Regression and Classification: An Artificial Neural Network ApproachRegression and Classification: An Artificial Neural Network Approach
Regression and Classification: An Artificial Neural Network Approach
Khulna University
 
slide-171212080528.pptx
slide-171212080528.pptxslide-171212080528.pptx
slide-171212080528.pptx
SharanrajK22MMT1003
 
Real Time Object Dectection using machine learning
Real Time Object Dectection using machine learningReal Time Object Dectection using machine learning
Real Time Object Dectection using machine learning
pratik pratyay
 

Similar to CMU Trecvid sed11 (20)

[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
 
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
 
01-pengantar.pdf
01-pengantar.pdf01-pengantar.pdf
01-pengantar.pdf
 
Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...
Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...
Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...
 
Battista Biggio @ AISec 2013 - Is Data Clustering in Adversarial Settings Sec...
Battista Biggio @ AISec 2013 - Is Data Clustering in Adversarial Settings Sec...Battista Biggio @ AISec 2013 - Is Data Clustering in Adversarial Settings Sec...
Battista Biggio @ AISec 2013 - Is Data Clustering in Adversarial Settings Sec...
 
Threat Detection in Surveillance Videos
Threat Detection in Surveillance VideosThreat Detection in Surveillance Videos
Threat Detection in Surveillance Videos
 
Machine learning algorithm for classification of activity of daily life’s
Machine learning algorithm for classification of activity of daily life’sMachine learning algorithm for classification of activity of daily life’s
Machine learning algorithm for classification of activity of daily life’s
 
Sparse representation based human action recognition using an action region-a...
Sparse representation based human action recognition using an action region-a...Sparse representation based human action recognition using an action region-a...
Sparse representation based human action recognition using an action region-a...
 
20141030 Feature-Value DSAA 2014 V6_FINAL
20141030 Feature-Value DSAA 2014 V6_FINAL20141030 Feature-Value DSAA 2014 V6_FINAL
20141030 Feature-Value DSAA 2014 V6_FINAL
 
Extend Your Journey: Introducing Signal Strength into Location-based Applicat...
Extend Your Journey: Introducing Signal Strength into Location-based Applicat...Extend Your Journey: Introducing Signal Strength into Location-based Applicat...
Extend Your Journey: Introducing Signal Strength into Location-based Applicat...
 
Network Intrusion Detection System Using Machine Learning and Deep Learning F...
Network Intrusion Detection System Using Machine Learning and Deep Learning F...Network Intrusion Detection System Using Machine Learning and Deep Learning F...
Network Intrusion Detection System Using Machine Learning and Deep Learning F...
 
Adam Ashenfelter - Finding the Oddballs
Adam Ashenfelter - Finding the OddballsAdam Ashenfelter - Finding the Oddballs
Adam Ashenfelter - Finding the Oddballs
 
AI Class Topic 6: Easy Way to Learn Deep Learning AI Technologies
AI Class Topic 6: Easy Way to Learn Deep Learning AI TechnologiesAI Class Topic 6: Easy Way to Learn Deep Learning AI Technologies
AI Class Topic 6: Easy Way to Learn Deep Learning AI Technologies
 
Declarative data analysis
Declarative data analysisDeclarative data analysis
Declarative data analysis
 
Deep Learning in Computer Vision
Deep Learning in Computer VisionDeep Learning in Computer Vision
Deep Learning in Computer Vision
 
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHESIMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
 
Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)
 
Regression and Classification: An Artificial Neural Network Approach
Regression and Classification: An Artificial Neural Network ApproachRegression and Classification: An Artificial Neural Network Approach
Regression and Classification: An Artificial Neural Network Approach
 
slide-171212080528.pptx
slide-171212080528.pptxslide-171212080528.pptx
slide-171212080528.pptx
 
Real Time Object Dectection using machine learning
Real Time Object Dectection using machine learningReal Time Object Dectection using machine learning
Real Time Object Dectection using machine learning
 

Recently uploaded

Oracle 23c New Features For DBAs and Developers.pptx
Oracle 23c New Features For DBAs and Developers.pptxOracle 23c New Features For DBAs and Developers.pptx
Oracle 23c New Features For DBAs and Developers.pptx
Remote DBA Services
 
All you need to know about Spring Boot and GraalVM
All you need to know about Spring Boot and GraalVMAll you need to know about Spring Boot and GraalVM
All you need to know about Spring Boot and GraalVM
Alina Yurenko
 
UI5con 2024 - Bring Your Own Design System
UI5con 2024 - Bring Your Own Design SystemUI5con 2024 - Bring Your Own Design System
UI5con 2024 - Bring Your Own Design System
Peter Muessig
 
如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样
如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样
如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样
gapen1
 
GreenCode-A-VSCode-Plugin--Dario-Jurisic
GreenCode-A-VSCode-Plugin--Dario-JurisicGreenCode-A-VSCode-Plugin--Dario-Jurisic
GreenCode-A-VSCode-Plugin--Dario-Jurisic
Green Software Development
 
Safelyio Toolbox Talk Softwate & App (How To Digitize Safety Meetings)
Safelyio Toolbox Talk Softwate & App (How To Digitize Safety Meetings)Safelyio Toolbox Talk Softwate & App (How To Digitize Safety Meetings)
Safelyio Toolbox Talk Softwate & App (How To Digitize Safety Meetings)
safelyiotech
 
WWDC 2024 Keynote Review: For CocoaCoders Austin
WWDC 2024 Keynote Review: For CocoaCoders AustinWWDC 2024 Keynote Review: For CocoaCoders Austin
WWDC 2024 Keynote Review: For CocoaCoders Austin
Patrick Weigel
 
J-Spring 2024 - Going serverless with Quarkus, GraalVM native images and AWS ...
J-Spring 2024 - Going serverless with Quarkus, GraalVM native images and AWS ...J-Spring 2024 - Going serverless with Quarkus, GraalVM native images and AWS ...
J-Spring 2024 - Going serverless with Quarkus, GraalVM native images and AWS ...
Bert Jan Schrijver
 
Mobile App Development Company In Noida | Drona Infotech
Mobile App Development Company In Noida | Drona InfotechMobile App Development Company In Noida | Drona Infotech
Mobile App Development Company In Noida | Drona Infotech
Drona Infotech
 
Top Benefits of Using Salesforce Healthcare CRM for Patient Management.pdf
Top Benefits of Using Salesforce Healthcare CRM for Patient Management.pdfTop Benefits of Using Salesforce Healthcare CRM for Patient Management.pdf
Top Benefits of Using Salesforce Healthcare CRM for Patient Management.pdf
VALiNTRY360
 
zOS Mainframe JES2-JES3 JCL-JECL Differences
zOS Mainframe JES2-JES3 JCL-JECL DifferenceszOS Mainframe JES2-JES3 JCL-JECL Differences
zOS Mainframe JES2-JES3 JCL-JECL Differences
YousufSait3
 
Oracle Database 19c New Features for DBAs and Developers.pptx
Oracle Database 19c New Features for DBAs and Developers.pptxOracle Database 19c New Features for DBAs and Developers.pptx
Oracle Database 19c New Features for DBAs and Developers.pptx
Remote DBA Services
 
Malibou Pitch Deck For Its €3M Seed Round
Malibou Pitch Deck For Its €3M Seed RoundMalibou Pitch Deck For Its €3M Seed Round
Malibou Pitch Deck For Its €3M Seed Round
sjcobrien
 
UI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
UI5con 2024 - Boost Your Development Experience with UI5 Tooling ExtensionsUI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
UI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
Peter Muessig
 
一比一原版(UMN毕业证)明尼苏达大学毕业证如何办理
一比一原版(UMN毕业证)明尼苏达大学毕业证如何办理一比一原版(UMN毕业证)明尼苏达大学毕业证如何办理
一比一原版(UMN毕业证)明尼苏达大学毕业证如何办理
dakas1
 
The Key to Digital Success_ A Comprehensive Guide to Continuous Testing Integ...
The Key to Digital Success_ A Comprehensive Guide to Continuous Testing Integ...The Key to Digital Success_ A Comprehensive Guide to Continuous Testing Integ...
The Key to Digital Success_ A Comprehensive Guide to Continuous Testing Integ...
kalichargn70th171
 
How Can Hiring A Mobile App Development Company Help Your Business Grow?
How Can Hiring A Mobile App Development Company Help Your Business Grow?How Can Hiring A Mobile App Development Company Help Your Business Grow?
How Can Hiring A Mobile App Development Company Help Your Business Grow?
ToXSL Technologies
 
Using Query Store in Azure PostgreSQL to Understand Query Performance
Using Query Store in Azure PostgreSQL to Understand Query PerformanceUsing Query Store in Azure PostgreSQL to Understand Query Performance
Using Query Store in Azure PostgreSQL to Understand Query Performance
Grant Fritchey
 
ALGIT - Assembly Line for Green IT - Numbers, Data, Facts
ALGIT - Assembly Line for Green IT - Numbers, Data, FactsALGIT - Assembly Line for Green IT - Numbers, Data, Facts
ALGIT - Assembly Line for Green IT - Numbers, Data, Facts
Green Software Development
 
Lecture 2 - software testing SE 412.pptx
Lecture 2 - software testing SE 412.pptxLecture 2 - software testing SE 412.pptx
Lecture 2 - software testing SE 412.pptx
TaghreedAltamimi
 

Recently uploaded (20)

Oracle 23c New Features For DBAs and Developers.pptx
Oracle 23c New Features For DBAs and Developers.pptxOracle 23c New Features For DBAs and Developers.pptx
Oracle 23c New Features For DBAs and Developers.pptx
 
All you need to know about Spring Boot and GraalVM
All you need to know about Spring Boot and GraalVMAll you need to know about Spring Boot and GraalVM
All you need to know about Spring Boot and GraalVM
 
UI5con 2024 - Bring Your Own Design System
UI5con 2024 - Bring Your Own Design SystemUI5con 2024 - Bring Your Own Design System
UI5con 2024 - Bring Your Own Design System
 
如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样
如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样
如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样
 
GreenCode-A-VSCode-Plugin--Dario-Jurisic
GreenCode-A-VSCode-Plugin--Dario-JurisicGreenCode-A-VSCode-Plugin--Dario-Jurisic
GreenCode-A-VSCode-Plugin--Dario-Jurisic
 
Safelyio Toolbox Talk Softwate & App (How To Digitize Safety Meetings)
Safelyio Toolbox Talk Softwate & App (How To Digitize Safety Meetings)Safelyio Toolbox Talk Softwate & App (How To Digitize Safety Meetings)
Safelyio Toolbox Talk Softwate & App (How To Digitize Safety Meetings)
 
WWDC 2024 Keynote Review: For CocoaCoders Austin
WWDC 2024 Keynote Review: For CocoaCoders AustinWWDC 2024 Keynote Review: For CocoaCoders Austin
WWDC 2024 Keynote Review: For CocoaCoders Austin
 
J-Spring 2024 - Going serverless with Quarkus, GraalVM native images and AWS ...
J-Spring 2024 - Going serverless with Quarkus, GraalVM native images and AWS ...J-Spring 2024 - Going serverless with Quarkus, GraalVM native images and AWS ...
J-Spring 2024 - Going serverless with Quarkus, GraalVM native images and AWS ...
 
Mobile App Development Company In Noida | Drona Infotech
Mobile App Development Company In Noida | Drona InfotechMobile App Development Company In Noida | Drona Infotech
Mobile App Development Company In Noida | Drona Infotech
 
Top Benefits of Using Salesforce Healthcare CRM for Patient Management.pdf
Top Benefits of Using Salesforce Healthcare CRM for Patient Management.pdfTop Benefits of Using Salesforce Healthcare CRM for Patient Management.pdf
Top Benefits of Using Salesforce Healthcare CRM for Patient Management.pdf
 
zOS Mainframe JES2-JES3 JCL-JECL Differences
zOS Mainframe JES2-JES3 JCL-JECL DifferenceszOS Mainframe JES2-JES3 JCL-JECL Differences
zOS Mainframe JES2-JES3 JCL-JECL Differences
 
Oracle Database 19c New Features for DBAs and Developers.pptx
Oracle Database 19c New Features for DBAs and Developers.pptxOracle Database 19c New Features for DBAs and Developers.pptx
Oracle Database 19c New Features for DBAs and Developers.pptx
 
Malibou Pitch Deck For Its €3M Seed Round
Malibou Pitch Deck For Its €3M Seed RoundMalibou Pitch Deck For Its €3M Seed Round
Malibou Pitch Deck For Its €3M Seed Round
 
UI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
UI5con 2024 - Boost Your Development Experience with UI5 Tooling ExtensionsUI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
UI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
 
一比一原版(UMN毕业证)明尼苏达大学毕业证如何办理
一比一原版(UMN毕业证)明尼苏达大学毕业证如何办理一比一原版(UMN毕业证)明尼苏达大学毕业证如何办理
一比一原版(UMN毕业证)明尼苏达大学毕业证如何办理
 
The Key to Digital Success_ A Comprehensive Guide to Continuous Testing Integ...
The Key to Digital Success_ A Comprehensive Guide to Continuous Testing Integ...The Key to Digital Success_ A Comprehensive Guide to Continuous Testing Integ...
The Key to Digital Success_ A Comprehensive Guide to Continuous Testing Integ...
 
How Can Hiring A Mobile App Development Company Help Your Business Grow?
How Can Hiring A Mobile App Development Company Help Your Business Grow?How Can Hiring A Mobile App Development Company Help Your Business Grow?
How Can Hiring A Mobile App Development Company Help Your Business Grow?
 
Using Query Store in Azure PostgreSQL to Understand Query Performance
Using Query Store in Azure PostgreSQL to Understand Query PerformanceUsing Query Store in Azure PostgreSQL to Understand Query Performance
Using Query Store in Azure PostgreSQL to Understand Query Performance
 
ALGIT - Assembly Line for Green IT - Numbers, Data, Facts
ALGIT - Assembly Line for Green IT - Numbers, Data, FactsALGIT - Assembly Line for Green IT - Numbers, Data, Facts
ALGIT - Assembly Line for Green IT - Numbers, Data, Facts
 
Lecture 2 - software testing SE 412.pptx
Lecture 2 - software testing SE 412.pptxLecture 2 - software testing SE 412.pptx
Lecture 2 - software testing SE 412.pptx
 

CMU Trecvid sed11

  • 1. CMU-Informedia @ TRECVID 2011 Surveillance Event Detection Speaker: Lu Jiang Longfei Zhang , Lu Jiang , Lei Bao, Shohei Takahashi, Yuanpeng Li, Alexander Hauptmann Carnegie Mellon University
  • 2. SED11 Team  Team members: Longfei Lu Lei Shohei Yuanpeng Alex
  • 3. Outline  Framework  MoSIFT based Action Recognition  MoSIFT feature  Spatial Bag of Word  Tackling highly imbalanced datasets  Experiment Results
  • 6. MoSIFT • Given pairs of video frames, detect spatio-temporal interest points at multiple scales. • SIFT point detection with sufficient optical flow. • Describing SIFT points through SIFT descriptor and optical flow.
  • 7. Spatial Bag of Words • Each frame is divided into a set of non-overlapping rectangular tiles. • The resulting BoW features are derived by concatenating the BoW features captured in each tile. • Encode the spatial (tile) information in BoW.
  • 8. Hot Region Detection • Person Detection: Person detection based on Histogram of Oriented Gradient (HOG) features. • Background subtraction.
  • 9. Spatial Bag of Features • Each frame is divided into a set of rectangular tiles or grids. • The resulting Bow features are derived by concatenating the BoW features captured in each grid. • Encode the adjusted spatial information in BoW. 1×1 2×2 1×3 720 576 720 720 173 173 230
  • 10. Spatial Bag of Features • Each frame is divided into a set of rectangular tiles or grids. • The resulting Bow features are derived by concatenating the BoW features captured in each grid. • Encode the adjusted spatial information in BoW.
  • 11. Tackling the highly imbalanced data • Augmented Cascade SVM. • Bagging classification method except it adopts probabilistic sampling to select negative samples in a sequential manner. Training Dataset Sub Dataset 1 Classifier 1 Positive Samples Negative Samples
  • 12. Tackling the highly imbalanced data • Augmented Cascade SVM. • Bagging classification method except it adopts probabilistic sampling to select negative samples in a sequential manner. Training Dataset Sub Dataset 1 Classifier 1
  • 13. Tackling the highly imbalanced data • Augmented Cascade SVM. • Bagging classification method except it adopts probabilistic sampling to select negative samples in a sequential manner. Training Dataset Sub Dataset 1 Classifier 1 0.8 0.7 0.3 0.9 0.1 0.1
  • 14. Tackling the highly imbalanced data • Augmented Cascade SVM. • Bagging classification method except it adopts probabilistic sampling to select negative samples in a sequential manner. Training Dataset Sub Dataset 1 Classifier 1 0.8 0.2 0.3 0.9 0.1 0.1 Sub Dataset 2 Classifier 2 … …
  • 15. Tackling the highly imbalanced data • Augmented Cascade SVM. • Bagging classification method except it adopts probabilistic sampling to select negative samples in a sequential manner. N = 10 layers. Training Dataset Sub Dataset 1 Classifier 1 Sub Dataset 2 Classifier 2 Sub Dataset N Classifier N … …
  • 16. Tackling highly imbalanced data Bagging Ensemble of Random Forests • Random Forest is a forest of decision trees. • Two parameters: – n is the number of trees in the forest. – m the number of features in each decision tree. • Build each decision tree by randomly selecting m features and use C4.5. • Each tree is grown without pruning.
  • 17. Tackling highly imbalanced data Bagging Random Forest: Ensemble of Random Forests • Random Forest is a forest of decision trees. • Two parameters: – n is the number of trees in the forest. – m the number of features in each decision tree. • Build each decision tree by randomly selecting m features • Each tree is grown without pruning.
  • 18. Cascade SVM vs. Bagging Random Forest Cascade SVM (chi2 kernel) Bagging Random Forest Effectiveness Most Effective Usually 3-8% less in Average Precision Efficiency Time consuming Usually tens to hundreds of times faster Sensitive to Parameter settings Sensitive Relatively insensitive
  • 19. Results • 8 Submissions: • The first 6 runs use cascade SVM with different sliding window sizes and parameter sets. • Last 2 runs use bagging random forest method.
  • 20. Results • Results for Primary run: Inputs Actual DCR Minimum DCR #Targ #NTarg #Sys #CorDet #CorDet #FA #Miss DCR DCR CellToEar 194 127 128 1 0 127 193 1.0365 1.0003 Embrace 175 657 715 58 0 657 117 0.8840 0.8658 ObjectPut 621 57 58 1 0 57 620 1.0171 1.0003 PeopleMeet 449 336 381 45 0 336 404 1.0100 0.9724 PeopleSplitUp 187 115 118 3 0 115 184 1.0217 1.0003 PersonRuns 107 413 439 26 0 413 81 0.8924 0.8370 Pointing 1063 1960 2092 132 0 1960 931 1.5186 1.0001
  • 21. Results Compared with our primary run with those of other teams. We have the best Min DCR in 3 out of 6 events.
  • 22. Results Compared with our primary run with those of other teams. We have the best Actual DCR in 3 out of 7 events.
  • 23. Results Compared with our last year’s result, we get improvement in terms of MIN DCR in 5 events “Embrace”, “People Meet”, “People Slit up”, “Person Runs” and “Pointing”. • Best event results over all CMU runs Min DCR Cell To Ear Embrace Object Put People Meet People Split Up Person Runs Pointing 2010 CMU 1.0003 0.9838 1.0003 0.9793 0.9889 0.9477 1.0003 2010 Overall Best Event 1 0.9663 0.9971 0.9787 0.9889 0.6818 0.996 2011 CMU 1.0003 0.8658 1.0003 0.9684 0.7838 0.837 0.9996
  • 24. Min DCR Cell To Ear Embrace Object Put People Meet People Split Up Person Runs Pointing 2010 CMU 1.0003 0.9838 1.0003 0.9793 0.9889 0.9477 1.0003 2010 Overall Best Event 1 0.9663 0.9971 0.9787 0.9889 0.6818 0.996 2011 CMU 1.0003 0.8658 1.0003 0.9684 0.7838 0.837 0.9996 Results Compared with the best event results in TRECVID 2010, for event “Embrace”, “PeopleMeet” and “People Split Up” ours are the best system.
  • 25. Cascade SVM vs. Random Forest • Comparison between Run 1 (Cascade SVM) and Run 7 (Random Forest) in terms of Min DCR.
  • 26. Threshold Search • Searching for Min DCR using cross validation. • Actual DCR provides reasonable estimates of Min DCR on all runs. Primary Run
  • 27. Impact of sliding window size • Results for all events with sliding window size 25 frames (Run 3).
  • 28. Impact of sliding window size • Results for all events with sliding window size 60 (Run 5).
  • 29. Event-specific sliding window size • For PersonRuns, CellToEar, Embrace and Pointing a good sliding window is small. • For Embrace, ObjectPut and PeopleMeet a good sliding window size is larger.
  • 30. Conclusions  Observations:  MoSIFT feature captures salient motions in videos.  Spatial Bag of Words can boost the performance over last year’s result.  Event-specific sliding window size impacts the final result.  Both cascade SVM and bagging random forest can handle highly imbalanced data sets. Random forest is much faster.