SlideShare a Scribd company logo
1 of 18
Download to read offline
A brief review
on video
representation
Xiang Xiang
Problems
Action
recognition and
action quality
assessment
Spatial/temporal
action detec-
tion/segmentation
Approaches
Image-set
models
Temporally local
motion models
Temporally
global models
Locally
spatiotemporal
models
Tools used
Compressed
sensing and
sparse recovery
Representation
learning
A brief review on video representation
Xiang Xiang
Department of Computer Science
Johns Hopkins University
May 16, 2018
A brief review
on video
representation
Xiang Xiang
Problems
Action
recognition and
action quality
assessment
Spatial/temporal
action detec-
tion/segmentation
Approaches
Image-set
models
Temporally local
motion models
Temporally
global models
Locally
spatiotemporal
models
Tools used
Compressed
sensing and
sparse recovery
Representation
learning
Outline
1 Problems
Action recognition and action quality assessment
Spatial/temporal action detection/segmentation
2 Approaches
Image-set models
Temporally local motion models
Temporally global models
Locally spatiotemporal models
3 Tools used
Compressed sensing and sparse recovery
Representation learning
A brief review
on video
representation
Xiang Xiang
Problems
Action
recognition and
action quality
assessment
Spatial/temporal
action detec-
tion/segmentation
Approaches
Image-set
models
Temporally local
motion models
Temporally
global models
Locally
spatiotemporal
models
Tools used
Compressed
sensing and
sparse recovery
Representation
learning
Outline
1 Problems
Action recognition and action quality assessment
Spatial/temporal action detection/segmentation
2 Approaches
Image-set models
Temporally local motion models
Temporally global models
Locally spatiotemporal models
3 Tools used
Compressed sensing and sparse recovery
Representation learning
A brief review
on video
representation
Xiang Xiang
Problems
Action
recognition and
action quality
assessment
Spatial/temporal
action detec-
tion/segmentation
Approaches
Image-set
models
Temporally local
motion models
Temporally
global models
Locally
spatiotemporal
models
Tools used
Compressed
sensing and
sparse recovery
Representation
learning
Action recognition and action
quality assessment
• Facial expression recognition.
• Facial action intensity estimation.
• Human action recognition.
• Human action quality assessment.
• Video face recognition.
• Video categorization.
A brief review
on video
representation
Xiang Xiang
Problems
Action
recognition and
action quality
assessment
Spatial/temporal
action detec-
tion/segmentation
Approaches
Image-set
models
Temporally local
motion models
Temporally
global models
Locally
spatiotemporal
models
Tools used
Compressed
sensing and
sparse recovery
Representation
learning
Outline
1 Problems
Action recognition and action quality assessment
Spatial/temporal action detection/segmentation
2 Approaches
Image-set models
Temporally local motion models
Temporally global models
Locally spatiotemporal models
3 Tools used
Compressed sensing and sparse recovery
Representation learning
A brief review
on video
representation
Xiang Xiang
Problems
Action
recognition and
action quality
assessment
Spatial/temporal
action detec-
tion/segmentation
Approaches
Image-set
models
Temporally local
motion models
Temporally
global models
Locally
spatiotemporal
models
Tools used
Compressed
sensing and
sparse recovery
Representation
learning
Spatial/temporal action
detection/segmentation
• Facial expression detection.
• Facial action unit detection.
• Camera motion estimation.
• Action detection.
• Object tracking.
• Change detection.
• Video summarization.
A brief review
on video
representation
Xiang Xiang
Problems
Action
recognition and
action quality
assessment
Spatial/temporal
action detec-
tion/segmentation
Approaches
Image-set
models
Temporally local
motion models
Temporally
global models
Locally
spatiotemporal
models
Tools used
Compressed
sensing and
sparse recovery
Representation
learning
Outline
1 Problems
Action recognition and action quality assessment
Spatial/temporal action detection/segmentation
2 Approaches
Image-set models
Temporally local motion models
Temporally global models
Locally spatiotemporal models
3 Tools used
Compressed sensing and sparse recovery
Representation learning
A brief review
on video
representation
Xiang Xiang
Problems
Action
recognition and
action quality
assessment
Spatial/temporal
action detec-
tion/segmentation
Approaches
Image-set
models
Temporally local
motion models
Temporally
global models
Locally
spatiotemporal
models
Tools used
Compressed
sensing and
sparse recovery
Representation
learning
Image-set models
• Set theory.
• Distribution based: KL divergence on Gaussian mixtures.
• Subspace based: linear subspaces on Grassmann manifold.
• Sample based: feature averaging, aggregation, covariance
matrix, and Vector of Locally Aggregated Descriptors
(VLAD).
A brief review
on video
representation
Xiang Xiang
Problems
Action
recognition and
action quality
assessment
Spatial/temporal
action detec-
tion/segmentation
Approaches
Image-set
models
Temporally local
motion models
Temporally
global models
Locally
spatiotemporal
models
Tools used
Compressed
sensing and
sparse recovery
Representation
learning
Outline
1 Problems
Action recognition and action quality assessment
Spatial/temporal action detection/segmentation
2 Approaches
Image-set models
Temporally local motion models
Temporally global models
Locally spatiotemporal models
3 Tools used
Compressed sensing and sparse recovery
Representation learning
A brief review
on video
representation
Xiang Xiang
Problems
Action
recognition and
action quality
assessment
Spatial/temporal
action detec-
tion/segmentation
Approaches
Image-set
models
Temporally local
motion models
Temporally
global models
Locally
spatiotemporal
models
Tools used
Compressed
sensing and
sparse recovery
Representation
learning
Temporally local motion models
• Data association, regional proposal linking, and thread
building.
• Motion model: optical flow, Two-Stream CNN and
Bayesian filter.
A brief review
on video
representation
Xiang Xiang
Problems
Action
recognition and
action quality
assessment
Spatial/temporal
action detec-
tion/segmentation
Approaches
Image-set
models
Temporally local
motion models
Temporally
global models
Locally
spatiotemporal
models
Tools used
Compressed
sensing and
sparse recovery
Representation
learning
Outline
1 Problems
Action recognition and action quality assessment
Spatial/temporal action detection/segmentation
2 Approaches
Image-set models
Temporally local motion models
Temporally global models
Locally spatiotemporal models
3 Tools used
Compressed sensing and sparse recovery
Representation learning
A brief review
on video
representation
Xiang Xiang
Problems
Action
recognition and
action quality
assessment
Spatial/temporal
action detec-
tion/segmentation
Approaches
Image-set
models
Temporally local
motion models
Temporally
global models
Locally
spatiotemporal
models
Tools used
Compressed
sensing and
sparse recovery
Representation
learning
Temporally global models
• Temporal aggregation: NetVLAD.
• Hidden Markov Models (HMM) and Conditional Random
Fields (CRF).
• Temporal coding: Temporal Convolutional Networks
(TCN), Temporal Segment Networks (TSN), and Deep
Temporal Linear Encoding Networks.
• Recurrent neural networks (RNN): Long Short-Term
Memoory (LSTM).
A brief review
on video
representation
Xiang Xiang
Problems
Action
recognition and
action quality
assessment
Spatial/temporal
action detec-
tion/segmentation
Approaches
Image-set
models
Temporally local
motion models
Temporally
global models
Locally
spatiotemporal
models
Tools used
Compressed
sensing and
sparse recovery
Representation
learning
Outline
1 Problems
Action recognition and action quality assessment
Spatial/temporal action detection/segmentation
2 Approaches
Image-set models
Temporally local motion models
Temporally global models
Locally spatiotemporal models
3 Tools used
Compressed sensing and sparse recovery
Representation learning
A brief review
on video
representation
Xiang Xiang
Problems
Action
recognition and
action quality
assessment
Spatial/temporal
action detec-
tion/segmentation
Approaches
Image-set
models
Temporally local
motion models
Temporally
global models
Locally
spatiotemporal
models
Tools used
Compressed
sensing and
sparse recovery
Representation
learning
Locally spatiotemporal models
• Spatiotemporal aggregation: ActionVLAD.
• Volumetric modeling: 3D graph models.
• Action tube/tubelet proposal: 3D Convolutional Neural
Network (CNN).
A brief review
on video
representation
Xiang Xiang
Problems
Action
recognition and
action quality
assessment
Spatial/temporal
action detec-
tion/segmentation
Approaches
Image-set
models
Temporally local
motion models
Temporally
global models
Locally
spatiotemporal
models
Tools used
Compressed
sensing and
sparse recovery
Representation
learning
Outline
1 Problems
Action recognition and action quality assessment
Spatial/temporal action detection/segmentation
2 Approaches
Image-set models
Temporally local motion models
Temporally global models
Locally spatiotemporal models
3 Tools used
Compressed sensing and sparse recovery
Representation learning
A brief review
on video
representation
Xiang Xiang
Problems
Action
recognition and
action quality
assessment
Spatial/temporal
action detec-
tion/segmentation
Approaches
Image-set
models
Temporally local
motion models
Temporally
global models
Locally
spatiotemporal
models
Tools used
Compressed
sensing and
sparse recovery
Representation
learning
Compressed sensing and sparse
recovery
One goal is to find underlying structures in noisy observations.
• Linear algebra: y = Dx + e = [ D | I ] ×
x
e
= Dx.
• Sparse coding: [x∗, e∗]T = x∗ = arg minx sparsity(x).
• Convex optimization: x∗ = y − Dx 2
2 + λ x 1.
• Matrix theory and matrix approximation: Y = L + E so
that L∗ = arg minL Y − L 2
F + λ L ∗ which is the robust
version of Principal Component Analysis (PCA). PCA finds
projection U = arg minU Y − UUT
Y 2
F s.t. UUT
= I.
A brief review
on video
representation
Xiang Xiang
Problems
Action
recognition and
action quality
assessment
Spatial/temporal
action detec-
tion/segmentation
Approaches
Image-set
models
Temporally local
motion models
Temporally
global models
Locally
spatiotemporal
models
Tools used
Compressed
sensing and
sparse recovery
Representation
learning
Outline
1 Problems
Action recognition and action quality assessment
Spatial/temporal action detection/segmentation
2 Approaches
Image-set models
Temporally local motion models
Temporally global models
Locally spatiotemporal models
3 Tools used
Compressed sensing and sparse recovery
Representation learning
A brief review
on video
representation
Xiang Xiang
Problems
Action
recognition and
action quality
assessment
Spatial/temporal
action detec-
tion/segmentation
Approaches
Image-set
models
Temporally local
motion models
Temporally
global models
Locally
spatiotemporal
models
Tools used
Compressed
sensing and
sparse recovery
Representation
learning
Representation learning
Representation here means that we somehow transform the
data so that its essential structure is made more visible or
accessible.
• PCA.
• K-means clustering.
• Linear Discriminant Analysis (LDA).
• Independent component analysis (ICA).
• Deep CNN and RNN.

More Related Content

Similar to A brief review on video representation

Eye tracking in usability studies
Eye tracking in usability studiesEye tracking in usability studies
Eye tracking in usability studiesNana Nielsen
 
Session-aware Linear Item-Item Models for Session-based Recommendation (WWW 2...
Session-aware Linear Item-Item Models for Session-based Recommendation (WWW 2...Session-aware Linear Item-Item Models for Session-based Recommendation (WWW 2...
Session-aware Linear Item-Item Models for Session-based Recommendation (WWW 2...민진 최
 
Object Tracking with Instance Matching and Online Learning
Object Tracking with Instance Matching and Online LearningObject Tracking with Instance Matching and Online Learning
Object Tracking with Instance Matching and Online LearningJui-Hsin (Larry) Lai
 
Human Action Recognition Based on Spacio-temporal features-Poster
Human Action Recognition Based on Spacio-temporal features-PosterHuman Action Recognition Based on Spacio-temporal features-Poster
Human Action Recognition Based on Spacio-temporal features-Posternikhilus85
 
Talk 2011-buet-perception-event
Talk 2011-buet-perception-eventTalk 2011-buet-perception-event
Talk 2011-buet-perception-eventMahfuzul Haque
 
Deep VO and SLAM IV
Deep VO and SLAM IVDeep VO and SLAM IV
Deep VO and SLAM IVYu Huang
 
Event recognition image & video segmentation
Event recognition image & video segmentationEvent recognition image & video segmentation
Event recognition image & video segmentationeSAT Journals
 
Automated mobility mode detection based on GPS tracking data
Automated mobility mode detection based on GPS tracking dataAutomated mobility mode detection based on GPS tracking data
Automated mobility mode detection based on GPS tracking dataYongyao Jiang
 
Optical Flow with Semantic Segmentation and Localized Layers
Optical Flow with Semantic Segmentation and Localized LayersOptical Flow with Semantic Segmentation and Localized Layers
Optical Flow with Semantic Segmentation and Localized LayersSeval Çapraz
 
[SOCRS2013]Differential Context Modeling in Collaborative Filtering
[SOCRS2013]Differential Context Modeling in Collaborative Filtering[SOCRS2013]Differential Context Modeling in Collaborative Filtering
[SOCRS2013]Differential Context Modeling in Collaborative FilteringYONG ZHENG
 
Silhouette analysis based action recognition via exploiting human poses
Silhouette analysis based action recognition via exploiting human posesSilhouette analysis based action recognition via exploiting human poses
Silhouette analysis based action recognition via exploiting human posesAVVENIRE TECHNOLOGIES
 
Action_recognition-topic.pptx
Action_recognition-topic.pptxAction_recognition-topic.pptx
Action_recognition-topic.pptxcomputerscience98
 
IRJET- Criminal Recognization in CCTV Surveillance Video
IRJET-  	  Criminal Recognization in CCTV Surveillance VideoIRJET-  	  Criminal Recognization in CCTV Surveillance Video
IRJET- Criminal Recognization in CCTV Surveillance VideoIRJET Journal
 
ramya_Motion_Detection
ramya_Motion_Detectionramya_Motion_Detection
ramya_Motion_Detectionramya1591
 
[UMAP2013] Recommendation with Differential Context Weighting
[UMAP2013] Recommendation with Differential Context Weighting[UMAP2013] Recommendation with Differential Context Weighting
[UMAP2013] Recommendation with Differential Context WeightingYONG ZHENG
 
Fast and Noise Robust Depth from Focus using Ring Difference Filter with Your...
Fast and Noise Robust Depth from Focus using Ring Difference Filter with Your...Fast and Noise Robust Depth from Focus using Ring Difference Filter with Your...
Fast and Noise Robust Depth from Focus using Ring Difference Filter with Your...NAVER Engineering
 

Similar to A brief review on video representation (20)

JCC_2016011515340886
JCC_2016011515340886JCC_2016011515340886
JCC_2016011515340886
 
Eye tracking in usability studies
Eye tracking in usability studiesEye tracking in usability studies
Eye tracking in usability studies
 
Session-aware Linear Item-Item Models for Session-based Recommendation (WWW 2...
Session-aware Linear Item-Item Models for Session-based Recommendation (WWW 2...Session-aware Linear Item-Item Models for Session-based Recommendation (WWW 2...
Session-aware Linear Item-Item Models for Session-based Recommendation (WWW 2...
 
Temporal based Recommendation System
Temporal based Recommendation SystemTemporal based Recommendation System
Temporal based Recommendation System
 
Object Tracking with Instance Matching and Online Learning
Object Tracking with Instance Matching and Online LearningObject Tracking with Instance Matching and Online Learning
Object Tracking with Instance Matching and Online Learning
 
Human Action Recognition Based on Spacio-temporal features-Poster
Human Action Recognition Based on Spacio-temporal features-PosterHuman Action Recognition Based on Spacio-temporal features-Poster
Human Action Recognition Based on Spacio-temporal features-Poster
 
Talk 2011-buet-perception-event
Talk 2011-buet-perception-eventTalk 2011-buet-perception-event
Talk 2011-buet-perception-event
 
Deep VO and SLAM IV
Deep VO and SLAM IVDeep VO and SLAM IV
Deep VO and SLAM IV
 
Event recognition image & video segmentation
Event recognition image & video segmentationEvent recognition image & video segmentation
Event recognition image & video segmentation
 
Automated mobility mode detection based on GPS tracking data
Automated mobility mode detection based on GPS tracking dataAutomated mobility mode detection based on GPS tracking data
Automated mobility mode detection based on GPS tracking data
 
Optical Flow with Semantic Segmentation and Localized Layers
Optical Flow with Semantic Segmentation and Localized LayersOptical Flow with Semantic Segmentation and Localized Layers
Optical Flow with Semantic Segmentation and Localized Layers
 
[SOCRS2013]Differential Context Modeling in Collaborative Filtering
[SOCRS2013]Differential Context Modeling in Collaborative Filtering[SOCRS2013]Differential Context Modeling in Collaborative Filtering
[SOCRS2013]Differential Context Modeling in Collaborative Filtering
 
Silhouette analysis based action recognition via exploiting human poses
Silhouette analysis based action recognition via exploiting human posesSilhouette analysis based action recognition via exploiting human poses
Silhouette analysis based action recognition via exploiting human poses
 
Action_recognition-topic.pptx
Action_recognition-topic.pptxAction_recognition-topic.pptx
Action_recognition-topic.pptx
 
Raskar COSI invited talk Oct 2009
Raskar COSI invited talk Oct 2009Raskar COSI invited talk Oct 2009
Raskar COSI invited talk Oct 2009
 
IRJET- Criminal Recognization in CCTV Surveillance Video
IRJET-  	  Criminal Recognization in CCTV Surveillance VideoIRJET-  	  Criminal Recognization in CCTV Surveillance Video
IRJET- Criminal Recognization in CCTV Surveillance Video
 
Making sense
Making senseMaking sense
Making sense
 
ramya_Motion_Detection
ramya_Motion_Detectionramya_Motion_Detection
ramya_Motion_Detection
 
[UMAP2013] Recommendation with Differential Context Weighting
[UMAP2013] Recommendation with Differential Context Weighting[UMAP2013] Recommendation with Differential Context Weighting
[UMAP2013] Recommendation with Differential Context Weighting
 
Fast and Noise Robust Depth from Focus using Ring Difference Filter with Your...
Fast and Noise Robust Depth from Focus using Ring Difference Filter with Your...Fast and Noise Robust Depth from Focus using Ring Difference Filter with Your...
Fast and Noise Robust Depth from Focus using Ring Difference Filter with Your...
 

Recently uploaded

A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 

Recently uploaded (20)

A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 

A brief review on video representation

  • 1. A brief review on video representation Xiang Xiang Problems Action recognition and action quality assessment Spatial/temporal action detec- tion/segmentation Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models Tools used Compressed sensing and sparse recovery Representation learning A brief review on video representation Xiang Xiang Department of Computer Science Johns Hopkins University May 16, 2018
  • 2. A brief review on video representation Xiang Xiang Problems Action recognition and action quality assessment Spatial/temporal action detec- tion/segmentation Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models Tools used Compressed sensing and sparse recovery Representation learning Outline 1 Problems Action recognition and action quality assessment Spatial/temporal action detection/segmentation 2 Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models 3 Tools used Compressed sensing and sparse recovery Representation learning
  • 3. A brief review on video representation Xiang Xiang Problems Action recognition and action quality assessment Spatial/temporal action detec- tion/segmentation Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models Tools used Compressed sensing and sparse recovery Representation learning Outline 1 Problems Action recognition and action quality assessment Spatial/temporal action detection/segmentation 2 Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models 3 Tools used Compressed sensing and sparse recovery Representation learning
  • 4. A brief review on video representation Xiang Xiang Problems Action recognition and action quality assessment Spatial/temporal action detec- tion/segmentation Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models Tools used Compressed sensing and sparse recovery Representation learning Action recognition and action quality assessment • Facial expression recognition. • Facial action intensity estimation. • Human action recognition. • Human action quality assessment. • Video face recognition. • Video categorization.
  • 5. A brief review on video representation Xiang Xiang Problems Action recognition and action quality assessment Spatial/temporal action detec- tion/segmentation Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models Tools used Compressed sensing and sparse recovery Representation learning Outline 1 Problems Action recognition and action quality assessment Spatial/temporal action detection/segmentation 2 Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models 3 Tools used Compressed sensing and sparse recovery Representation learning
  • 6. A brief review on video representation Xiang Xiang Problems Action recognition and action quality assessment Spatial/temporal action detec- tion/segmentation Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models Tools used Compressed sensing and sparse recovery Representation learning Spatial/temporal action detection/segmentation • Facial expression detection. • Facial action unit detection. • Camera motion estimation. • Action detection. • Object tracking. • Change detection. • Video summarization.
  • 7. A brief review on video representation Xiang Xiang Problems Action recognition and action quality assessment Spatial/temporal action detec- tion/segmentation Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models Tools used Compressed sensing and sparse recovery Representation learning Outline 1 Problems Action recognition and action quality assessment Spatial/temporal action detection/segmentation 2 Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models 3 Tools used Compressed sensing and sparse recovery Representation learning
  • 8. A brief review on video representation Xiang Xiang Problems Action recognition and action quality assessment Spatial/temporal action detec- tion/segmentation Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models Tools used Compressed sensing and sparse recovery Representation learning Image-set models • Set theory. • Distribution based: KL divergence on Gaussian mixtures. • Subspace based: linear subspaces on Grassmann manifold. • Sample based: feature averaging, aggregation, covariance matrix, and Vector of Locally Aggregated Descriptors (VLAD).
  • 9. A brief review on video representation Xiang Xiang Problems Action recognition and action quality assessment Spatial/temporal action detec- tion/segmentation Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models Tools used Compressed sensing and sparse recovery Representation learning Outline 1 Problems Action recognition and action quality assessment Spatial/temporal action detection/segmentation 2 Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models 3 Tools used Compressed sensing and sparse recovery Representation learning
  • 10. A brief review on video representation Xiang Xiang Problems Action recognition and action quality assessment Spatial/temporal action detec- tion/segmentation Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models Tools used Compressed sensing and sparse recovery Representation learning Temporally local motion models • Data association, regional proposal linking, and thread building. • Motion model: optical flow, Two-Stream CNN and Bayesian filter.
  • 11. A brief review on video representation Xiang Xiang Problems Action recognition and action quality assessment Spatial/temporal action detec- tion/segmentation Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models Tools used Compressed sensing and sparse recovery Representation learning Outline 1 Problems Action recognition and action quality assessment Spatial/temporal action detection/segmentation 2 Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models 3 Tools used Compressed sensing and sparse recovery Representation learning
  • 12. A brief review on video representation Xiang Xiang Problems Action recognition and action quality assessment Spatial/temporal action detec- tion/segmentation Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models Tools used Compressed sensing and sparse recovery Representation learning Temporally global models • Temporal aggregation: NetVLAD. • Hidden Markov Models (HMM) and Conditional Random Fields (CRF). • Temporal coding: Temporal Convolutional Networks (TCN), Temporal Segment Networks (TSN), and Deep Temporal Linear Encoding Networks. • Recurrent neural networks (RNN): Long Short-Term Memoory (LSTM).
  • 13. A brief review on video representation Xiang Xiang Problems Action recognition and action quality assessment Spatial/temporal action detec- tion/segmentation Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models Tools used Compressed sensing and sparse recovery Representation learning Outline 1 Problems Action recognition and action quality assessment Spatial/temporal action detection/segmentation 2 Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models 3 Tools used Compressed sensing and sparse recovery Representation learning
  • 14. A brief review on video representation Xiang Xiang Problems Action recognition and action quality assessment Spatial/temporal action detec- tion/segmentation Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models Tools used Compressed sensing and sparse recovery Representation learning Locally spatiotemporal models • Spatiotemporal aggregation: ActionVLAD. • Volumetric modeling: 3D graph models. • Action tube/tubelet proposal: 3D Convolutional Neural Network (CNN).
  • 15. A brief review on video representation Xiang Xiang Problems Action recognition and action quality assessment Spatial/temporal action detec- tion/segmentation Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models Tools used Compressed sensing and sparse recovery Representation learning Outline 1 Problems Action recognition and action quality assessment Spatial/temporal action detection/segmentation 2 Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models 3 Tools used Compressed sensing and sparse recovery Representation learning
  • 16. A brief review on video representation Xiang Xiang Problems Action recognition and action quality assessment Spatial/temporal action detec- tion/segmentation Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models Tools used Compressed sensing and sparse recovery Representation learning Compressed sensing and sparse recovery One goal is to find underlying structures in noisy observations. • Linear algebra: y = Dx + e = [ D | I ] × x e = Dx. • Sparse coding: [x∗, e∗]T = x∗ = arg minx sparsity(x). • Convex optimization: x∗ = y − Dx 2 2 + λ x 1. • Matrix theory and matrix approximation: Y = L + E so that L∗ = arg minL Y − L 2 F + λ L ∗ which is the robust version of Principal Component Analysis (PCA). PCA finds projection U = arg minU Y − UUT Y 2 F s.t. UUT = I.
  • 17. A brief review on video representation Xiang Xiang Problems Action recognition and action quality assessment Spatial/temporal action detec- tion/segmentation Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models Tools used Compressed sensing and sparse recovery Representation learning Outline 1 Problems Action recognition and action quality assessment Spatial/temporal action detection/segmentation 2 Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models 3 Tools used Compressed sensing and sparse recovery Representation learning
  • 18. A brief review on video representation Xiang Xiang Problems Action recognition and action quality assessment Spatial/temporal action detec- tion/segmentation Approaches Image-set models Temporally local motion models Temporally global models Locally spatiotemporal models Tools used Compressed sensing and sparse recovery Representation learning Representation learning Representation here means that we somehow transform the data so that its essential structure is made more visible or accessible. • PCA. • K-means clustering. • Linear Discriminant Analysis (LDA). • Independent component analysis (ICA). • Deep CNN and RNN.