SlideShare a Scribd company logo
1 of 25
Download to read offline
Sparse Representation-based Human Action Recognition
using an Action Region-aware Dictionary
ISM 2013
December 11, 2013

Hyun-seok Min, Wesley De Neve, and Yong Man Ro
Image and Video Systems Lab
Department of Electrical Engineering
Korea Advanced Institute of Science and Technology (KAIST)
e-mail: hsmin@kaist.ac.kr
web: http://ivylab.kaist.ac.kr

IEEE International Symposium on Multimedia 2013
Outline
•
•
•
•

Introduction
Sparse representation-based human action recognition
Experiments
Conclusions and future research

IEEE International Symposium on Multimedia 2013

2
Outline
• Introduction
– human action recognition
– problems
– contributions

• Sparse representation-based human action recognition
• Experiments
• Conclusions and future research

IEEE International Symposium on Multimedia 2013

3
Conventional approach for
human action recognition
Pr epr ocessing

Input
Input

Classification

Human Action Recognition Framework
Segmentation
Object Detection
Object Tracking

Video
Sequence

Featur e Ex tr action

Cuboid

SR

2D-Harris

SVM

LBP-TOP

Keypoint
Detection

Output

Random Forest

Descriptor

LMP, CUBOID

IEEE International Symposium on Multimedia 2013

“Skating”

4
Action detection vs. action recognition
• A video clip consists of a context region and an action region [1]
– action detection (segmentation) is required for effective action recognition [2]

=

Human action video clip

+

Context region

Action region

• Shortcomings of action detection
– despite the great emphasis on action recognition, there is comparatively little
work available on action detection [2]
– there is currently no general action detection method available that shows a
high level of effectiveness for every action
[1] K K. Reddy and M.Shah, “Recognizing 50 Human Action Categories of Web Videos,” Machine Vision and Applications Journal , vol. 24, no. 5, pp. 971-981, 2012.
[2] S.Sadanand and J.J.Corso, “Action bank: A high-level representation of activity in video,” IEEE Conf. on Computer Vision and Pattern Recognition , pp.1234-1241, 2012.

IEEE International Symposium on Multimedia 2013

5
Context information
for human action recognition
• Usefulness of context depends on the action class

(a)

(b)

(c)

– e.g., context is
• helpful for making a distinction between (a) and (b) [3]
• not helpful for making a distinction between (b) and (c)
[3] Tian Lan, Yang Wang, and Greg Mori, “Discriminative Figure-Centric Models for Joint Action Localization and Recognition,” IEEE International Conference on Computer
Vision (ICCV), 2011

IEEE International Symposium on Multimedia 2013

6
Research challenges & contributions
• Challenges

– lack of a general method for effective and efficient action detection
– the usefulness of context information depends on the type of action

• Contributions
– we propose a novel human action recognition method
• that does not require complex action detection during testing
• that uses context information in an adaptive way

IEEE International Symposium on Multimedia 2013

7
Outline
• Introduction
• Sparse representation-based human action recognition
– conventional method
– proposed method
• construction of an action region-aware dictionary
• use of an action region-aware dictionary
• adaptive classification using split sparse coefficients

• Experiments
• Conclusions and future research

IEEE International Symposium on Multimedia 2013

8
Conventional SR-based method:
dictionary construction
…

Action class 1

…

…

Feature
extraction

Action class i

…
…

…

…

Action class K

…

…

…

…

…

i
K
D = [z1 ,..., z1 1 ,..., z1 ,..., ziNi ,...., z1 ,..., z K K ] ∈ ℜ d × N
1
N
N

IEEE International Symposium on Multimedia 2013

9
Conventional SR-based method:
classification
•

Input video clip, depicting
'Lifting' (true action)

Given a dictionary D, the feature
vector y of a test video clip V can be
represented as follows
y ≈ Dx∈ ℜ d ,

Sparse coefficients belonging to
the true class

Sparse coefficient value

y : feature vector of V
D : dictionary
x : sparse coefficient vector

•

Given the sparse solution x, we can
calculate the residual error for each
human action as follows:
ri (y) = y − Dδi (x) 1

1

2

3

4

5

Human action class: 1: diving 2: golf swing
6: running 7: skating

6

7

3: kicking
8: swing1

8

4: lifting
9: swing2

9

10

5: riding
10: walking

ri(y) : residual for ith action
δi (x) : a new vector whose only nonzero entries
are the entries in x that are associated
with class i

IEEE International Symposium on Multimedia 2013

10
Conventional SR-based method:
dictionary shortcomings
Input video clip, depicting
'Golf' (true action)

• The dictionary only contains
class information

Sparse coefficients belonging to
the true class

Sparse coefficient value

– we do not know the location and
size of the action region of a test
video clip during classification
– however, we do know the
location and size of the action
regions in the training video clips

• Research question
1

2

3

4

5

Human action class: 1: diving 2: golf swing
6: running 7: skating

6

7

3: kicking
8: swing1

8

4: lifting
9: swing2

9

10

5: riding
10: walking

– how about putting the action
region information of the training
video clips in the dictionary?

IEEE International Symposium on Multimedia 2013

11
Proposed SR-based method:
construction of an action region-aware dictionary
Training video clips
...

...

Segmentation during training

• We propose to construct a
dictionary that consists of
two split dictionaries:
– context region dictionary DC
– action region dictionary DA

Segmented regions

...

...
Action regions

Context regions

Feature extraction
Action region-aware dictionary
D=

...

...

DC

DA

D = [DC | D A ]∈ ℜ d × N

IEEE International Symposium on Multimedia 2013

12
Proposed SR-based method:
use of an action region-aware dictionary (1/3)
• Given an action region-aware dictionary D and the feature
vector y of a test video clip V, we can compute the sparse
representation of y as follows
x 
y ≈ D R x ≅ [D C | D A ] C  = D C xC + D A x A
x A 

1
i
K
xC = [ x1,C ,..., x1 1 ,C ,..., x1i,C ,..., xN i ,C ,..., x1KC ,..., xN K ,C ]
,
N
1
i
i
K
x A = [ x1, A ,..., x1 1 , A ,..., x1, A ,..., x Ni , A ,..., x1KA ,..., x N K , A ]
,
N

– xij,C and xij,A: the sparse coefficient values that are associated with the
context and the action region of the jth training video clip
of the ith human action
During testing, the proposed method for human action recognition
is able to automatically make a distinction between information
originating from the context region and information originating from
the action region in a test video clip.

IEEE International Symposium on Multimedia 2013

13
Proposed SR-based method:
use of an action region-aware dictionary (2/3)
Input video clip, depicting
'golf swing' (true action)

The sparse coefficients belonging
to the context region of the ‘golf
swing’ test video clip are
dispersed over the different
classes. This can be attributed to
the fact that the background of
‘golf swing’ is visually similar to
the background of ‘kicking’, ‘riding’,
and ‘walking’.

...

Sparse coefficients belonging to
the context region

Sparse coefficient value

Sparse coefficients belonging to
the action region

DC
1

2

3

4

5

6

DA
7

8

9

10

Human action class: 1: diving 2: golf swing
6: running 7: skating

1

2

3

3: kicking
8: swing1

4

5

6

4: lifting
9: swing2

7

8

9

10

5: riding
10: walking

IEEE International Symposium on Multimedia 2013

14
Proposed SR-based method:
use of an action region-aware dictionary (3/3)
Input video clip, depicting
'diving' (true action)
Sparse coefficients belonging to
the context region

...

The sparse coefficients belonging
to the context region of the
‘diving’ test video clip are
concentrated in the true class.
This means that the context
region of ‘diving’ is different from
the context regions of the other
human actions.

Sparse coefficient value

Sparse coefficients belonging to
the action region

DC
1

2

3

4

5

6

DA
7

8

9

10

Human action class: 1: diving 2: golf swing
6: running 7: skating

1

2

3

3: kicking
8: swing1

4

5

6

4: lifting
9: swing2

7

8

9

10

5: riding
10: walking

IEEE International Symposium on Multimedia 2013

15
Adaptive classification using
split sparse coefficients
• Given the above observations, we can hypothesize that
– information originating from context regions can help in successfully classifying
human actions, on the condition that the sparse coefficients associated with the
context regions are concentrated in the true class

• Measurement of the concentration of sparse coefficients
– Maximum Sparse Coefficient Concentration (MSCC)
MSCC (x) = max
k

δk (x) 1
x1

• We can then use the following criterion to determine whether information
of context regions can help in successfully classifying human actions
MSCC (xC )
> ξ ratio
MSCC (x A )

IEEE International Symposium on Multimedia 2013

16
Outline
• Introduction
• Sparse representation-based human action recognition
• Experiments
– experimental setup
– experimental results

• Conclusions and future research

IEEE International Symposium on Multimedia 2013

17
Experimental setup (1/2)
• Use of the UCF Sports Action data set
– contains 150 action video clips with a resolution of 720×480, collected
for various sports that are typically featured on broadcast television
channels such as BBC and ESPN
– for each frame, a bounding box is available around the person
performing the action of interest
– available action classes: diving, golf swinging, kicking, lifting, riding,
running, skating, swinging, and walking
Diving

Running

Golf swinging

Kicking

Lifting

Skating

Swinging

Walking

IEEE International Symposium on Multimedia 2013

Riding

18
Experimental setup (2/2)
• Comparison with
– SR with action region
• only makes use of action regions in the test video clips considered, thus
taking advantage of segmentation information
– SR with whole region
• uses whole video frames, thus not exploiting segmentation information

SR with whole
region

SR with action
region

IEEE International Symposium on Multimedia 2013

19
Experimental results (1/2)
• The accuracy of the proposed SR-based method for human action
recognition is more stable over the different human action classes
• The accuracy of the proposed method is highly independent of the
type of human action
– thanks to the use of a context-adaptive classification strategy

IEEE International Symposium on Multimedia 2013

20
Experimental results (2/2)
• We can observe that what method is most accurate depends on the
human action class considered
– “SR with action region” is usually more accurate when the concentration of
the sparse coefficients associated with the action region is higher than the
concentration of the sparse coefficients associated with the context region
– Otherwise, “SR with whole region” or “Proposed method” are more effective

IEEE International Symposium on Multimedia 2013

21
Outline
•
•
•
•

Introduction
Sparse representation-based human action recognition
Experiments
Conclusions and future research
– conclusions
– future research directions

IEEE International Symposium on Multimedia 2013

22
Conclusions
• We proposed a novel SR-based method for human action
recognition, having the following two major characteristics
– first, classification does not have to apply explicit segmentation to a
given test video clip
– second, classification is context adaptive in nature, only leveraging
information about the context in which the action took place when
the concentration of the corresponding sparse coefficients is high

IEEE International Symposium on Multimedia 2013

23
Future research directions
• Use of dictionary learning techniques that allow for more
effective and efficient construction of an overcomplete
dictionary
• Perform experiments with actions that have a lower variation
in background
• Study how to leverage SRC by means of an action regionaware dictionary in other application scenarios

IEEE International Symposium on Multimedia 2013

24
Thank you!
Any questions?
e-mail: hsmin@kaist.ac.kr .
web: http://ivylab.kaist.ac.kr

IEEE International Symposium on Multimedia 2013

25

More Related Content

What's hot

Predicting Media Memorability with Audio, Video, and Text representations
Predicting Media Memorability with Audio, Video, and Text representationsPredicting Media Memorability with Audio, Video, and Text representations
Predicting Media Memorability with Audio, Video, and Text representationsAlison Reboud
 
TVSum: Summarizing Web Videos Using Titles
TVSum: Summarizing Web Videos Using TitlesTVSum: Summarizing Web Videos Using Titles
TVSum: Summarizing Web Videos Using TitlesNEERAJ BAGHEL
 
Deep Learning for Computer Vision (1/4): Image Analytics @ laSalle 2016
Deep Learning for Computer Vision (1/4): Image Analytics @ laSalle 2016Deep Learning for Computer Vision (1/4): Image Analytics @ laSalle 2016
Deep Learning for Computer Vision (1/4): Image Analytics @ laSalle 2016Universitat Politècnica de Catalunya
 
auto-assistance system for visually impaired person
auto-assistance system for visually impaired personauto-assistance system for visually impaired person
auto-assistance system for visually impaired personshahsamkit73
 
Александр Заричковый "Faster than real-time face detection"
Александр Заричковый "Faster than real-time face detection"Александр Заричковый "Faster than real-time face detection"
Александр Заричковый "Faster than real-time face detection"Fwdays
 
Face detection ppt by Batyrbek
Face detection ppt by Batyrbek Face detection ppt by Batyrbek
Face detection ppt by Batyrbek Batyrbek Ryskhan
 
Backbone can not be trained at once rolling back to pre trained network for p...
Backbone can not be trained at once rolling back to pre trained network for p...Backbone can not be trained at once rolling back to pre trained network for p...
Backbone can not be trained at once rolling back to pre trained network for p...NAVER Engineering
 
Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...
Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...
Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...Universitat Politècnica de Catalunya
 
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...adil raja
 
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...adil raja
 
Discovering Anomalies Based on Saliency Detection and Segmentation in Surveil...
Discovering Anomalies Based on Saliency Detection and Segmentation in Surveil...Discovering Anomalies Based on Saliency Detection and Segmentation in Surveil...
Discovering Anomalies Based on Saliency Detection and Segmentation in Surveil...ijtsrd
 
Atari Game State Representation using Convolutional Neural Networks
Atari Game State Representation using Convolutional Neural NetworksAtari Game State Representation using Convolutional Neural Networks
Atari Game State Representation using Convolutional Neural Networksjohnstamford
 
Avihu Efrat's Viola and Jones face detection slides
Avihu Efrat's Viola and Jones face detection slidesAvihu Efrat's Viola and Jones face detection slides
Avihu Efrat's Viola and Jones face detection slideswolf
 
HUMAN MOTION DETECTION AND TRACKING FOR VIDEO SURVEILLANCE
HUMAN MOTION DETECTION AND TRACKING FOR VIDEO SURVEILLANCEHUMAN MOTION DETECTION AND TRACKING FOR VIDEO SURVEILLANCE
HUMAN MOTION DETECTION AND TRACKING FOR VIDEO SURVEILLANCEAswinraj Manickam
 
Semantic Concept Detection in Video Using Hybrid Model of CNN and SVM Classif...
Semantic Concept Detection in Video Using Hybrid Model of CNN and SVM Classif...Semantic Concept Detection in Video Using Hybrid Model of CNN and SVM Classif...
Semantic Concept Detection in Video Using Hybrid Model of CNN and SVM Classif...CSCJournals
 
2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘
2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘
2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘CHENHuiMei
 
IRJET - Direct Me-Nevigation for Blind People
IRJET -  	  Direct Me-Nevigation for Blind PeopleIRJET -  	  Direct Me-Nevigation for Blind People
IRJET - Direct Me-Nevigation for Blind PeopleIRJET Journal
 

What's hot (20)

Predicting Media Memorability with Audio, Video, and Text representations
Predicting Media Memorability with Audio, Video, and Text representationsPredicting Media Memorability with Audio, Video, and Text representations
Predicting Media Memorability with Audio, Video, and Text representations
 
TVSum: Summarizing Web Videos Using Titles
TVSum: Summarizing Web Videos Using TitlesTVSum: Summarizing Web Videos Using Titles
TVSum: Summarizing Web Videos Using Titles
 
Deep Learning for Computer Vision (1/4): Image Analytics @ laSalle 2016
Deep Learning for Computer Vision (1/4): Image Analytics @ laSalle 2016Deep Learning for Computer Vision (1/4): Image Analytics @ laSalle 2016
Deep Learning for Computer Vision (1/4): Image Analytics @ laSalle 2016
 
auto-assistance system for visually impaired person
auto-assistance system for visually impaired personauto-assistance system for visually impaired person
auto-assistance system for visually impaired person
 
Александр Заричковый "Faster than real-time face detection"
Александр Заричковый "Faster than real-time face detection"Александр Заричковый "Faster than real-time face detection"
Александр Заричковый "Faster than real-time face detection"
 
Face detection ppt by Batyrbek
Face detection ppt by Batyrbek Face detection ppt by Batyrbek
Face detection ppt by Batyrbek
 
Deep Learning for Computer Vision: Face Recognition (UPC 2016)
Deep Learning for Computer Vision: Face Recognition (UPC 2016)Deep Learning for Computer Vision: Face Recognition (UPC 2016)
Deep Learning for Computer Vision: Face Recognition (UPC 2016)
 
Step zhedong
Step zhedongStep zhedong
Step zhedong
 
Face Recognition - Elisa Sayrol - UPC Barcelona 2018
Face Recognition - Elisa Sayrol - UPC Barcelona 2018Face Recognition - Elisa Sayrol - UPC Barcelona 2018
Face Recognition - Elisa Sayrol - UPC Barcelona 2018
 
Backbone can not be trained at once rolling back to pre trained network for p...
Backbone can not be trained at once rolling back to pre trained network for p...Backbone can not be trained at once rolling back to pre trained network for p...
Backbone can not be trained at once rolling back to pre trained network for p...
 
Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...
Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...
Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...
 
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
 
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
 
Discovering Anomalies Based on Saliency Detection and Segmentation in Surveil...
Discovering Anomalies Based on Saliency Detection and Segmentation in Surveil...Discovering Anomalies Based on Saliency Detection and Segmentation in Surveil...
Discovering Anomalies Based on Saliency Detection and Segmentation in Surveil...
 
Atari Game State Representation using Convolutional Neural Networks
Atari Game State Representation using Convolutional Neural NetworksAtari Game State Representation using Convolutional Neural Networks
Atari Game State Representation using Convolutional Neural Networks
 
Avihu Efrat's Viola and Jones face detection slides
Avihu Efrat's Viola and Jones face detection slidesAvihu Efrat's Viola and Jones face detection slides
Avihu Efrat's Viola and Jones face detection slides
 
HUMAN MOTION DETECTION AND TRACKING FOR VIDEO SURVEILLANCE
HUMAN MOTION DETECTION AND TRACKING FOR VIDEO SURVEILLANCEHUMAN MOTION DETECTION AND TRACKING FOR VIDEO SURVEILLANCE
HUMAN MOTION DETECTION AND TRACKING FOR VIDEO SURVEILLANCE
 
Semantic Concept Detection in Video Using Hybrid Model of CNN and SVM Classif...
Semantic Concept Detection in Video Using Hybrid Model of CNN and SVM Classif...Semantic Concept Detection in Video Using Hybrid Model of CNN and SVM Classif...
Semantic Concept Detection in Video Using Hybrid Model of CNN and SVM Classif...
 
2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘
2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘
2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘
 
IRJET - Direct Me-Nevigation for Blind People
IRJET -  	  Direct Me-Nevigation for Blind PeopleIRJET -  	  Direct Me-Nevigation for Blind People
IRJET - Direct Me-Nevigation for Blind People
 

Similar to Sparse representation based human action recognition using an action region-aware dictionary

A Framework for Human Action Detection via Extraction of Multimodal Features
A Framework for Human Action Detection via Extraction of Multimodal FeaturesA Framework for Human Action Detection via Extraction of Multimodal Features
A Framework for Human Action Detection via Extraction of Multimodal FeaturesCSCJournals
 
IRJET- Review on Human Action Detection in Stored Videos using Support Vector...
IRJET- Review on Human Action Detection in Stored Videos using Support Vector...IRJET- Review on Human Action Detection in Stored Videos using Support Vector...
IRJET- Review on Human Action Detection in Stored Videos using Support Vector...IRJET Journal
 
Flow Trajectory Approach for Human Action Recognition
Flow Trajectory Approach for Human Action RecognitionFlow Trajectory Approach for Human Action Recognition
Flow Trajectory Approach for Human Action RecognitionIRJET Journal
 
Human action recognition with kinect using a joint motion descriptor
Human action recognition with kinect using a joint motion descriptorHuman action recognition with kinect using a joint motion descriptor
Human action recognition with kinect using a joint motion descriptorSoma Boubou
 
Dance With AI – An interactive dance learning platform
Dance With AI – An interactive dance learning platformDance With AI – An interactive dance learning platform
Dance With AI – An interactive dance learning platformIRJET Journal
 
Action_recognition-topic.pptx
Action_recognition-topic.pptxAction_recognition-topic.pptx
Action_recognition-topic.pptxcomputerscience98
 
Human Activity Recognition Using Neural Network
Human Activity Recognition Using Neural NetworkHuman Activity Recognition Using Neural Network
Human Activity Recognition Using Neural NetworkIRJET Journal
 
Human Action Recognition Using Deep Learning
Human Action Recognition Using Deep LearningHuman Action Recognition Using Deep Learning
Human Action Recognition Using Deep LearningIRJET Journal
 
Action Genome: Action As Composition of Spatio Temporal Scene Graphs
Action Genome: Action As Composition of Spatio Temporal Scene GraphsAction Genome: Action As Composition of Spatio Temporal Scene Graphs
Action Genome: Action As Composition of Spatio Temporal Scene GraphsSangmin Woo
 
On the Influence Propagation of Web Videos
On the Influence Propagation of Web VideosOn the Influence Propagation of Web Videos
On the Influence Propagation of Web Videosabidhavp
 
Video Manifold Feature Extraction Based on ISOMAP
Video Manifold Feature Extraction Based on ISOMAPVideo Manifold Feature Extraction Based on ISOMAP
Video Manifold Feature Extraction Based on ISOMAPinventionjournals
 
Intelligent Video Surveillance System using Deep Learning
Intelligent Video Surveillance System using Deep LearningIntelligent Video Surveillance System using Deep Learning
Intelligent Video Surveillance System using Deep LearningIRJET Journal
 
Real-Time Pertinent Maneuver Recognition for Surveillance
Real-Time Pertinent Maneuver Recognition for SurveillanceReal-Time Pertinent Maneuver Recognition for Surveillance
Real-Time Pertinent Maneuver Recognition for SurveillanceIRJET Journal
 
A Multiple Kernel Learning Based Fusion Framework for Real-Time Multi-View Ac...
A Multiple Kernel Learning Based Fusion Framework for Real-Time Multi-View Ac...A Multiple Kernel Learning Based Fusion Framework for Real-Time Multi-View Ac...
A Multiple Kernel Learning Based Fusion Framework for Real-Time Multi-View Ac...Francisco (Paco) Florez-Revuelta
 
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon TransformHuman Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon TransformFadwa Fouad
 
Development of wearable object detection system & blind stick for visuall...
Development of wearable object detection system & blind stick for visuall...Development of wearable object detection system & blind stick for visuall...
Development of wearable object detection system & blind stick for visuall...Arkadev Kundu
 
Introduction talk to Computer Vision
Introduction talk to Computer Vision Introduction talk to Computer Vision
Introduction talk to Computer Vision Chen Sagiv
 
Silhouette analysis based action recognition via exploiting human poses
Silhouette analysis based action recognition via exploiting human posesSilhouette analysis based action recognition via exploiting human poses
Silhouette analysis based action recognition via exploiting human posesAVVENIRE TECHNOLOGIES
 
K-Means Clustering in Moving Objects Extraction with Selective Background
K-Means Clustering in Moving Objects Extraction with Selective BackgroundK-Means Clustering in Moving Objects Extraction with Selective Background
K-Means Clustering in Moving Objects Extraction with Selective BackgroundIJCSIS Research Publications
 

Similar to Sparse representation based human action recognition using an action region-aware dictionary (20)

A Framework for Human Action Detection via Extraction of Multimodal Features
A Framework for Human Action Detection via Extraction of Multimodal FeaturesA Framework for Human Action Detection via Extraction of Multimodal Features
A Framework for Human Action Detection via Extraction of Multimodal Features
 
IRJET- Review on Human Action Detection in Stored Videos using Support Vector...
IRJET- Review on Human Action Detection in Stored Videos using Support Vector...IRJET- Review on Human Action Detection in Stored Videos using Support Vector...
IRJET- Review on Human Action Detection in Stored Videos using Support Vector...
 
Flow Trajectory Approach for Human Action Recognition
Flow Trajectory Approach for Human Action RecognitionFlow Trajectory Approach for Human Action Recognition
Flow Trajectory Approach for Human Action Recognition
 
Human action recognition with kinect using a joint motion descriptor
Human action recognition with kinect using a joint motion descriptorHuman action recognition with kinect using a joint motion descriptor
Human action recognition with kinect using a joint motion descriptor
 
med_poster_spie
med_poster_spiemed_poster_spie
med_poster_spie
 
Dance With AI – An interactive dance learning platform
Dance With AI – An interactive dance learning platformDance With AI – An interactive dance learning platform
Dance With AI – An interactive dance learning platform
 
Action_recognition-topic.pptx
Action_recognition-topic.pptxAction_recognition-topic.pptx
Action_recognition-topic.pptx
 
Human Activity Recognition Using Neural Network
Human Activity Recognition Using Neural NetworkHuman Activity Recognition Using Neural Network
Human Activity Recognition Using Neural Network
 
Human Action Recognition Using Deep Learning
Human Action Recognition Using Deep LearningHuman Action Recognition Using Deep Learning
Human Action Recognition Using Deep Learning
 
Action Genome: Action As Composition of Spatio Temporal Scene Graphs
Action Genome: Action As Composition of Spatio Temporal Scene GraphsAction Genome: Action As Composition of Spatio Temporal Scene Graphs
Action Genome: Action As Composition of Spatio Temporal Scene Graphs
 
On the Influence Propagation of Web Videos
On the Influence Propagation of Web VideosOn the Influence Propagation of Web Videos
On the Influence Propagation of Web Videos
 
Video Manifold Feature Extraction Based on ISOMAP
Video Manifold Feature Extraction Based on ISOMAPVideo Manifold Feature Extraction Based on ISOMAP
Video Manifold Feature Extraction Based on ISOMAP
 
Intelligent Video Surveillance System using Deep Learning
Intelligent Video Surveillance System using Deep LearningIntelligent Video Surveillance System using Deep Learning
Intelligent Video Surveillance System using Deep Learning
 
Real-Time Pertinent Maneuver Recognition for Surveillance
Real-Time Pertinent Maneuver Recognition for SurveillanceReal-Time Pertinent Maneuver Recognition for Surveillance
Real-Time Pertinent Maneuver Recognition for Surveillance
 
A Multiple Kernel Learning Based Fusion Framework for Real-Time Multi-View Ac...
A Multiple Kernel Learning Based Fusion Framework for Real-Time Multi-View Ac...A Multiple Kernel Learning Based Fusion Framework for Real-Time Multi-View Ac...
A Multiple Kernel Learning Based Fusion Framework for Real-Time Multi-View Ac...
 
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon TransformHuman Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
 
Development of wearable object detection system & blind stick for visuall...
Development of wearable object detection system & blind stick for visuall...Development of wearable object detection system & blind stick for visuall...
Development of wearable object detection system & blind stick for visuall...
 
Introduction talk to Computer Vision
Introduction talk to Computer Vision Introduction talk to Computer Vision
Introduction talk to Computer Vision
 
Silhouette analysis based action recognition via exploiting human poses
Silhouette analysis based action recognition via exploiting human posesSilhouette analysis based action recognition via exploiting human poses
Silhouette analysis based action recognition via exploiting human poses
 
K-Means Clustering in Moving Objects Extraction with Selective Background
K-Means Clustering in Moving Objects Extraction with Selective BackgroundK-Means Clustering in Moving Objects Extraction with Selective Background
K-Means Clustering in Moving Objects Extraction with Selective Background
 

More from Wesley De Neve

Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...Wesley De Neve
 
Investigating the biological relevance in trained embedding representations o...
Investigating the biological relevance in trained embedding representations o...Investigating the biological relevance in trained embedding representations o...
Investigating the biological relevance in trained embedding representations o...Wesley De Neve
 
Impact of adversarial examples on deep learning models for biomedical image s...
Impact of adversarial examples on deep learning models for biomedical image s...Impact of adversarial examples on deep learning models for biomedical image s...
Impact of adversarial examples on deep learning models for biomedical image s...Wesley De Neve
 
Learning Biologically Relevant Features Using Convolutional Neural Networks f...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...Learning Biologically Relevant Features Using Convolutional Neural Networks f...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...Wesley De Neve
 
The 5th Aslla Symposium
The 5th Aslla SymposiumThe 5th Aslla Symposium
The 5th Aslla SymposiumWesley De Neve
 
Ghent University Global Campus 101
Ghent University Global Campus 101Ghent University Global Campus 101
Ghent University Global Campus 101Wesley De Neve
 
Booklet for the First GUGC Research Symposium
Booklet for the First GUGC Research SymposiumBooklet for the First GUGC Research Symposium
Booklet for the First GUGC Research SymposiumWesley De Neve
 
Center for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusCenter for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusWesley De Neve
 
Center for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusCenter for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusWesley De Neve
 
Learning biologically relevant features using convolutional neural networks f...
Learning biologically relevant features using convolutional neural networks f...Learning biologically relevant features using convolutional neural networks f...
Learning biologically relevant features using convolutional neural networks f...Wesley De Neve
 
Towards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniquesTowards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniquesWesley De Neve
 
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...Wesley De Neve
 
GUGC Info Session - Informatics and Bioinformatics
GUGC Info Session - Informatics and BioinformaticsGUGC Info Session - Informatics and Bioinformatics
GUGC Info Session - Informatics and BioinformaticsWesley De Neve
 
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...Wesley De Neve
 
Ghent University and GUGC-K: Overview of Teaching and Research Activities
Ghent University and GUGC-K: Overview of Teaching and Research ActivitiesGhent University and GUGC-K: Overview of Teaching and Research Activities
Ghent University and GUGC-K: Overview of Teaching and Research ActivitiesWesley De Neve
 
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...Wesley De Neve
 
Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...
 Exploring Deep Machine Learning for Automatic Right Whale Recognition and No... Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...
Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...Wesley De Neve
 
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...Wesley De Neve
 
Towards using multimedia technology for biological data processing
Towards using multimedia technology for biological data processingTowards using multimedia technology for biological data processing
Towards using multimedia technology for biological data processingWesley De Neve
 
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...Wesley De Neve
 

More from Wesley De Neve (20)

Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
 
Investigating the biological relevance in trained embedding representations o...
Investigating the biological relevance in trained embedding representations o...Investigating the biological relevance in trained embedding representations o...
Investigating the biological relevance in trained embedding representations o...
 
Impact of adversarial examples on deep learning models for biomedical image s...
Impact of adversarial examples on deep learning models for biomedical image s...Impact of adversarial examples on deep learning models for biomedical image s...
Impact of adversarial examples on deep learning models for biomedical image s...
 
Learning Biologically Relevant Features Using Convolutional Neural Networks f...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...Learning Biologically Relevant Features Using Convolutional Neural Networks f...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...
 
The 5th Aslla Symposium
The 5th Aslla SymposiumThe 5th Aslla Symposium
The 5th Aslla Symposium
 
Ghent University Global Campus 101
Ghent University Global Campus 101Ghent University Global Campus 101
Ghent University Global Campus 101
 
Booklet for the First GUGC Research Symposium
Booklet for the First GUGC Research SymposiumBooklet for the First GUGC Research Symposium
Booklet for the First GUGC Research Symposium
 
Center for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusCenter for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global Campus
 
Center for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusCenter for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global Campus
 
Learning biologically relevant features using convolutional neural networks f...
Learning biologically relevant features using convolutional neural networks f...Learning biologically relevant features using convolutional neural networks f...
Learning biologically relevant features using convolutional neural networks f...
 
Towards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniquesTowards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniques
 
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
 
GUGC Info Session - Informatics and Bioinformatics
GUGC Info Session - Informatics and BioinformaticsGUGC Info Session - Informatics and Bioinformatics
GUGC Info Session - Informatics and Bioinformatics
 
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
 
Ghent University and GUGC-K: Overview of Teaching and Research Activities
Ghent University and GUGC-K: Overview of Teaching and Research ActivitiesGhent University and GUGC-K: Overview of Teaching and Research Activities
Ghent University and GUGC-K: Overview of Teaching and Research Activities
 
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
 
Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...
 Exploring Deep Machine Learning for Automatic Right Whale Recognition and No... Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...
Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...
 
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...
 
Towards using multimedia technology for biological data processing
Towards using multimedia technology for biological data processingTowards using multimedia technology for biological data processing
Towards using multimedia technology for biological data processing
 
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...
 

Recently uploaded

"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 

Recently uploaded (20)

"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 

Sparse representation based human action recognition using an action region-aware dictionary

  • 1. Sparse Representation-based Human Action Recognition using an Action Region-aware Dictionary ISM 2013 December 11, 2013 Hyun-seok Min, Wesley De Neve, and Yong Man Ro Image and Video Systems Lab Department of Electrical Engineering Korea Advanced Institute of Science and Technology (KAIST) e-mail: hsmin@kaist.ac.kr web: http://ivylab.kaist.ac.kr IEEE International Symposium on Multimedia 2013
  • 2. Outline • • • • Introduction Sparse representation-based human action recognition Experiments Conclusions and future research IEEE International Symposium on Multimedia 2013 2
  • 3. Outline • Introduction – human action recognition – problems – contributions • Sparse representation-based human action recognition • Experiments • Conclusions and future research IEEE International Symposium on Multimedia 2013 3
  • 4. Conventional approach for human action recognition Pr epr ocessing Input Input Classification Human Action Recognition Framework Segmentation Object Detection Object Tracking Video Sequence Featur e Ex tr action Cuboid SR 2D-Harris SVM LBP-TOP Keypoint Detection Output Random Forest Descriptor LMP, CUBOID IEEE International Symposium on Multimedia 2013 “Skating” 4
  • 5. Action detection vs. action recognition • A video clip consists of a context region and an action region [1] – action detection (segmentation) is required for effective action recognition [2] = Human action video clip + Context region Action region • Shortcomings of action detection – despite the great emphasis on action recognition, there is comparatively little work available on action detection [2] – there is currently no general action detection method available that shows a high level of effectiveness for every action [1] K K. Reddy and M.Shah, “Recognizing 50 Human Action Categories of Web Videos,” Machine Vision and Applications Journal , vol. 24, no. 5, pp. 971-981, 2012. [2] S.Sadanand and J.J.Corso, “Action bank: A high-level representation of activity in video,” IEEE Conf. on Computer Vision and Pattern Recognition , pp.1234-1241, 2012. IEEE International Symposium on Multimedia 2013 5
  • 6. Context information for human action recognition • Usefulness of context depends on the action class (a) (b) (c) – e.g., context is • helpful for making a distinction between (a) and (b) [3] • not helpful for making a distinction between (b) and (c) [3] Tian Lan, Yang Wang, and Greg Mori, “Discriminative Figure-Centric Models for Joint Action Localization and Recognition,” IEEE International Conference on Computer Vision (ICCV), 2011 IEEE International Symposium on Multimedia 2013 6
  • 7. Research challenges & contributions • Challenges – lack of a general method for effective and efficient action detection – the usefulness of context information depends on the type of action • Contributions – we propose a novel human action recognition method • that does not require complex action detection during testing • that uses context information in an adaptive way IEEE International Symposium on Multimedia 2013 7
  • 8. Outline • Introduction • Sparse representation-based human action recognition – conventional method – proposed method • construction of an action region-aware dictionary • use of an action region-aware dictionary • adaptive classification using split sparse coefficients • Experiments • Conclusions and future research IEEE International Symposium on Multimedia 2013 8
  • 9. Conventional SR-based method: dictionary construction … Action class 1 … … Feature extraction Action class i … … … … Action class K … … … … … i K D = [z1 ,..., z1 1 ,..., z1 ,..., ziNi ,...., z1 ,..., z K K ] ∈ ℜ d × N 1 N N IEEE International Symposium on Multimedia 2013 9
  • 10. Conventional SR-based method: classification • Input video clip, depicting 'Lifting' (true action) Given a dictionary D, the feature vector y of a test video clip V can be represented as follows y ≈ Dx∈ ℜ d , Sparse coefficients belonging to the true class Sparse coefficient value y : feature vector of V D : dictionary x : sparse coefficient vector • Given the sparse solution x, we can calculate the residual error for each human action as follows: ri (y) = y − Dδi (x) 1 1 2 3 4 5 Human action class: 1: diving 2: golf swing 6: running 7: skating 6 7 3: kicking 8: swing1 8 4: lifting 9: swing2 9 10 5: riding 10: walking ri(y) : residual for ith action δi (x) : a new vector whose only nonzero entries are the entries in x that are associated with class i IEEE International Symposium on Multimedia 2013 10
  • 11. Conventional SR-based method: dictionary shortcomings Input video clip, depicting 'Golf' (true action) • The dictionary only contains class information Sparse coefficients belonging to the true class Sparse coefficient value – we do not know the location and size of the action region of a test video clip during classification – however, we do know the location and size of the action regions in the training video clips • Research question 1 2 3 4 5 Human action class: 1: diving 2: golf swing 6: running 7: skating 6 7 3: kicking 8: swing1 8 4: lifting 9: swing2 9 10 5: riding 10: walking – how about putting the action region information of the training video clips in the dictionary? IEEE International Symposium on Multimedia 2013 11
  • 12. Proposed SR-based method: construction of an action region-aware dictionary Training video clips ... ... Segmentation during training • We propose to construct a dictionary that consists of two split dictionaries: – context region dictionary DC – action region dictionary DA Segmented regions ... ... Action regions Context regions Feature extraction Action region-aware dictionary D= ... ... DC DA D = [DC | D A ]∈ ℜ d × N IEEE International Symposium on Multimedia 2013 12
  • 13. Proposed SR-based method: use of an action region-aware dictionary (1/3) • Given an action region-aware dictionary D and the feature vector y of a test video clip V, we can compute the sparse representation of y as follows x  y ≈ D R x ≅ [D C | D A ] C  = D C xC + D A x A x A  1 i K xC = [ x1,C ,..., x1 1 ,C ,..., x1i,C ,..., xN i ,C ,..., x1KC ,..., xN K ,C ] , N 1 i i K x A = [ x1, A ,..., x1 1 , A ,..., x1, A ,..., x Ni , A ,..., x1KA ,..., x N K , A ] , N – xij,C and xij,A: the sparse coefficient values that are associated with the context and the action region of the jth training video clip of the ith human action During testing, the proposed method for human action recognition is able to automatically make a distinction between information originating from the context region and information originating from the action region in a test video clip. IEEE International Symposium on Multimedia 2013 13
  • 14. Proposed SR-based method: use of an action region-aware dictionary (2/3) Input video clip, depicting 'golf swing' (true action) The sparse coefficients belonging to the context region of the ‘golf swing’ test video clip are dispersed over the different classes. This can be attributed to the fact that the background of ‘golf swing’ is visually similar to the background of ‘kicking’, ‘riding’, and ‘walking’. ... Sparse coefficients belonging to the context region Sparse coefficient value Sparse coefficients belonging to the action region DC 1 2 3 4 5 6 DA 7 8 9 10 Human action class: 1: diving 2: golf swing 6: running 7: skating 1 2 3 3: kicking 8: swing1 4 5 6 4: lifting 9: swing2 7 8 9 10 5: riding 10: walking IEEE International Symposium on Multimedia 2013 14
  • 15. Proposed SR-based method: use of an action region-aware dictionary (3/3) Input video clip, depicting 'diving' (true action) Sparse coefficients belonging to the context region ... The sparse coefficients belonging to the context region of the ‘diving’ test video clip are concentrated in the true class. This means that the context region of ‘diving’ is different from the context regions of the other human actions. Sparse coefficient value Sparse coefficients belonging to the action region DC 1 2 3 4 5 6 DA 7 8 9 10 Human action class: 1: diving 2: golf swing 6: running 7: skating 1 2 3 3: kicking 8: swing1 4 5 6 4: lifting 9: swing2 7 8 9 10 5: riding 10: walking IEEE International Symposium on Multimedia 2013 15
  • 16. Adaptive classification using split sparse coefficients • Given the above observations, we can hypothesize that – information originating from context regions can help in successfully classifying human actions, on the condition that the sparse coefficients associated with the context regions are concentrated in the true class • Measurement of the concentration of sparse coefficients – Maximum Sparse Coefficient Concentration (MSCC) MSCC (x) = max k δk (x) 1 x1 • We can then use the following criterion to determine whether information of context regions can help in successfully classifying human actions MSCC (xC ) > ξ ratio MSCC (x A ) IEEE International Symposium on Multimedia 2013 16
  • 17. Outline • Introduction • Sparse representation-based human action recognition • Experiments – experimental setup – experimental results • Conclusions and future research IEEE International Symposium on Multimedia 2013 17
  • 18. Experimental setup (1/2) • Use of the UCF Sports Action data set – contains 150 action video clips with a resolution of 720×480, collected for various sports that are typically featured on broadcast television channels such as BBC and ESPN – for each frame, a bounding box is available around the person performing the action of interest – available action classes: diving, golf swinging, kicking, lifting, riding, running, skating, swinging, and walking Diving Running Golf swinging Kicking Lifting Skating Swinging Walking IEEE International Symposium on Multimedia 2013 Riding 18
  • 19. Experimental setup (2/2) • Comparison with – SR with action region • only makes use of action regions in the test video clips considered, thus taking advantage of segmentation information – SR with whole region • uses whole video frames, thus not exploiting segmentation information SR with whole region SR with action region IEEE International Symposium on Multimedia 2013 19
  • 20. Experimental results (1/2) • The accuracy of the proposed SR-based method for human action recognition is more stable over the different human action classes • The accuracy of the proposed method is highly independent of the type of human action – thanks to the use of a context-adaptive classification strategy IEEE International Symposium on Multimedia 2013 20
  • 21. Experimental results (2/2) • We can observe that what method is most accurate depends on the human action class considered – “SR with action region” is usually more accurate when the concentration of the sparse coefficients associated with the action region is higher than the concentration of the sparse coefficients associated with the context region – Otherwise, “SR with whole region” or “Proposed method” are more effective IEEE International Symposium on Multimedia 2013 21
  • 22. Outline • • • • Introduction Sparse representation-based human action recognition Experiments Conclusions and future research – conclusions – future research directions IEEE International Symposium on Multimedia 2013 22
  • 23. Conclusions • We proposed a novel SR-based method for human action recognition, having the following two major characteristics – first, classification does not have to apply explicit segmentation to a given test video clip – second, classification is context adaptive in nature, only leveraging information about the context in which the action took place when the concentration of the corresponding sparse coefficients is high IEEE International Symposium on Multimedia 2013 23
  • 24. Future research directions • Use of dictionary learning techniques that allow for more effective and efficient construction of an overcomplete dictionary • Perform experiments with actions that have a lower variation in background • Study how to leverage SRC by means of an action regionaware dictionary in other application scenarios IEEE International Symposium on Multimedia 2013 24
  • 25. Thank you! Any questions? e-mail: hsmin@kaist.ac.kr . web: http://ivylab.kaist.ac.kr IEEE International Symposium on Multimedia 2013 25