[VFS 2019] Human Activity Recognition Approaches

—Lao Tzu
“The journey of a thousand miles begins with one step.”

Human Activity Recognition
Approaches
Nguyễn Việt Thái / CTO of Asilla Vietnam
Vietnam Frontier Summit
2019

Some of the products of ...
Our clients include...

Introduction
Approaches for Human Activity Recognition
HAR in the future
Some applications of human activity recognition
Vision Based approaches
We are doing, Approaches in future
01
02
03

6
Atomic
actions
Group
Actions
Gestures
Human
Behaviors
Events
Human
Activities
Inte
ractions

Some dataset
Name Number of Data Categories
UCF101 13,320 101
HMDB51 6,849 51
MSRAction3D 20
ActivityNet 27,811 203
AVA 430 80
NTU RGB+D 56,880 60
NTU RGB+D 120 11,448 120

Human
Activity
Recognition
Sensor Based Vision Based

Device selection
Surveillance
camera
Data collection,
RGB, RGB-D
Preprocessing,
Feature
selection
Machine Learning
/ Deep learning
The general process of HAR

Vision
Based
Using Hand-
crafted motion
features
Depth
information
based methods
Deep learning
based methods
Approaches

Using Hand-Crafted
Motion Features
● Key-framing
● Spatial-temporal features
● Space-time interest points
● Histograms of optical flow.

Depth Information
Based Methods
● RGBD
● The interest of applying
depth data captured from
depth cameras for the
action recognition

Deep Learning
Based Methods
● CNN
● Long-Short Term
Memory
● Two-Stream Architecture
● Skeleton Based

Deep Learning Based Methods
Deep Bi-
Directional LSTM
With CNN
Features
Long-term
recurrent
convolutional
networks
Temporal
Segment Network
(TSN)
Two-Stream
Inflated 3D(I3D)
15 16
Long-term
temporal
convolutions
1817
Skeleton Based

Skeleton Based
Spatial Temporal Graph Convolutional Networks

Pose Estimation
● Top-Down approaches
○ Alphapose: MAP: 78.6% on Coco, FPS: ~20
○ Mask R-CNN: MAP: 75.6% on Coco, FPS: ~3
● Bottom-Up approaches
○ Deepercut: MAP: 70% on MPII Multi-Person, FPS: ~15 (myself)
○ Openpose: MAP: 68.2% on Coco, FPS ~ 21.7 with body+
● Other
○ Posefix: Increate 3~8% for each approaches

Depth Camera The power of
modern AI
Embedded
Ability

Requirements
● Ability to deploy on compact devices.
○ Camera, hardware device compact.
● Ability to run realtime.
● Ability to secure user information.
● Easy to deploy in different environments.

Potential
● Use 3D Features
● Unsupervised learning.
● Skeleton Based for safe in privacy
● Optimization & quantization

Does anyone have any
questions?
(+84) 0988144281
thai@asilla.net
THANKS!

Instructions for use
1. Different Approaches for Human Activity Recognition– A Survey Zawar Hussain, Michael
Sheng, Senior Member, IEEE, Wei Emma Zhang, Member, IEEE
2. Going Deeper into Action Recognition: A Survey. Samitha Herath, Mehrtash Harandi,
Fatih Porikli
3. Video Activity Recognition: State-of-the-Art. Itsaso Rodríguez-Moreno, José María
Martínez-Otzeta, Basilio Sierra, Igor Rodriguez andEkaitz Jauregi.
4. This document is the official template of presentation at Vietnam Frontier Summit 2019
(VFS) and will only be used for purposes relating to the event.
5. Temporal Segment Networks. Limin Wang1 , Yuanjun Xiong2 , Zhe Wang3 , Yu Qiao3 ,
Dahua Lin2 , Xiaoou Tang2 , and Luc Van Gool1
6. Long-term Temporal Convolutions for Action Recognition. Gul Varol, Ivan Laptev, and
Cordelia Schmid, ¨ Fellow, IEEE

[VFS 2019] Human Activity Recognition Approaches

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to [VFS 2019] Human Activity Recognition Approaches

Similar to [VFS 2019] Human Activity Recognition Approaches (20)

More from Nexus FrontierTech

More from Nexus FrontierTech (20)

Recently uploaded

Recently uploaded (20)

[VFS 2019] Human Activity Recognition Approaches