SlideShare a Scribd company logo
GaitSet
R e ga r d i n g G a i t a s a S e t Fo r
C r o s s - Vi e w G a i t R e c o g n i t i o n
Gait Recognition: Identify persons by gait
PAMIGroup FDU
Long Distance: Gait VS. Face, Fingerprint, iris…
No Need of Cooperation: Gait VS. Fingerprint, iris…
Has broad applications in crime prevention,
forensic identification and social security
Robust to the Appearance Change: Gait VS. Person re-id
Classical approach
Gait template
Segmentation, Alignment
Remove color
& texture
Extract temporal
information
Pixel-level Operations
A template aggregates information in pixel-level.
It’s not rational. Obviously, it will lose temporal information.
PAMIGroup FDU
A template aggregates information in pixel-level.
It’s not rational. Obviously, it will lose temporal information.
Neural network ought to have access to each frame.
Then… → Gait Sequence?
PAMIGroup FDU
A template aggregates information in pixel-level.
It’s not rational. Obviously, it will lose temporal information.
Neural network ought to have access to each frame.
Then… → Gait Sequence?
Order in a sequence could cause a bunch of issues:
How to unify frame rate? Unify walking speed? Align first frame?
PAMIGroup FDU
A template aggregates information in pixel-level.
It’s not rational. Obviously, it will lose temporal information.
Neural network ought to have access to each frame.
Then… → Gait Sequence?
Order in a sequence could cause a bunch of issues:
How to unify frame rate? Unify walking speed? Align first frame?
Let’s get rid of order!
SET
PAMIGroup FDU
Regarding Gait as Set
PAMIGroup FDU
Single Image Set of Image
Sequence of
Image
√
Permutation
invariance
Views
Walking
Condition
Single Multiple Multiple
Multiple
×
SingleSingle
√
GaitSet: Set Pooling (SP)
Use CNN to extract feature of each silhouette in an input set
Then What?
CNN &
Pooling
CNN &
Pooling
CNN &
Pooling
PAMIGroup FDU
GaitSet: Set Pooling (SP)
CNN &
Pooling
CNN &
Pooling
CNN &
Pooling
Set of feature maps → Feature map of set
Use CNN to extract feature of each silhouette in an input set
PAMIGroup FDU
GaitSet: Set Pooling (SP)
• A permutation invariant function
CNN &
Pooling
CNN &
Pooling
CNN &
Pooling
Set Pooling
(SP)
• Able to take a set with arbitrary cardinality
The permutation of the element in the input set
should not influence the output, formulated as:
𝐺 𝑣 𝑗
𝑗 = 1,2, … , 𝑛 = 𝐺 𝑣 𝜋(𝑗)
𝑗 = 1,2, … , 𝑛
where 𝜋 is any permutation.
To ensure the flexibility of the model
Since in real life scenario, the number of a person’s
gait silhouettes can be arbitrary
PAMIGroup FDU
GaitSet: Set Pooling (SP)
• Statistical Function
CNN &
Pooling
CNN &
Pooling
CNN &
Pooling
Set Pooling
(SP)
• Joint Functions
max ⋅ , mean ⋅ , median(⋅)
• max ⋅ + mean ⋅ + median ⋅
• 1_1𝐶(𝑐𝑎𝑡 max ⋅ , mean ⋅ , median ⋅ )
1_1𝐶: 1 × 1 convolutional layer,
𝑐𝑎𝑡: concatenate
• Attention
Copy𝑛times
1,𝑐,ℎ,𝑤
→(𝑛,𝑐,ℎ,𝑤)
𝐴
(1, 𝑐, ℎ, 𝑤)
×
(𝑛, 𝑐, ℎ, 𝑤) 1_1𝐶
𝑚𝑒𝑑𝑖𝑎𝑛
𝑚𝑒𝑎𝑛
𝑚𝑎𝑥
𝑐𝑎𝑡 + 𝑚𝑎𝑥 𝑧(𝑐, ℎ, 𝑤)
Use the global information to learn an attention
map for each frame-level feature map to refine itTorefine
featuremap
GaitSet: Horizontal Pyramid Mapping
• Strips: help the network focus on features with different scales
• Different discriminative information has different scales:
movement of hands, foots, heads, shoulders…
movement of arms, legs…
movement of whole body
• It is commonly used in person re-id
, , …
PAMIGroup FDU
GAP+GMP
𝑤
ℎ 𝑐 𝑑
𝑛=෍
𝑠=1
𝑆
2𝑠−1
fc1,1
fc2,2
fc2,1
fcS,1~2 𝑆−1
𝑑
𝑓′ 𝑓
𝑧2,1
𝑧2,2
GaitSet: Horizontal Pyramid Mapping
• Strips: help the network focus on features with different scales
• To be more efficient, we split the set-level feature map
instead of the original silhouettes.
CNN &
Pooling
CNN &
Pooling
CNN &
Pooling
SP
𝑐 𝑤
ℎ
GAP: Global Average Pooling
GMP: Global Max Pooling
• Parameters in FCs are not shared
Different strips represents features with
different scales in different positions.
PAMIGroup FDU
GaitSet: Main Pipeline
PAMIGroup FDU
CNN: extracting information from each silhouette
→ SP: aggregate frame information into set information
→ HPM: get discriminative representation
• Loss: Triplet Loss for each strip
• Test: concatenate all strips to
get the representation of the
input silhouette set
GaitSet: Pipeline
PAMIGroup FDU
• Multi-layer Global Pipeline (MGP)
Shallow layer focus on local and fine-grained information
Deep layer focus on global and coarse-grained information
Use MGP to collect various-level set information
Ablation
• Set VS. GEI (Line 1~2)
With identical network
Exceeds for over 10%
• HPM: Share VS. NOT (Line 2~3)
Exceeds around 2~3%
• MGP (Last 2 line)
Exceeds around 1~3%
• Different SP (Line 3~8)
CL: max(⋅)
BG: 1_1𝐶(𝑐𝑎𝑡 max, mean, median )
NM: attention
max(⋅) is chosen for its simplicity
CASIA-B:
124subjects×11views ×3walking conditions(10videos/view) = 13,640 videos
Walking Condition: NM: normal; BG: carrying bag; CL: with coat
PAMIGroup FDU
Effective CASIA-B:
124subjects×11views ×3walking conditions(10videos/view) = 13,640 videos
Effective
OU-MVLP:
10,307subjects(5153train/5154test)×14views ×2videos/view = 288,596 videos
• Exceed for
over 40%
PAMIGroup FDU
Fast
Directly learns the representation instead of measuring the similarity between a pair of gait.
DATA Network Representation
Learn representation
√ Linear: 𝑛 + 𝑚 × network complexity
For 𝑛 samples in probe and 𝑚 samples in gallery
Measure similarity between a pair of sample
× Quadratic: n × 𝑚 × network complexity
DATA1 Network
Same ID?
DATA2 Network
The average computational
complexity for one sample in
CASIA-B is 8.6GFLOPs
PAMIGroup FDU
Flexible: Limited Silhouettes
25.0
44.1
58.5
68.5
75.2
79.5
82.5
84.7 86.1
87.7
92.9
94.3 94.6 94.8
0.0
20.0
40.0
60.0
80.0
100.0
0 10 20 30 40 50 60 70 80 90 100
Random Images
All Images
95.0
25
Rank-1accuracy(%)
Number of selected Images
• Robustness: reach an 82.5% accuracy with 7 silhouettes
• Our method does learn the motional gait information
• The accuracy rises monotonically with the increase of the silhouette’s number
• One gait period contains around 25 silhouettes. More silhouettes will NOT bring much more
motional information. Consistently, the accuracy is close to the best performance at this position.
PAMIGroup FDU
Flexible: Multiple Views & Walking Conditions
Make full use of each silhouette
An input set can contain any number of non-consecutive silhouettes filmed
under different viewpoints with different walking conditions
Set Contains Two Views
• Combine both parallel and vertical information
• Generally, the larger the difference between
two views is, the better the results are
Set Contains Two Walking Conditions
• The accuracies rise with the increase of
silhouettes number.
• BG & CL have complementary information
PAMIGroup FDU
Thanks
Hanqing Chao
hqchao16@fudan.edu.cn
Yiwei He
heyw15@fudan.edu.cn
Junping Zhang *
jpzhang@fudan.edu.cn
Jianfeng Feng
jffeng@fudan.edu.cn

More Related Content

What's hot

1시간만에 GAN(Generative Adversarial Network) 완전 정복하기
1시간만에 GAN(Generative Adversarial Network) 완전 정복하기1시간만에 GAN(Generative Adversarial Network) 완전 정복하기
1시간만에 GAN(Generative Adversarial Network) 완전 정복하기
NAVER Engineering
 
GTC 2021: Counterfactual Learning to Rank in E-commerce
GTC 2021: Counterfactual Learning to Rank in E-commerceGTC 2021: Counterfactual Learning to Rank in E-commerce
GTC 2021: Counterfactual Learning to Rank in E-commerce
GrubhubTech
 
Bias and variance trade off
Bias and variance trade offBias and variance trade off
Bias and variance trade off
VARUN KUMAR
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
Francesco Casalegno
 
An introduction to Recommender Systems
An introduction to Recommender SystemsAn introduction to Recommender Systems
An introduction to Recommender Systems
David Zibriczky
 
Machine learning
Machine learningMachine learning
Machine learning
vaishnavip23
 
Introduction to Grad-CAM (complete version)
Introduction to Grad-CAM (complete version)Introduction to Grad-CAM (complete version)
Introduction to Grad-CAM (complete version)
Hsing-chuan Hsieh
 
Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial
Alexandros Karatzoglou
 
Deep learning for person re-identification
Deep learning for person re-identificationDeep learning for person re-identification
Deep learning for person re-identification
哲东 郑
 
Introduction to Recommendation Systems
Introduction to Recommendation SystemsIntroduction to Recommendation Systems
Introduction to Recommendation Systems
Trieu Nguyen
 
Ray tracing
Ray tracingRay tracing
Ray tracing
Farah Shaikh
 
Hate Speech in Pixels: Detection of Offensive Memes towards Automatic Moderation
Hate Speech in Pixels: Detection of Offensive Memes towards Automatic ModerationHate Speech in Pixels: Detection of Offensive Memes towards Automatic Moderation
Hate Speech in Pixels: Detection of Offensive Memes towards Automatic Moderation
Universitat Politècnica de Catalunya
 
Recommendation System
Recommendation SystemRecommendation System
Recommendation System
Anamta Sayyed
 
Dimension Reduction: What? Why? and How?
Dimension Reduction: What? Why? and How?Dimension Reduction: What? Why? and How?
Dimension Reduction: What? Why? and How?
Kazi Toufiq Wadud
 
Decision Tree Learning
Decision Tree LearningDecision Tree Learning
Decision Tree Learning
Milind Gokhale
 
Understanding Bagging and Boosting
Understanding Bagging and BoostingUnderstanding Bagging and Boosting
Understanding Bagging and Boosting
Mohit Rajput
 
Visual Object Tracking: review
Visual Object Tracking: reviewVisual Object Tracking: review
Visual Object Tracking: review
Dmytro Mishkin
 
Explainable deep learning with applications in Healthcare By Sunil Kumar Vupp...
Explainable deep learning with applications in Healthcare By Sunil Kumar Vupp...Explainable deep learning with applications in Healthcare By Sunil Kumar Vupp...
Explainable deep learning with applications in Healthcare By Sunil Kumar Vupp...
Analytics India Magazine
 
Machine Learning: Bias and Variance Trade-off
Machine Learning: Bias and Variance Trade-offMachine Learning: Bias and Variance Trade-off
Machine Learning: Bias and Variance Trade-off
International Institute of Information Technology (I²IT)
 
Reinforcement learning:policy gradient (part 1)
Reinforcement learning:policy gradient (part 1)Reinforcement learning:policy gradient (part 1)
Reinforcement learning:policy gradient (part 1)
Bean Yen
 

What's hot (20)

1시간만에 GAN(Generative Adversarial Network) 완전 정복하기
1시간만에 GAN(Generative Adversarial Network) 완전 정복하기1시간만에 GAN(Generative Adversarial Network) 완전 정복하기
1시간만에 GAN(Generative Adversarial Network) 완전 정복하기
 
GTC 2021: Counterfactual Learning to Rank in E-commerce
GTC 2021: Counterfactual Learning to Rank in E-commerceGTC 2021: Counterfactual Learning to Rank in E-commerce
GTC 2021: Counterfactual Learning to Rank in E-commerce
 
Bias and variance trade off
Bias and variance trade offBias and variance trade off
Bias and variance trade off
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
An introduction to Recommender Systems
An introduction to Recommender SystemsAn introduction to Recommender Systems
An introduction to Recommender Systems
 
Machine learning
Machine learningMachine learning
Machine learning
 
Introduction to Grad-CAM (complete version)
Introduction to Grad-CAM (complete version)Introduction to Grad-CAM (complete version)
Introduction to Grad-CAM (complete version)
 
Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial
 
Deep learning for person re-identification
Deep learning for person re-identificationDeep learning for person re-identification
Deep learning for person re-identification
 
Introduction to Recommendation Systems
Introduction to Recommendation SystemsIntroduction to Recommendation Systems
Introduction to Recommendation Systems
 
Ray tracing
Ray tracingRay tracing
Ray tracing
 
Hate Speech in Pixels: Detection of Offensive Memes towards Automatic Moderation
Hate Speech in Pixels: Detection of Offensive Memes towards Automatic ModerationHate Speech in Pixels: Detection of Offensive Memes towards Automatic Moderation
Hate Speech in Pixels: Detection of Offensive Memes towards Automatic Moderation
 
Recommendation System
Recommendation SystemRecommendation System
Recommendation System
 
Dimension Reduction: What? Why? and How?
Dimension Reduction: What? Why? and How?Dimension Reduction: What? Why? and How?
Dimension Reduction: What? Why? and How?
 
Decision Tree Learning
Decision Tree LearningDecision Tree Learning
Decision Tree Learning
 
Understanding Bagging and Boosting
Understanding Bagging and BoostingUnderstanding Bagging and Boosting
Understanding Bagging and Boosting
 
Visual Object Tracking: review
Visual Object Tracking: reviewVisual Object Tracking: review
Visual Object Tracking: review
 
Explainable deep learning with applications in Healthcare By Sunil Kumar Vupp...
Explainable deep learning with applications in Healthcare By Sunil Kumar Vupp...Explainable deep learning with applications in Healthcare By Sunil Kumar Vupp...
Explainable deep learning with applications in Healthcare By Sunil Kumar Vupp...
 
Machine Learning: Bias and Variance Trade-off
Machine Learning: Bias and Variance Trade-offMachine Learning: Bias and Variance Trade-off
Machine Learning: Bias and Variance Trade-off
 
Reinforcement learning:policy gradient (part 1)
Reinforcement learning:policy gradient (part 1)Reinforcement learning:policy gradient (part 1)
Reinforcement learning:policy gradient (part 1)
 

Similar to GaitSet: Regarding Gait as a Set for Cross-View Gait Recognition (tutorial)

All projects
All projectsAll projects
All projects
Karishma Jain
 
Deep Learning in Computer Vision
Deep Learning in Computer VisionDeep Learning in Computer Vision
Deep Learning in Computer Vision
Sungjoon Choi
 
Computer Vision for Beginners
Computer Vision for BeginnersComputer Vision for Beginners
Computer Vision for Beginners
Sanghamitra Deb
 
Working with Shader with Unity
Working with Shader with UnityWorking with Shader with Unity
Working with Shader with Unity
Minh Nghiem
 
Simple APIs and innovative documentation
Simple APIs and innovative documentationSimple APIs and innovative documentation
Simple APIs and innovative documentation
PyDataParis
 
Seeing what a gan cannot generate: paper review
Seeing what a gan cannot generate: paper reviewSeeing what a gan cannot generate: paper review
Seeing what a gan cannot generate: paper review
QuantUniversity
 
05 contours seg_matching
05 contours seg_matching05 contours seg_matching
05 contours seg_matching
ankit_ppt
 
그림 그리는 AI
그림 그리는 AI그림 그리는 AI
그림 그리는 AI
NAVER Engineering
 
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
StampedeCon
 
"Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ...
"Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ..."Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ...
"Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ...
Edge AI and Vision Alliance
 
Training Deep Networks with Backprop (D1L4 Insight@DCU Machine Learning Works...
Training Deep Networks with Backprop (D1L4 Insight@DCU Machine Learning Works...Training Deep Networks with Backprop (D1L4 Insight@DCU Machine Learning Works...
Training Deep Networks with Backprop (D1L4 Insight@DCU Machine Learning Works...
Universitat Politècnica de Catalunya
 
AI and Deep Learning
AI and Deep Learning AI and Deep Learning
AI and Deep Learning
Subrat Panda, PhD
 
Video Stitching using Improved RANSAC and SIFT
Video Stitching using Improved RANSAC and SIFTVideo Stitching using Improved RANSAC and SIFT
Video Stitching using Improved RANSAC and SIFT
IRJET Journal
 
Tree building 2
Tree building 2Tree building 2
Stereo Matching by Deep Learning
Stereo Matching by Deep LearningStereo Matching by Deep Learning
Stereo Matching by Deep Learning
Yu Huang
 
Evaluation of conditional images synthesis: generating a photorealistic image...
Evaluation of conditional images synthesis: generating a photorealistic image...Evaluation of conditional images synthesis: generating a photorealistic image...
Evaluation of conditional images synthesis: generating a photorealistic image...
SamanthaGallone
 
backpropagation in neural networks
backpropagation in neural networksbackpropagation in neural networks
backpropagation in neural networks
Akash Goel
 
Decomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesisDecomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesis
Naeem Shehzad
 
D3 D10 Unleashed New Features And Effects
D3 D10 Unleashed   New Features And EffectsD3 D10 Unleashed   New Features And Effects
D3 D10 Unleashed New Features And EffectsThomas Goddard
 

Similar to GaitSet: Regarding Gait as a Set for Cross-View Gait Recognition (tutorial) (20)

All projects
All projectsAll projects
All projects
 
Deep Learning in Computer Vision
Deep Learning in Computer VisionDeep Learning in Computer Vision
Deep Learning in Computer Vision
 
Computer Vision for Beginners
Computer Vision for BeginnersComputer Vision for Beginners
Computer Vision for Beginners
 
Working with Shader with Unity
Working with Shader with UnityWorking with Shader with Unity
Working with Shader with Unity
 
Mattar_PhD_Thesis
Mattar_PhD_ThesisMattar_PhD_Thesis
Mattar_PhD_Thesis
 
Simple APIs and innovative documentation
Simple APIs and innovative documentationSimple APIs and innovative documentation
Simple APIs and innovative documentation
 
Seeing what a gan cannot generate: paper review
Seeing what a gan cannot generate: paper reviewSeeing what a gan cannot generate: paper review
Seeing what a gan cannot generate: paper review
 
05 contours seg_matching
05 contours seg_matching05 contours seg_matching
05 contours seg_matching
 
그림 그리는 AI
그림 그리는 AI그림 그리는 AI
그림 그리는 AI
 
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
 
"Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ...
"Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ..."Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ...
"Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ...
 
Training Deep Networks with Backprop (D1L4 Insight@DCU Machine Learning Works...
Training Deep Networks with Backprop (D1L4 Insight@DCU Machine Learning Works...Training Deep Networks with Backprop (D1L4 Insight@DCU Machine Learning Works...
Training Deep Networks with Backprop (D1L4 Insight@DCU Machine Learning Works...
 
AI and Deep Learning
AI and Deep Learning AI and Deep Learning
AI and Deep Learning
 
Video Stitching using Improved RANSAC and SIFT
Video Stitching using Improved RANSAC and SIFTVideo Stitching using Improved RANSAC and SIFT
Video Stitching using Improved RANSAC and SIFT
 
Tree building 2
Tree building 2Tree building 2
Tree building 2
 
Stereo Matching by Deep Learning
Stereo Matching by Deep LearningStereo Matching by Deep Learning
Stereo Matching by Deep Learning
 
Evaluation of conditional images synthesis: generating a photorealistic image...
Evaluation of conditional images synthesis: generating a photorealistic image...Evaluation of conditional images synthesis: generating a photorealistic image...
Evaluation of conditional images synthesis: generating a photorealistic image...
 
backpropagation in neural networks
backpropagation in neural networksbackpropagation in neural networks
backpropagation in neural networks
 
Decomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesisDecomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesis
 
D3 D10 Unleashed New Features And Effects
D3 D10 Unleashed   New Features And EffectsD3 D10 Unleashed   New Features And Effects
D3 D10 Unleashed New Features And Effects
 

Recently uploaded

Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdfMudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
frank0071
 
Introduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptxIntroduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptx
zeex60
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
IqrimaNabilatulhusni
 
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Ana Luísa Pinho
 
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdfDMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
fafyfskhan251kmf
 
GBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture MediaGBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture Media
Areesha Ahmad
 
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
Wasswaderrick3
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
muralinath2
 
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Studia Poinsotiana
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
yusufzako14
 
S.1 chemistry scheme term 2 for ordinary level
S.1 chemistry scheme term 2 for ordinary levelS.1 chemistry scheme term 2 for ordinary level
S.1 chemistry scheme term 2 for ordinary level
ronaldlakony0
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
SAMIR PANDA
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
silvermistyshot
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
University of Maribor
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Sérgio Sacani
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
moosaasad1975
 
Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
IshaGoswami9
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
RenuJangid3
 
Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
tonzsalvador2222
 

Recently uploaded (20)

Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdfMudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
 
Introduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptxIntroduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptx
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
 
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
 
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdfDMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
 
GBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture MediaGBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture Media
 
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
 
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
 
S.1 chemistry scheme term 2 for ordinary level
S.1 chemistry scheme term 2 for ordinary levelS.1 chemistry scheme term 2 for ordinary level
S.1 chemistry scheme term 2 for ordinary level
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
 
Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
 
Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
 

GaitSet: Regarding Gait as a Set for Cross-View Gait Recognition (tutorial)

  • 1. GaitSet R e ga r d i n g G a i t a s a S e t Fo r C r o s s - Vi e w G a i t R e c o g n i t i o n
  • 2. Gait Recognition: Identify persons by gait PAMIGroup FDU Long Distance: Gait VS. Face, Fingerprint, iris… No Need of Cooperation: Gait VS. Fingerprint, iris… Has broad applications in crime prevention, forensic identification and social security Robust to the Appearance Change: Gait VS. Person re-id Classical approach Gait template Segmentation, Alignment Remove color & texture Extract temporal information Pixel-level Operations
  • 3. A template aggregates information in pixel-level. It’s not rational. Obviously, it will lose temporal information. PAMIGroup FDU
  • 4. A template aggregates information in pixel-level. It’s not rational. Obviously, it will lose temporal information. Neural network ought to have access to each frame. Then… → Gait Sequence? PAMIGroup FDU
  • 5. A template aggregates information in pixel-level. It’s not rational. Obviously, it will lose temporal information. Neural network ought to have access to each frame. Then… → Gait Sequence? Order in a sequence could cause a bunch of issues: How to unify frame rate? Unify walking speed? Align first frame? PAMIGroup FDU
  • 6. A template aggregates information in pixel-level. It’s not rational. Obviously, it will lose temporal information. Neural network ought to have access to each frame. Then… → Gait Sequence? Order in a sequence could cause a bunch of issues: How to unify frame rate? Unify walking speed? Align first frame? Let’s get rid of order! SET PAMIGroup FDU
  • 7. Regarding Gait as Set PAMIGroup FDU Single Image Set of Image Sequence of Image √ Permutation invariance Views Walking Condition Single Multiple Multiple Multiple × SingleSingle √
  • 8. GaitSet: Set Pooling (SP) Use CNN to extract feature of each silhouette in an input set Then What? CNN & Pooling CNN & Pooling CNN & Pooling PAMIGroup FDU
  • 9. GaitSet: Set Pooling (SP) CNN & Pooling CNN & Pooling CNN & Pooling Set of feature maps → Feature map of set Use CNN to extract feature of each silhouette in an input set PAMIGroup FDU
  • 10. GaitSet: Set Pooling (SP) • A permutation invariant function CNN & Pooling CNN & Pooling CNN & Pooling Set Pooling (SP) • Able to take a set with arbitrary cardinality The permutation of the element in the input set should not influence the output, formulated as: 𝐺 𝑣 𝑗 𝑗 = 1,2, … , 𝑛 = 𝐺 𝑣 𝜋(𝑗) 𝑗 = 1,2, … , 𝑛 where 𝜋 is any permutation. To ensure the flexibility of the model Since in real life scenario, the number of a person’s gait silhouettes can be arbitrary PAMIGroup FDU
  • 11. GaitSet: Set Pooling (SP) • Statistical Function CNN & Pooling CNN & Pooling CNN & Pooling Set Pooling (SP) • Joint Functions max ⋅ , mean ⋅ , median(⋅) • max ⋅ + mean ⋅ + median ⋅ • 1_1𝐶(𝑐𝑎𝑡 max ⋅ , mean ⋅ , median ⋅ ) 1_1𝐶: 1 × 1 convolutional layer, 𝑐𝑎𝑡: concatenate • Attention Copy𝑛times 1,𝑐,ℎ,𝑤 →(𝑛,𝑐,ℎ,𝑤) 𝐴 (1, 𝑐, ℎ, 𝑤) × (𝑛, 𝑐, ℎ, 𝑤) 1_1𝐶 𝑚𝑒𝑑𝑖𝑎𝑛 𝑚𝑒𝑎𝑛 𝑚𝑎𝑥 𝑐𝑎𝑡 + 𝑚𝑎𝑥 𝑧(𝑐, ℎ, 𝑤) Use the global information to learn an attention map for each frame-level feature map to refine itTorefine featuremap
  • 12. GaitSet: Horizontal Pyramid Mapping • Strips: help the network focus on features with different scales • Different discriminative information has different scales: movement of hands, foots, heads, shoulders… movement of arms, legs… movement of whole body • It is commonly used in person re-id , , … PAMIGroup FDU
  • 13. GAP+GMP 𝑤 ℎ 𝑐 𝑑 𝑛=෍ 𝑠=1 𝑆 2𝑠−1 fc1,1 fc2,2 fc2,1 fcS,1~2 𝑆−1 𝑑 𝑓′ 𝑓 𝑧2,1 𝑧2,2 GaitSet: Horizontal Pyramid Mapping • Strips: help the network focus on features with different scales • To be more efficient, we split the set-level feature map instead of the original silhouettes. CNN & Pooling CNN & Pooling CNN & Pooling SP 𝑐 𝑤 ℎ GAP: Global Average Pooling GMP: Global Max Pooling • Parameters in FCs are not shared Different strips represents features with different scales in different positions. PAMIGroup FDU
  • 14. GaitSet: Main Pipeline PAMIGroup FDU CNN: extracting information from each silhouette → SP: aggregate frame information into set information → HPM: get discriminative representation • Loss: Triplet Loss for each strip • Test: concatenate all strips to get the representation of the input silhouette set
  • 15. GaitSet: Pipeline PAMIGroup FDU • Multi-layer Global Pipeline (MGP) Shallow layer focus on local and fine-grained information Deep layer focus on global and coarse-grained information Use MGP to collect various-level set information
  • 16. Ablation • Set VS. GEI (Line 1~2) With identical network Exceeds for over 10% • HPM: Share VS. NOT (Line 2~3) Exceeds around 2~3% • MGP (Last 2 line) Exceeds around 1~3% • Different SP (Line 3~8) CL: max(⋅) BG: 1_1𝐶(𝑐𝑎𝑡 max, mean, median ) NM: attention max(⋅) is chosen for its simplicity CASIA-B: 124subjects×11views ×3walking conditions(10videos/view) = 13,640 videos Walking Condition: NM: normal; BG: carrying bag; CL: with coat PAMIGroup FDU
  • 17. Effective CASIA-B: 124subjects×11views ×3walking conditions(10videos/view) = 13,640 videos
  • 18. Effective OU-MVLP: 10,307subjects(5153train/5154test)×14views ×2videos/view = 288,596 videos • Exceed for over 40% PAMIGroup FDU
  • 19. Fast Directly learns the representation instead of measuring the similarity between a pair of gait. DATA Network Representation Learn representation √ Linear: 𝑛 + 𝑚 × network complexity For 𝑛 samples in probe and 𝑚 samples in gallery Measure similarity between a pair of sample × Quadratic: n × 𝑚 × network complexity DATA1 Network Same ID? DATA2 Network The average computational complexity for one sample in CASIA-B is 8.6GFLOPs PAMIGroup FDU
  • 20. Flexible: Limited Silhouettes 25.0 44.1 58.5 68.5 75.2 79.5 82.5 84.7 86.1 87.7 92.9 94.3 94.6 94.8 0.0 20.0 40.0 60.0 80.0 100.0 0 10 20 30 40 50 60 70 80 90 100 Random Images All Images 95.0 25 Rank-1accuracy(%) Number of selected Images • Robustness: reach an 82.5% accuracy with 7 silhouettes • Our method does learn the motional gait information • The accuracy rises monotonically with the increase of the silhouette’s number • One gait period contains around 25 silhouettes. More silhouettes will NOT bring much more motional information. Consistently, the accuracy is close to the best performance at this position. PAMIGroup FDU
  • 21. Flexible: Multiple Views & Walking Conditions Make full use of each silhouette An input set can contain any number of non-consecutive silhouettes filmed under different viewpoints with different walking conditions Set Contains Two Views • Combine both parallel and vertical information • Generally, the larger the difference between two views is, the better the results are Set Contains Two Walking Conditions • The accuracies rise with the increase of silhouettes number. • BG & CL have complementary information PAMIGroup FDU
  • 22. Thanks Hanqing Chao hqchao16@fudan.edu.cn Yiwei He heyw15@fudan.edu.cn Junping Zhang * jpzhang@fudan.edu.cn Jianfeng Feng jffeng@fudan.edu.cn