SlideShare a Scribd company logo
1 of 50
Pose Machines
Estimating Articulated Pose from Images
Robotics Institute
Carnegie Mellon University
Convolutional Pose Machines. Shih-En Wei, Varun Ramakrishna, Takeo Kanade, and Yaser Sheikh. IEEE Conference on Computer
Vision and Pattern Recognition (CVPR), 2016.
Pose Machines: Articulated Pose Estimation via Inference Machines. Varun Ramakrishna, Daniel Munoz, Martial Hebert, J.A. Bagnell,
Yaser Sheikh. In ECCV 2014 (Oral presentation).
2016/8/11 1
Goal: Articulated Pose Estimation
2016/8/11 2
Goal: Articulated Pose Estimation
https://www.youtube.com/watch?v=Oi_ycvFHd64&index=6&list=PLNh5A7HtLRcpsMfvyG0DED-Dr4zW5Lpcg
2016/8/11 3
Goal: Articulated Pose Estimation
https://www.youtube.com/watch?v=MsZkLK0Wcmk&list=PLNh5A7HtLRcpsMfvyG0DED-Dr4zW5Lpcg&index=1
2016/8/11 4
Which part corresponds to a body part?
• Local evidence is weak
• Part context is a strong cue
• Top-down cues are helpful2016/8/11 5
Using Local Image Evidence
Multi-ClassClassificationof Patches
g1
Image
Features
1xz
ImageLocation z
Input Image
handsfeet
Requires a high-capacity
supervised predictor capable of
handling multi-modal data2016/8/11 6
Using Local Image Evidence
A ClassicalSlidingWindowDetectionPipeline
Image
Feature
Extraction
Classification
2016/8/11 7
Local Image Evidence is Weak
• Certain parts are easier to detect than others
head neck l.shoulder l.elbow l.wrist
2016/8/11 8
Part Context is a Strong Cue
Part detection confidences provide spatial context cues
L-ShoulderL-ElbowImage Neck
2016/8/11 9
Tree Structures vs Loopy Graphs
Tree Structures
• Fast and exact
inference
• Double counting
Loopy Graphs
• Rich context
• Approximate inference
2015/9/11 10
Designing Context Representations
Context features encode responses of aprevious prediction stage
Offset
Features
Patch
Features
Image
2016/8/11 L b11
Context
Features
g2 g3
Stage II Stage III
Confidence Maps Confidence Maps
g1
Context
Features
Stage I
Confidence Maps
Stage I Confidence
Image
Features
Head Neck L-Shoulder L-Elbow L-Wrist
2016/8/11 L b12
g2g1
Context
Features
g3
Image
Features
Context
Features
Stage I
Confidence Maps
Stage II
Confidence Maps
Stage III
Confidence Maps
Stage II Confidence
Head Neck L-Shoulder L-Elbow L-Wrist
2016/8/11 L b13
g2g1 g3
Context
Features
Context
Features
Stage I
Confidence Maps
Stage II
Confidence Maps
Stage III
Confidence Maps
Image
Features
Stage III Confidence
Head Neck L-Shoulder L-Elbow L-Wrist
2016/8/11 L b14
Level 1 parts Level 2 poselet Level 3 full body
[Bourdev et al., CVPR2009][Sun et al., CVPR2012]
[Duan et al., BMVC 2012][Singh et al., ECCV2012]
[Pishchulin et al., CVPR2013] etc.
Top Down Cues are Helpful
Larger Composite Parts can be Easier to detect
2016/8/11 15
2
gT
1
gT
Stage t = (T = 3)
Context
Features
Context
Context
Features
Image
Features
Features
Context
Features
Context
Features
Context
Features
Image
Features
Image
Features
Image
Features
2g1
L g1
Stage t = 1
1
g1
Level
1
Level
2
LevelL
Image
Features
Image
Features
Image
Features
L
g2
2
g2
1
g
Stage t = 2
Incorporating Hierarchical Cues
• Each level of the hierarchy uses a separate predictor
• Context features are computed on the outputs of the previous stage
• Spatial context information is passed across layers via context features
L
gT
2016/8/11 16
1g2
1g1
Level
1
1
gT
Image
Features
Image
Features
Image
Features
Context
Features
Context
Features
Level
2
2g1
L g1
L g2
2g2
Stage t = 1 Stage t = 2
Level I Confidence Maps
L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank.
L gT
2gT
Stage t = (T = 3)
Context
Features
Context
Features
Context
Features
Context
Features
LevelL
Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd.
StageIStageIIStageIII
2016/8/11 L b17
Stage t = 2
Level 2 Confidence Maps
StageIStageIIStageIII
Head+Sho L.Arm R.Arm Torso L.Leg Bkgd.R.Leg
1g2
1g1
Level
1
1
gT
Image
Features
Image
Features
Image
Features
Context
Features
Context
Features
Level
2
2g1
L g1
L g2
2g2
Stage t = 1
L gT
2gT
Stage t = (T = 3)
Context
Features
Context
Features
Context
Features
Context
Features
LevelL
2016/8/11 L b18
Stage t = 2
Level 3 Confidence Maps
Torso Bkgd.
Stage
I
Stage
II
Stage
III
1g2
1g1
Level
1
1
gT
Image
Features
Image
Features
Image
Features
Context
Features
Context
Features
Level
2
2g1
L g1
L g2
2g2
Stage t = 1
L gT
2gT
Stage t = (T = 3)
Context
Features
Context
Features
Context
Features
Context
Features
LevelL
2016/8/11 L b19
1g2
1g1
Level
1
1gT
Image
Features
Image
Features
Image
Features
Context
Features
Context
Features
Level
2
2g1
L g1
L g2
2g2
Stage t = 1 Stage t = 2
L gT
2gT
Stage t = (T = 3)
Context
Features
Context
Features
Context
Features
Context
Features
LevelL
Fully Connected Model
2016/8/11 L b20
Pose Machines
Sequential Prediction with Spatial Context
Training reduces to training multiple
supervised classifiers
g2g1 g3
Context
Features
Context
Features
Stage I
Confidence Maps
Stage II
Confidence Maps
Stage III
Confidence MapsImage
Features
Image
Features
Image
Features
No structured loss function
No specialized solvers
No handcrafted spatial model
Spatial model is learned implicitly by the
classifiers in a data-driven fashion
2016/8/11 21
Learning Feature Representations
• Convolutional Architectures for Feature Embedding
2016/8/11 22
Learning Context Representations
• Large Receptive Fields as a Design Criterion
2016/8/11 23
Learning Context Representations
• Large Receptive Fields Improve Pose Estimation
2016/8/11 24
Convolutional Pose Machines
• Designing a Convolutional Architecture
2016/8/11 25
Learning
• Joint Training with Intermediate Supervision
𝑓𝑡 = − 2
2 Loss: Euclidean distance
groundtruth prediction
Network without Intermediate Supervision leads vanishing gradients
2016/8/11 26
Input Stage 1
Layer 1 Layer 3 Layer 6
4
1 10
3
10
Epoch
10
2
1
10
0
10
Output
Layer 18
Stage 2
Layer 7 Layer 9 Layer 12 Layer 13
Stage 3
Layer 15
4
2 10
3
10
Epoch
10
2
1
10
0
10
4
3 10
3
10
Epoch
10
2
1
10
0
10
−0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5
Supervision Supervision
Histograms of Gradient Magnitude During Training
Supervision
Learning
Intermediate Supervision Addresses Vanishing Gradients
Gradient Magnitude
10
Gradient (× 10−3
) With Intermediate Supervision Without Intermediate Supervision
0
10
1
10
2
10
3
10
4
Input
Image
h w 3
5⇥5
C
5⇥5
C
2⇥ 5⇥5 9⇥9 1⇥1 1⇥1
P C C C C
9⇥9
C
9⇥9 13⇥13 13⇥13 15⇥15 1⇥1 1⇥1
C C C C C C
2⇥
P
5⇥5
C
5⇥5
C
5⇥5
C
2⇥
P
2⇥
P
Input
Image
h w 3
h0 w0
P1+1 P1+1
9⇥9
C
Loss
1
f 2
Loss
1f 1
x1
1
x1
2 9⇥9 13⇥13 13⇥13 15⇥15 1⇥1 1⇥1
C C C C C C
5⇥5 2⇥ 5⇥5 2⇥ 5⇥5
C P C P C
Input
Image
h w 3
h0 w0
P1+1
Loss
1f 3
x1
2
h0 w0
Stage 3, level 1Stage 2, level 1Stage 1, level 1
2016/8/11 27
Input
Layer 1
Output
Layer 18
10
0
10
1
10
2
10
3
10
4
Epoch1
Stage 1
Layer 3 Layer 6 Layer 7
Stage 2
Layer 9 Layer 12 Layer 13
Stage 3
Layer 15
10
0
10
1
10
2
10
3
10
4
Epoch2
−0.5 0.0 0.5
10
0
10
1
10
2
10
3
10
4
Epoch3
−0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5
Histograms of Gradient Magnitude During Training
Supervision Supervision Supervision
Input
Image
h w 3
5⇥5
C
5⇥5
C
2⇥ 5⇥5 9⇥9 1⇥1 1⇥1
P C C C C
9⇥9
C
9⇥9 13⇥13 13⇥13 15⇥15 1⇥1 1⇥1
C C C C C C
2⇥
P
5⇥5
C
5⇥5
C
5⇥5
C
2⇥
P
2⇥
P
Input
Image
h w 3
h0 w0
P1+1 P1+1
9⇥9
C
Loss
1
f 2
Loss
1f 1
x1
1
x1
2 9⇥9 13⇥13 13⇥13 15⇥15 1⇥1 1⇥1
C C C C C C
5⇥5 2⇥ 5⇥5 2⇥ 5⇥5
C P C P C
Input
Image
h w 3
h0 w0
P1+1
Loss
1f 3
x1
2
h0 w0
Gradient (× 10−3
) With Intermediate Supervision Without Intermediate Supervision
Stage 3, level 1Stage 2, level 1Stage 1, level 1
Learning
Intermediate Supervision Addresses Vanishing Gradients
2016/8/11 28
0
0
Detectionrate%
(i) With Intermediate Supervision (IS)
(ii) Stagewise
(iii) IS + Stagewise Pretrain
(iv) Without Intermediate Supervision
0.05 0.1 0.15 0.2
Normalized distance
100
90
80
70
60
50
40
30
20
10
PCK total, LSP OC
Learning
Comparison of Learning Methods
2016/8/11 29
Qualitative Results
2016/8/11 L b30
Evaluation
Qualitative Examples on LEEDS (Person-centric)
2016/8/11 L b31
Evaluation
Qualitative Examples on MPI (Person-centric)
2016/8/11 L b32
Resolving Symmetric Confusions
LeftRight
t = 1 t =
2
t =
3
Wrists
2016/8/11 L b33
Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd.
Predicted
Pose
Level 1 Part Confidences
L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank.
Stage
II
Stage
I
Stage
III
Ablative Spatial Analysis
2016/8/11 34
Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd.
Predicted
Pose
Stage
II
Stage
I
Stage
III
Level 1 Part Confidences
L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank.
Predicted confidences are resilient to missing
context (of one part)
Context from the confidence map of
head is removed
Ablative Spatial Analysis
2016/8/11 35
Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd.
Predicted
Pose
Level 1 Part Confidences
L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank.
Predicted confidences are resilient to missing
context (of one part)
Stage
II
Stage
I
Stage
III
Ablative Spatial Analysis
2016/8/11 36
Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd.
Predicted
Pose
Level 1 Part Confidences
L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank.
Predicted confidences are resilient to missing
context (of one part)
Stage
II
Stage
I
Stage
III
Ablative Spatial Analysis
2016/8/11 37
Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd.
Level 1 Part Confidences
L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank.
Predicted confidences are resilient to missing
context (of one part)
StageIIStageIStageIII
Predicted
Pose
Ablative Spatial Analysis
2016/8/11 38
Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd.
Predicted
Pose
Level 1 Part Confidences
L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank.
Predicted confidences are resilient to missing
context (of one part)
Stage
II
Stage
I
Stage
III
Ablative Spatial Analysis
2016/8/11 39
Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd.
Predicted
Pose
Level 1 Part Confidences
L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank.
Predicted confidences are resilient to missing
context (of one part)
Stage
II
Stage
I
Stage
III
Ablative Spatial Analysis
2016/8/11 40
Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd.
Predicted
Pose
Level 1 Part Confidences
L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank.
Predicted confidences are resilient to missing
context (of one part)
Stage
II
Stage
I
Stage
III
Ablative Spatial Analysis
2016/8/11 41
Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd.
Predicted
Pose
Level 1 Part Confidences
L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank.
Predicted confidences are resilient to missing
context (of one part)
Stage
II
Stage
I
Stage
III
Ablative Spatial Analysis
2016/8/11 42
Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd.
Predicted
Pose
Level 1 Part Confidences
L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank.
Predicted confidences are resilient to missing
context (of one part)
Stage
II
Stage
I
Stage
III
Ablative Spatial Analysis
2016/8/11 43
Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd.
Predicted
Pose
Level 1 Part Confidences
L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank.
Predicted confidences are resilient to missing
context (of one part)
Stage
II
Stage
I
Stage
III
Ablative Spatial Analysis
2016/8/11 44
Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd.
Predicted
Pose
Level 1 Part Confidences
L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank.
Predicted confidences are resilient to missing
context (of one part)
Stage
II
Stage
I
Stage
III
Ablative Spatial Analysis
2016/8/11 45
Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd.
Predicted
Pose
Level 1 Part Confidences
L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank.
Predicted confidences are resilient to missing
context (of one part)
Stage
II
Stage
I
Stage
III
Ablative Spatial Analysis
2016/8/11 46
Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd.
Predicted
Pose
Level 1 Part Confidences
L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank.
Predicted confidences are resilient to missing
context (of one part)
Stage
II
Stage
I
Stage
III
Ablative Spatial Analysis
2016/8/11 47
Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd.
Predicted
Pose
Level 1 Part Confidences
L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank.
Predicted confidences are resilient to missing
context (of one part)
Stage
II
Stage
I
Stage
III
Ablative Spatial Analysis
2016/8/11 48
0 0.05 0.1 0.15
Normalized distance
0.2 0
0
100
90
80
70
60
50
40
30
20
10
Detectionrate%
Ours 3−Stage 2−Level
Tompson et al., CVPR’15
Tompson et al., NIPS’14
Chen&Yullie, NIPS’14
Toshev et al., CVPR’14
Sapp et al., CVPR’13
Evaluation
PCK Performance Comparison on FLIC dataset
PCK wrist, FLIC
0.05 0.1 0.15
Normalized distance
0.2
PCK elbow, FLIC
2016/8/11 49
0 0.05 0.1 0.15
Normalized distance
Ours 3−Stage 2−Level
0.2 0
0
100
90
80
70
60
50
40
30
20
10
PCK total, LSP PC
Detectionrate%
Tompson et al., NIPS’14 Pishchulin et al., ICCV’13 Chen&Yuille, NIPS’14 Wang et al., CVPR’13
0.05 0.1 0.15 0.2 0
Normalized distance
0.05 0.1 0.15 0.2 0
Normalized distance
PCK wrist&elbow, LSP PC
0.05 0.1 0.15 0.2 0
Normalized distance
PCK knee, LSP PC
0.05 0.1 0.15 0.2
PCK ankle, LSP PC
Normalized distance
PCK hip, LSP PC
Evaluation
PCK Performance Comparison on LEEDS dataset (Person-centric)
2016/8/11 50

More Related Content

What's hot

Human action recognition with kinect using a joint motion descriptor
Human action recognition with kinect using a joint motion descriptorHuman action recognition with kinect using a joint motion descriptor
Human action recognition with kinect using a joint motion descriptorSoma Boubou
 
Learning from Computer Simulation to Tackle Real-World Problems
Learning from Computer Simulation to Tackle Real-World ProblemsLearning from Computer Simulation to Tackle Real-World Problems
Learning from Computer Simulation to Tackle Real-World ProblemsNAVER Engineering
 
Attentive semantic alignment with offset aware correlation kernels
Attentive semantic alignment with offset aware correlation kernelsAttentive semantic alignment with offset aware correlation kernels
Attentive semantic alignment with offset aware correlation kernelsNAVER Engineering
 
Scanning 3 d full human bodies using kinects
Scanning 3 d full human bodies using kinectsScanning 3 d full human bodies using kinects
Scanning 3 d full human bodies using kinectsFensa Saj
 
Sparse representation based human action recognition using an action region-a...
Sparse representation based human action recognition using an action region-a...Sparse representation based human action recognition using an action region-a...
Sparse representation based human action recognition using an action region-a...Wesley De Neve
 
SPPRA'2013 Paper Presentation
SPPRA'2013 Paper PresentationSPPRA'2013 Paper Presentation
SPPRA'2013 Paper PresentationActiveEon
 
Visual Object Tracking: review
Visual Object Tracking: reviewVisual Object Tracking: review
Visual Object Tracking: reviewDmytro Mishkin
 
Recent advances on low-rank and sparse decomposition for moving object detection
Recent advances on low-rank and sparse decomposition for moving object detectionRecent advances on low-rank and sparse decomposition for moving object detection
Recent advances on low-rank and sparse decomposition for moving object detectionActiveEon
 
Double-constrained RPCA based on Saliency Maps for Foreground Detection in Au...
Double-constrained RPCA based on Saliency Maps for Foreground Detection in Au...Double-constrained RPCA based on Saliency Maps for Foreground Detection in Au...
Double-constrained RPCA based on Saliency Maps for Foreground Detection in Au...ActiveEon
 
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...ActiveEon
 
PhD Thesis Defense Presentation: Robust Low-rank and Sparse Decomposition for...
PhD Thesis Defense Presentation: Robust Low-rank and Sparse Decomposition for...PhD Thesis Defense Presentation: Robust Low-rank and Sparse Decomposition for...
PhD Thesis Defense Presentation: Robust Low-rank and Sparse Decomposition for...ActiveEon
 
Achieving undelayed initialization in monocular slam with generalized objects...
Achieving undelayed initialization in monocular slam with generalized objects...Achieving undelayed initialization in monocular slam with generalized objects...
Achieving undelayed initialization in monocular slam with generalized objects...Chen-Han Hsiao
 
Action Genome: Action As Composition of Spatio Temporal Scene Graphs
Action Genome: Action As Composition of Spatio Temporal Scene GraphsAction Genome: Action As Composition of Spatio Temporal Scene Graphs
Action Genome: Action As Composition of Spatio Temporal Scene GraphsSangmin Woo
 
Robust Low-rank and Sparse Decomposition for Moving Object Detection
Robust Low-rank and Sparse Decomposition for Moving Object DetectionRobust Low-rank and Sparse Decomposition for Moving Object Detection
Robust Low-rank and Sparse Decomposition for Moving Object DetectionActiveEon
 

What's hot (20)

Human action recognition with kinect using a joint motion descriptor
Human action recognition with kinect using a joint motion descriptorHuman action recognition with kinect using a joint motion descriptor
Human action recognition with kinect using a joint motion descriptor
 
Learning from Computer Simulation to Tackle Real-World Problems
Learning from Computer Simulation to Tackle Real-World ProblemsLearning from Computer Simulation to Tackle Real-World Problems
Learning from Computer Simulation to Tackle Real-World Problems
 
Attentive semantic alignment with offset aware correlation kernels
Attentive semantic alignment with offset aware correlation kernelsAttentive semantic alignment with offset aware correlation kernels
Attentive semantic alignment with offset aware correlation kernels
 
Scanning 3 d full human bodies using kinects
Scanning 3 d full human bodies using kinectsScanning 3 d full human bodies using kinects
Scanning 3 d full human bodies using kinects
 
Video stabilization
Video stabilizationVideo stabilization
Video stabilization
 
Sparse representation based human action recognition using an action region-a...
Sparse representation based human action recognition using an action region-a...Sparse representation based human action recognition using an action region-a...
Sparse representation based human action recognition using an action region-a...
 
SPPRA'2013 Paper Presentation
SPPRA'2013 Paper PresentationSPPRA'2013 Paper Presentation
SPPRA'2013 Paper Presentation
 
Visual Object Tracking: review
Visual Object Tracking: reviewVisual Object Tracking: review
Visual Object Tracking: review
 
PR12-CAM
PR12-CAMPR12-CAM
PR12-CAM
 
Recent advances on low-rank and sparse decomposition for moving object detection
Recent advances on low-rank and sparse decomposition for moving object detectionRecent advances on low-rank and sparse decomposition for moving object detection
Recent advances on low-rank and sparse decomposition for moving object detection
 
998-isvc16
998-isvc16998-isvc16
998-isvc16
 
Presentation of Visual Tracking
Presentation of Visual TrackingPresentation of Visual Tracking
Presentation of Visual Tracking
 
Double-constrained RPCA based on Saliency Maps for Foreground Detection in Au...
Double-constrained RPCA based on Saliency Maps for Foreground Detection in Au...Double-constrained RPCA based on Saliency Maps for Foreground Detection in Au...
Double-constrained RPCA based on Saliency Maps for Foreground Detection in Au...
 
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
 
PhD Thesis Defense Presentation: Robust Low-rank and Sparse Decomposition for...
PhD Thesis Defense Presentation: Robust Low-rank and Sparse Decomposition for...PhD Thesis Defense Presentation: Robust Low-rank and Sparse Decomposition for...
PhD Thesis Defense Presentation: Robust Low-rank and Sparse Decomposition for...
 
Achieving undelayed initialization in monocular slam with generalized objects...
Achieving undelayed initialization in monocular slam with generalized objects...Achieving undelayed initialization in monocular slam with generalized objects...
Achieving undelayed initialization in monocular slam with generalized objects...
 
Action Genome: Action As Composition of Spatio Temporal Scene Graphs
Action Genome: Action As Composition of Spatio Temporal Scene GraphsAction Genome: Action As Composition of Spatio Temporal Scene Graphs
Action Genome: Action As Composition of Spatio Temporal Scene Graphs
 
Robust Low-rank and Sparse Decomposition for Moving Object Detection
Robust Low-rank and Sparse Decomposition for Moving Object DetectionRobust Low-rank and Sparse Decomposition for Moving Object Detection
Robust Low-rank and Sparse Decomposition for Moving Object Detection
 
Moving object detection
Moving object detectionMoving object detection
Moving object detection
 
Video Object Segmentation - Laura Leal-Taixé - UPC Barcelona 2018
Video Object Segmentation - Laura Leal-Taixé - UPC Barcelona 2018Video Object Segmentation - Laura Leal-Taixé - UPC Barcelona 2018
Video Object Segmentation - Laura Leal-Taixé - UPC Barcelona 2018
 

Viewers also liked

Convolutional Pose Machines
Convolutional Pose MachinesConvolutional Pose Machines
Convolutional Pose MachinesTakanori Ogata
 
Deformable Part Models are Convolutional Neural Networks
Deformable Part Models are Convolutional Neural NetworksDeformable Part Models are Convolutional Neural Networks
Deformable Part Models are Convolutional Neural NetworksWei Yang
 
Articulated human pose estimation by deep learning
Articulated human pose estimation by deep learningArticulated human pose estimation by deep learning
Articulated human pose estimation by deep learningWei Yang
 
Deep convolutional neural fields for depth estimation from a single image
Deep convolutional neural fields for depth estimation from a single imageDeep convolutional neural fields for depth estimation from a single image
Deep convolutional neural fields for depth estimation from a single imageWei Yang
 
Human Pose Estimation by Deep Learning
Human Pose Estimation by Deep LearningHuman Pose Estimation by Deep Learning
Human Pose Estimation by Deep LearningWei Yang
 
Manifold learning
Manifold learningManifold learning
Manifold learningWei Yang
 
20 Instagram Pics that will Have You Wanting to Visit the Grand Canyon Just ...
20 Instagram Pics that will Have You Wanting to Visit the  Grand Canyon Just ...20 Instagram Pics that will Have You Wanting to Visit the  Grand Canyon Just ...
20 Instagram Pics that will Have You Wanting to Visit the Grand Canyon Just ...Grand Canyon Visitor Center
 
Docking Pose Assessment: The importance of keeping your GARD up
Docking Pose Assessment: The importance of keeping your GARD upDocking Pose Assessment: The importance of keeping your GARD up
Docking Pose Assessment: The importance of keeping your GARD upDavid Thompson
 
All pose face alignment robust to occlusion
All pose face alignment robust to occlusionAll pose face alignment robust to occlusion
All pose face alignment robust to occlusionJongju Shin
 
[Mmlab seminar 2016] deep learning for human pose estimation
[Mmlab seminar 2016] deep learning for human pose estimation[Mmlab seminar 2016] deep learning for human pose estimation
[Mmlab seminar 2016] deep learning for human pose estimationWei Yang
 
How to Get My Paper Accepted at Top Software Engineering Conferences
How to Get My Paper Accepted at Top Software Engineering ConferencesHow to Get My Paper Accepted at Top Software Engineering Conferences
How to Get My Paper Accepted at Top Software Engineering ConferencesAlex Orso
 
Single person pose recognition and tracking
Single person pose recognition and trackingSingle person pose recognition and tracking
Single person pose recognition and trackingJavier_Barbadillo
 
Efficient Running with Pose Method
Efficient Running with Pose MethodEfficient Running with Pose Method
Efficient Running with Pose Methodsuzyhgoodwin
 
Deep Learningライブラリ 色々つかってみた感想まとめ
Deep Learningライブラリ 色々つかってみた感想まとめDeep Learningライブラリ 色々つかってみた感想まとめ
Deep Learningライブラリ 色々つかってみた感想まとめTakanori Ogata
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Jia-Bin Huang
 
科学と機械学習のあいだ:変量の設計・変換・選択・交互作用・線形性
科学と機械学習のあいだ:変量の設計・変換・選択・交互作用・線形性科学と機械学習のあいだ:変量の設計・変換・選択・交互作用・線形性
科学と機械学習のあいだ:変量の設計・変換・選択・交互作用・線形性Ichigaku Takigawa
 

Viewers also liked (19)

Convolutional Pose Machines
Convolutional Pose MachinesConvolutional Pose Machines
Convolutional Pose Machines
 
Deformable Part Models are Convolutional Neural Networks
Deformable Part Models are Convolutional Neural NetworksDeformable Part Models are Convolutional Neural Networks
Deformable Part Models are Convolutional Neural Networks
 
Articulated human pose estimation by deep learning
Articulated human pose estimation by deep learningArticulated human pose estimation by deep learning
Articulated human pose estimation by deep learning
 
Deep convolutional neural fields for depth estimation from a single image
Deep convolutional neural fields for depth estimation from a single imageDeep convolutional neural fields for depth estimation from a single image
Deep convolutional neural fields for depth estimation from a single image
 
Human Pose Estimation by Deep Learning
Human Pose Estimation by Deep LearningHuman Pose Estimation by Deep Learning
Human Pose Estimation by Deep Learning
 
Manifold learning
Manifold learningManifold learning
Manifold learning
 
Monocular Human Pose Estimation with Bayesian Networks
Monocular Human Pose Estimation with Bayesian NetworksMonocular Human Pose Estimation with Bayesian Networks
Monocular Human Pose Estimation with Bayesian Networks
 
20 Instagram Pics that will Have You Wanting to Visit the Grand Canyon Just ...
20 Instagram Pics that will Have You Wanting to Visit the  Grand Canyon Just ...20 Instagram Pics that will Have You Wanting to Visit the  Grand Canyon Just ...
20 Instagram Pics that will Have You Wanting to Visit the Grand Canyon Just ...
 
Docking Pose Assessment: The importance of keeping your GARD up
Docking Pose Assessment: The importance of keeping your GARD upDocking Pose Assessment: The importance of keeping your GARD up
Docking Pose Assessment: The importance of keeping your GARD up
 
All pose face alignment robust to occlusion
All pose face alignment robust to occlusionAll pose face alignment robust to occlusion
All pose face alignment robust to occlusion
 
Towards the Extended Pose
Towards the Extended PoseTowards the Extended Pose
Towards the Extended Pose
 
[Mmlab seminar 2016] deep learning for human pose estimation
[Mmlab seminar 2016] deep learning for human pose estimation[Mmlab seminar 2016] deep learning for human pose estimation
[Mmlab seminar 2016] deep learning for human pose estimation
 
How to Get My Paper Accepted at Top Software Engineering Conferences
How to Get My Paper Accepted at Top Software Engineering ConferencesHow to Get My Paper Accepted at Top Software Engineering Conferences
How to Get My Paper Accepted at Top Software Engineering Conferences
 
Single person pose recognition and tracking
Single person pose recognition and trackingSingle person pose recognition and tracking
Single person pose recognition and tracking
 
Efficient Running with Pose Method
Efficient Running with Pose MethodEfficient Running with Pose Method
Efficient Running with Pose Method
 
Deep Learningライブラリ 色々つかってみた感想まとめ
Deep Learningライブラリ 色々つかってみた感想まとめDeep Learningライブラリ 色々つかってみた感想まとめ
Deep Learningライブラリ 色々つかってみた感想まとめ
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)
 
科学と機械学習のあいだ:変量の設計・変換・選択・交互作用・線形性
科学と機械学習のあいだ:変量の設計・変換・選択・交互作用・線形性科学と機械学習のあいだ:変量の設計・変換・選択・交互作用・線形性
科学と機械学習のあいだ:変量の設計・変換・選択・交互作用・線形性
 
Pose
PosePose
Pose
 

Similar to Pose Machine

Multimodal Residual Networks for Visual QA
Multimodal Residual Networks for Visual QAMultimodal Residual Networks for Visual QA
Multimodal Residual Networks for Visual QAJin-Hwa Kim
 
Convolutional Neural Networks CNN
Convolutional Neural Networks CNNConvolutional Neural Networks CNN
Convolutional Neural Networks CNNAbdullah al Mamun
 
Risking Everything with Akka Streams
Risking Everything with Akka StreamsRisking Everything with Akka Streams
Risking Everything with Akka Streamsjohofer
 
Multimodal Residual Learning for Visual Question-Answering
Multimodal Residual Learning for Visual Question-AnsweringMultimodal Residual Learning for Visual Question-Answering
Multimodal Residual Learning for Visual Question-AnsweringNAVER D2
 
Spatially resolved pair correlation functions for point cloud data
Spatially resolved pair correlation functions for point cloud dataSpatially resolved pair correlation functions for point cloud data
Spatially resolved pair correlation functions for point cloud dataTony Fast
 
ACC_dohuan
ACC_dohuanACC_dohuan
ACC_dohuanDo Huan
 
Anomaly Detection in Sequences of Short Text Using Iterative Language Models
Anomaly Detection in Sequences of Short Text Using Iterative Language ModelsAnomaly Detection in Sequences of Short Text Using Iterative Language Models
Anomaly Detection in Sequences of Short Text Using Iterative Language ModelsCynthia Freeman
 
Fcv learn yu
Fcv learn yuFcv learn yu
Fcv learn yuzukun
 
Learning Biologically Relevant Features Using Convolutional Neural Networks f...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...Learning Biologically Relevant Features Using Convolutional Neural Networks f...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...Wesley De Neve
 
Mining interesting locations and travel sequences from gps trajectories
Mining interesting locations and travel sequences from gps trajectoriesMining interesting locations and travel sequences from gps trajectories
Mining interesting locations and travel sequences from gps trajectoriesHopeBay Technologies, Inc.
 
Lecture 06 marco aurelio ranzato - deep learning
Lecture 06   marco aurelio ranzato - deep learningLecture 06   marco aurelio ranzato - deep learning
Lecture 06 marco aurelio ranzato - deep learningmustafa sarac
 
An incremental algorithm for transition-based CCG parsing
An incremental algorithm for transition-based CCG parsingAn incremental algorithm for transition-based CCG parsing
An incremental algorithm for transition-based CCG parsingAkira Miyazawa
 
Open Topology: A Toolkit for Brain Isosurface Correction-776
Open Topology: A Toolkit for Brain Isosurface Correction-776Open Topology: A Toolkit for Brain Isosurface Correction-776
Open Topology: A Toolkit for Brain Isosurface Correction-776Kitware Kitware
 
BlinkDB and G-OLA: Supporting Continuous Answers with Error Bars in SparkSQL-...
BlinkDB and G-OLA: Supporting Continuous Answers with Error Bars in SparkSQL-...BlinkDB and G-OLA: Supporting Continuous Answers with Error Bars in SparkSQL-...
BlinkDB and G-OLA: Supporting Continuous Answers with Error Bars in SparkSQL-...Spark Summit
 
Geohydrology ii (3)
Geohydrology ii (3)Geohydrology ii (3)
Geohydrology ii (3)Amro Elfeki
 

Similar to Pose Machine (20)

Multimodal Residual Networks for Visual QA
Multimodal Residual Networks for Visual QAMultimodal Residual Networks for Visual QA
Multimodal Residual Networks for Visual QA
 
Convolutional Neural Networks CNN
Convolutional Neural Networks CNNConvolutional Neural Networks CNN
Convolutional Neural Networks CNN
 
Risking Everything with Akka Streams
Risking Everything with Akka StreamsRisking Everything with Akka Streams
Risking Everything with Akka Streams
 
Multimodal Residual Learning for Visual Question-Answering
Multimodal Residual Learning for Visual Question-AnsweringMultimodal Residual Learning for Visual Question-Answering
Multimodal Residual Learning for Visual Question-Answering
 
Spatially resolved pair correlation functions for point cloud data
Spatially resolved pair correlation functions for point cloud dataSpatially resolved pair correlation functions for point cloud data
Spatially resolved pair correlation functions for point cloud data
 
Sequential Query Expansion using Concept Graph
Sequential Query Expansion using Concept GraphSequential Query Expansion using Concept Graph
Sequential Query Expansion using Concept Graph
 
ACC_dohuan
ACC_dohuanACC_dohuan
ACC_dohuan
 
Anomaly Detection in Sequences of Short Text Using Iterative Language Models
Anomaly Detection in Sequences of Short Text Using Iterative Language ModelsAnomaly Detection in Sequences of Short Text Using Iterative Language Models
Anomaly Detection in Sequences of Short Text Using Iterative Language Models
 
Lec11 object-re-id
Lec11 object-re-idLec11 object-re-id
Lec11 object-re-id
 
Smart Room Gesture Control
Smart Room Gesture ControlSmart Room Gesture Control
Smart Room Gesture Control
 
Fcv learn yu
Fcv learn yuFcv learn yu
Fcv learn yu
 
Learning Biologically Relevant Features Using Convolutional Neural Networks f...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...Learning Biologically Relevant Features Using Convolutional Neural Networks f...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...
 
Mining interesting locations and travel sequences from gps trajectories
Mining interesting locations and travel sequences from gps trajectoriesMining interesting locations and travel sequences from gps trajectories
Mining interesting locations and travel sequences from gps trajectories
 
Lecture 06 marco aurelio ranzato - deep learning
Lecture 06   marco aurelio ranzato - deep learningLecture 06   marco aurelio ranzato - deep learning
Lecture 06 marco aurelio ranzato - deep learning
 
An incremental algorithm for transition-based CCG parsing
An incremental algorithm for transition-based CCG parsingAn incremental algorithm for transition-based CCG parsing
An incremental algorithm for transition-based CCG parsing
 
Intoduction to Neural Network
Intoduction to Neural NetworkIntoduction to Neural Network
Intoduction to Neural Network
 
Open Topology: A Toolkit for Brain Isosurface Correction-776
Open Topology: A Toolkit for Brain Isosurface Correction-776Open Topology: A Toolkit for Brain Isosurface Correction-776
Open Topology: A Toolkit for Brain Isosurface Correction-776
 
BlinkDB and G-OLA: Supporting Continuous Answers with Error Bars in SparkSQL-...
BlinkDB and G-OLA: Supporting Continuous Answers with Error Bars in SparkSQL-...BlinkDB and G-OLA: Supporting Continuous Answers with Error Bars in SparkSQL-...
BlinkDB and G-OLA: Supporting Continuous Answers with Error Bars in SparkSQL-...
 
Geohydrology ii (3)
Geohydrology ii (3)Geohydrology ii (3)
Geohydrology ii (3)
 
(Spring 2012) IT 345 Posters
(Spring 2012) IT 345 Posters(Spring 2012) IT 345 Posters
(Spring 2012) IT 345 Posters
 

Recently uploaded

Introduction of Human Body & Structure of cell.pptx
Introduction of Human Body & Structure of cell.pptxIntroduction of Human Body & Structure of cell.pptx
Introduction of Human Body & Structure of cell.pptxMedical College
 
User Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather StationUser Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather StationColumbia Weather Systems
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologycaarthichand2003
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024innovationoecd
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensorsonawaneprad
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naJASISJULIANOELYNV
 
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》rnrncn29
 
User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)Columbia Weather Systems
 
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In DubaiDubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubaikojalkojal131
 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPirithiRaju
 
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...Universidade Federal de Sergipe - UFS
 
Organic farming with special reference to vermiculture
Organic farming with special reference to vermicultureOrganic farming with special reference to vermiculture
Organic farming with special reference to vermicultureTakeleZike1
 
bonjourmadame.tumblr.com bhaskar's girls
bonjourmadame.tumblr.com bhaskar's girlsbonjourmadame.tumblr.com bhaskar's girls
bonjourmadame.tumblr.com bhaskar's girlshansessene
 
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingBase editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingNetHelix
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024AyushiRastogi48
 
well logging & petrophysical analysis.pptx
well logging & petrophysical analysis.pptxwell logging & petrophysical analysis.pptx
well logging & petrophysical analysis.pptxzaydmeerab121
 
Observational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsObservational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsSérgio Sacani
 
Thermodynamics ,types of system,formulae ,gibbs free energy .pptx
Thermodynamics ,types of system,formulae ,gibbs free energy .pptxThermodynamics ,types of system,formulae ,gibbs free energy .pptx
Thermodynamics ,types of system,formulae ,gibbs free energy .pptxuniversity
 

Recently uploaded (20)

Introduction of Human Body & Structure of cell.pptx
Introduction of Human Body & Structure of cell.pptxIntroduction of Human Body & Structure of cell.pptx
Introduction of Human Body & Structure of cell.pptx
 
User Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather StationUser Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather Station
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technology
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensor
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by na
 
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
 
AZOTOBACTER AS BIOFERILIZER.PPTX
AZOTOBACTER AS BIOFERILIZER.PPTXAZOTOBACTER AS BIOFERILIZER.PPTX
AZOTOBACTER AS BIOFERILIZER.PPTX
 
User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)
 
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In DubaiDubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
 
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
 
Organic farming with special reference to vermiculture
Organic farming with special reference to vermicultureOrganic farming with special reference to vermiculture
Organic farming with special reference to vermiculture
 
bonjourmadame.tumblr.com bhaskar's girls
bonjourmadame.tumblr.com bhaskar's girlsbonjourmadame.tumblr.com bhaskar's girls
bonjourmadame.tumblr.com bhaskar's girls
 
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingBase editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024
 
well logging & petrophysical analysis.pptx
well logging & petrophysical analysis.pptxwell logging & petrophysical analysis.pptx
well logging & petrophysical analysis.pptx
 
Observational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsObservational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive stars
 
Thermodynamics ,types of system,formulae ,gibbs free energy .pptx
Thermodynamics ,types of system,formulae ,gibbs free energy .pptxThermodynamics ,types of system,formulae ,gibbs free energy .pptx
Thermodynamics ,types of system,formulae ,gibbs free energy .pptx
 
Let’s Say Someone Did Drop the Bomb. Then What?
Let’s Say Someone Did Drop the Bomb. Then What?Let’s Say Someone Did Drop the Bomb. Then What?
Let’s Say Someone Did Drop the Bomb. Then What?
 

Pose Machine

  • 1. Pose Machines Estimating Articulated Pose from Images Robotics Institute Carnegie Mellon University Convolutional Pose Machines. Shih-En Wei, Varun Ramakrishna, Takeo Kanade, and Yaser Sheikh. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016. Pose Machines: Articulated Pose Estimation via Inference Machines. Varun Ramakrishna, Daniel Munoz, Martial Hebert, J.A. Bagnell, Yaser Sheikh. In ECCV 2014 (Oral presentation). 2016/8/11 1
  • 2. Goal: Articulated Pose Estimation 2016/8/11 2
  • 3. Goal: Articulated Pose Estimation https://www.youtube.com/watch?v=Oi_ycvFHd64&index=6&list=PLNh5A7HtLRcpsMfvyG0DED-Dr4zW5Lpcg 2016/8/11 3
  • 4. Goal: Articulated Pose Estimation https://www.youtube.com/watch?v=MsZkLK0Wcmk&list=PLNh5A7HtLRcpsMfvyG0DED-Dr4zW5Lpcg&index=1 2016/8/11 4
  • 5. Which part corresponds to a body part? • Local evidence is weak • Part context is a strong cue • Top-down cues are helpful2016/8/11 5
  • 6. Using Local Image Evidence Multi-ClassClassificationof Patches g1 Image Features 1xz ImageLocation z Input Image handsfeet Requires a high-capacity supervised predictor capable of handling multi-modal data2016/8/11 6
  • 7. Using Local Image Evidence A ClassicalSlidingWindowDetectionPipeline Image Feature Extraction Classification 2016/8/11 7
  • 8. Local Image Evidence is Weak • Certain parts are easier to detect than others head neck l.shoulder l.elbow l.wrist 2016/8/11 8
  • 9. Part Context is a Strong Cue Part detection confidences provide spatial context cues L-ShoulderL-ElbowImage Neck 2016/8/11 9
  • 10. Tree Structures vs Loopy Graphs Tree Structures • Fast and exact inference • Double counting Loopy Graphs • Rich context • Approximate inference 2015/9/11 10
  • 11. Designing Context Representations Context features encode responses of aprevious prediction stage Offset Features Patch Features Image 2016/8/11 L b11
  • 12. Context Features g2 g3 Stage II Stage III Confidence Maps Confidence Maps g1 Context Features Stage I Confidence Maps Stage I Confidence Image Features Head Neck L-Shoulder L-Elbow L-Wrist 2016/8/11 L b12
  • 13. g2g1 Context Features g3 Image Features Context Features Stage I Confidence Maps Stage II Confidence Maps Stage III Confidence Maps Stage II Confidence Head Neck L-Shoulder L-Elbow L-Wrist 2016/8/11 L b13
  • 14. g2g1 g3 Context Features Context Features Stage I Confidence Maps Stage II Confidence Maps Stage III Confidence Maps Image Features Stage III Confidence Head Neck L-Shoulder L-Elbow L-Wrist 2016/8/11 L b14
  • 15. Level 1 parts Level 2 poselet Level 3 full body [Bourdev et al., CVPR2009][Sun et al., CVPR2012] [Duan et al., BMVC 2012][Singh et al., ECCV2012] [Pishchulin et al., CVPR2013] etc. Top Down Cues are Helpful Larger Composite Parts can be Easier to detect 2016/8/11 15
  • 16. 2 gT 1 gT Stage t = (T = 3) Context Features Context Context Features Image Features Features Context Features Context Features Context Features Image Features Image Features Image Features 2g1 L g1 Stage t = 1 1 g1 Level 1 Level 2 LevelL Image Features Image Features Image Features L g2 2 g2 1 g Stage t = 2 Incorporating Hierarchical Cues • Each level of the hierarchy uses a separate predictor • Context features are computed on the outputs of the previous stage • Spatial context information is passed across layers via context features L gT 2016/8/11 16
  • 17. 1g2 1g1 Level 1 1 gT Image Features Image Features Image Features Context Features Context Features Level 2 2g1 L g1 L g2 2g2 Stage t = 1 Stage t = 2 Level I Confidence Maps L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank. L gT 2gT Stage t = (T = 3) Context Features Context Features Context Features Context Features LevelL Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd. StageIStageIIStageIII 2016/8/11 L b17
  • 18. Stage t = 2 Level 2 Confidence Maps StageIStageIIStageIII Head+Sho L.Arm R.Arm Torso L.Leg Bkgd.R.Leg 1g2 1g1 Level 1 1 gT Image Features Image Features Image Features Context Features Context Features Level 2 2g1 L g1 L g2 2g2 Stage t = 1 L gT 2gT Stage t = (T = 3) Context Features Context Features Context Features Context Features LevelL 2016/8/11 L b18
  • 19. Stage t = 2 Level 3 Confidence Maps Torso Bkgd. Stage I Stage II Stage III 1g2 1g1 Level 1 1 gT Image Features Image Features Image Features Context Features Context Features Level 2 2g1 L g1 L g2 2g2 Stage t = 1 L gT 2gT Stage t = (T = 3) Context Features Context Features Context Features Context Features LevelL 2016/8/11 L b19
  • 20. 1g2 1g1 Level 1 1gT Image Features Image Features Image Features Context Features Context Features Level 2 2g1 L g1 L g2 2g2 Stage t = 1 Stage t = 2 L gT 2gT Stage t = (T = 3) Context Features Context Features Context Features Context Features LevelL Fully Connected Model 2016/8/11 L b20
  • 21. Pose Machines Sequential Prediction with Spatial Context Training reduces to training multiple supervised classifiers g2g1 g3 Context Features Context Features Stage I Confidence Maps Stage II Confidence Maps Stage III Confidence MapsImage Features Image Features Image Features No structured loss function No specialized solvers No handcrafted spatial model Spatial model is learned implicitly by the classifiers in a data-driven fashion 2016/8/11 21
  • 22. Learning Feature Representations • Convolutional Architectures for Feature Embedding 2016/8/11 22
  • 23. Learning Context Representations • Large Receptive Fields as a Design Criterion 2016/8/11 23
  • 24. Learning Context Representations • Large Receptive Fields Improve Pose Estimation 2016/8/11 24
  • 25. Convolutional Pose Machines • Designing a Convolutional Architecture 2016/8/11 25
  • 26. Learning • Joint Training with Intermediate Supervision 𝑓𝑡 = − 2 2 Loss: Euclidean distance groundtruth prediction Network without Intermediate Supervision leads vanishing gradients 2016/8/11 26
  • 27. Input Stage 1 Layer 1 Layer 3 Layer 6 4 1 10 3 10 Epoch 10 2 1 10 0 10 Output Layer 18 Stage 2 Layer 7 Layer 9 Layer 12 Layer 13 Stage 3 Layer 15 4 2 10 3 10 Epoch 10 2 1 10 0 10 4 3 10 3 10 Epoch 10 2 1 10 0 10 −0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5 Supervision Supervision Histograms of Gradient Magnitude During Training Supervision Learning Intermediate Supervision Addresses Vanishing Gradients Gradient Magnitude 10 Gradient (× 10−3 ) With Intermediate Supervision Without Intermediate Supervision 0 10 1 10 2 10 3 10 4 Input Image h w 3 5⇥5 C 5⇥5 C 2⇥ 5⇥5 9⇥9 1⇥1 1⇥1 P C C C C 9⇥9 C 9⇥9 13⇥13 13⇥13 15⇥15 1⇥1 1⇥1 C C C C C C 2⇥ P 5⇥5 C 5⇥5 C 5⇥5 C 2⇥ P 2⇥ P Input Image h w 3 h0 w0 P1+1 P1+1 9⇥9 C Loss 1 f 2 Loss 1f 1 x1 1 x1 2 9⇥9 13⇥13 13⇥13 15⇥15 1⇥1 1⇥1 C C C C C C 5⇥5 2⇥ 5⇥5 2⇥ 5⇥5 C P C P C Input Image h w 3 h0 w0 P1+1 Loss 1f 3 x1 2 h0 w0 Stage 3, level 1Stage 2, level 1Stage 1, level 1 2016/8/11 27
  • 28. Input Layer 1 Output Layer 18 10 0 10 1 10 2 10 3 10 4 Epoch1 Stage 1 Layer 3 Layer 6 Layer 7 Stage 2 Layer 9 Layer 12 Layer 13 Stage 3 Layer 15 10 0 10 1 10 2 10 3 10 4 Epoch2 −0.5 0.0 0.5 10 0 10 1 10 2 10 3 10 4 Epoch3 −0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5 −0.5 0.0 0.5 Histograms of Gradient Magnitude During Training Supervision Supervision Supervision Input Image h w 3 5⇥5 C 5⇥5 C 2⇥ 5⇥5 9⇥9 1⇥1 1⇥1 P C C C C 9⇥9 C 9⇥9 13⇥13 13⇥13 15⇥15 1⇥1 1⇥1 C C C C C C 2⇥ P 5⇥5 C 5⇥5 C 5⇥5 C 2⇥ P 2⇥ P Input Image h w 3 h0 w0 P1+1 P1+1 9⇥9 C Loss 1 f 2 Loss 1f 1 x1 1 x1 2 9⇥9 13⇥13 13⇥13 15⇥15 1⇥1 1⇥1 C C C C C C 5⇥5 2⇥ 5⇥5 2⇥ 5⇥5 C P C P C Input Image h w 3 h0 w0 P1+1 Loss 1f 3 x1 2 h0 w0 Gradient (× 10−3 ) With Intermediate Supervision Without Intermediate Supervision Stage 3, level 1Stage 2, level 1Stage 1, level 1 Learning Intermediate Supervision Addresses Vanishing Gradients 2016/8/11 28
  • 29. 0 0 Detectionrate% (i) With Intermediate Supervision (IS) (ii) Stagewise (iii) IS + Stagewise Pretrain (iv) Without Intermediate Supervision 0.05 0.1 0.15 0.2 Normalized distance 100 90 80 70 60 50 40 30 20 10 PCK total, LSP OC Learning Comparison of Learning Methods 2016/8/11 29
  • 31. Evaluation Qualitative Examples on LEEDS (Person-centric) 2016/8/11 L b31
  • 32. Evaluation Qualitative Examples on MPI (Person-centric) 2016/8/11 L b32
  • 33. Resolving Symmetric Confusions LeftRight t = 1 t = 2 t = 3 Wrists 2016/8/11 L b33
  • 34. Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd. Predicted Pose Level 1 Part Confidences L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank. Stage II Stage I Stage III Ablative Spatial Analysis 2016/8/11 34
  • 35. Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd. Predicted Pose Stage II Stage I Stage III Level 1 Part Confidences L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank. Predicted confidences are resilient to missing context (of one part) Context from the confidence map of head is removed Ablative Spatial Analysis 2016/8/11 35
  • 36. Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd. Predicted Pose Level 1 Part Confidences L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank. Predicted confidences are resilient to missing context (of one part) Stage II Stage I Stage III Ablative Spatial Analysis 2016/8/11 36
  • 37. Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd. Predicted Pose Level 1 Part Confidences L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank. Predicted confidences are resilient to missing context (of one part) Stage II Stage I Stage III Ablative Spatial Analysis 2016/8/11 37
  • 38. Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd. Level 1 Part Confidences L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank. Predicted confidences are resilient to missing context (of one part) StageIIStageIStageIII Predicted Pose Ablative Spatial Analysis 2016/8/11 38
  • 39. Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd. Predicted Pose Level 1 Part Confidences L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank. Predicted confidences are resilient to missing context (of one part) Stage II Stage I Stage III Ablative Spatial Analysis 2016/8/11 39
  • 40. Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd. Predicted Pose Level 1 Part Confidences L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank. Predicted confidences are resilient to missing context (of one part) Stage II Stage I Stage III Ablative Spatial Analysis 2016/8/11 40
  • 41. Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd. Predicted Pose Level 1 Part Confidences L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank. Predicted confidences are resilient to missing context (of one part) Stage II Stage I Stage III Ablative Spatial Analysis 2016/8/11 41
  • 42. Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd. Predicted Pose Level 1 Part Confidences L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank. Predicted confidences are resilient to missing context (of one part) Stage II Stage I Stage III Ablative Spatial Analysis 2016/8/11 42
  • 43. Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd. Predicted Pose Level 1 Part Confidences L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank. Predicted confidences are resilient to missing context (of one part) Stage II Stage I Stage III Ablative Spatial Analysis 2016/8/11 43
  • 44. Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd. Predicted Pose Level 1 Part Confidences L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank. Predicted confidences are resilient to missing context (of one part) Stage II Stage I Stage III Ablative Spatial Analysis 2016/8/11 44
  • 45. Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd. Predicted Pose Level 1 Part Confidences L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank. Predicted confidences are resilient to missing context (of one part) Stage II Stage I Stage III Ablative Spatial Analysis 2016/8/11 45
  • 46. Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd. Predicted Pose Level 1 Part Confidences L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank. Predicted confidences are resilient to missing context (of one part) Stage II Stage I Stage III Ablative Spatial Analysis 2016/8/11 46
  • 47. Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd. Predicted Pose Level 1 Part Confidences L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank. Predicted confidences are resilient to missing context (of one part) Stage II Stage I Stage III Ablative Spatial Analysis 2016/8/11 47
  • 48. Head Neck L.Sho. L.Elb. R.Hip R.Knee R.Ank. Bkgd. Predicted Pose Level 1 Part Confidences L.Wri. R.Sho. R.Elb. R.Wri. L.Hip L.Knee L.Ank. Predicted confidences are resilient to missing context (of one part) Stage II Stage I Stage III Ablative Spatial Analysis 2016/8/11 48
  • 49. 0 0.05 0.1 0.15 Normalized distance 0.2 0 0 100 90 80 70 60 50 40 30 20 10 Detectionrate% Ours 3−Stage 2−Level Tompson et al., CVPR’15 Tompson et al., NIPS’14 Chen&Yullie, NIPS’14 Toshev et al., CVPR’14 Sapp et al., CVPR’13 Evaluation PCK Performance Comparison on FLIC dataset PCK wrist, FLIC 0.05 0.1 0.15 Normalized distance 0.2 PCK elbow, FLIC 2016/8/11 49
  • 50. 0 0.05 0.1 0.15 Normalized distance Ours 3−Stage 2−Level 0.2 0 0 100 90 80 70 60 50 40 30 20 10 PCK total, LSP PC Detectionrate% Tompson et al., NIPS’14 Pishchulin et al., ICCV’13 Chen&Yuille, NIPS’14 Wang et al., CVPR’13 0.05 0.1 0.15 0.2 0 Normalized distance 0.05 0.1 0.15 0.2 0 Normalized distance PCK wrist&elbow, LSP PC 0.05 0.1 0.15 0.2 0 Normalized distance PCK knee, LSP PC 0.05 0.1 0.15 0.2 PCK ankle, LSP PC Normalized distance PCK hip, LSP PC Evaluation PCK Performance Comparison on LEEDS dataset (Person-centric) 2016/8/11 50