SlideShare a Scribd company logo
1 of 44
Depth-Based Real Time Head Motion Tracking 
Using 3D Template Matching 
以深度影像及三維樣板比對為基礎之即時頭部 
運動軌跡追蹤 
學生:屠愚 
指導教授:歐陽明博士 
1
Outline 
• Introduction 
• System Overview 
• Algorithm 
• Results & System Demo 
• Conclusion & Future Work 
2
3 
Introducti 
on
What’s Head Motion Tracking 
4 
Y-axis 
Yaw 
Pitch 
Roll 
Y translation 
Real-time reconstruction of this 6-DoF motion vector 
given a stream of video input.
5 
Why Head Motion Tracking 
GVAainzdDeiem oOa anGrtciaieionmngtea tion
Related Work 
• Color Image Based Methods 
“Head Pose Estimation in Computer Vision : A Survey”, 
E. Murphy-Chutorian and M. M. Trivedi., PAMI 09 
• Appearance Template Methods 
• Feature Tracking Methods 
• Detector Arrays 
• Nonlinear Regression Methods 
• Manifold Embedding Methods 
• Flexible Models 
• Geometric Methods 
• Hybrid Methods 
• Too sensitive to illumination variations! 6
• SwissRanger SR4000 
– Messa Image (2006) 
– $9000 
• CameCube 2.0 
– PMD Technology (2002) 
– $12000 
7 
Depth Camera
• Kinect by Microsoft 
– $149.99 
• Xtion Pro by Asus 
– $189 . 
– $300 . 
8 
Depth Camera
9 
Related Work 
“Real-time performance-based facial animation” 
T. Weise et al., SIGGRAPH 2011 
“Real Time Head Pose Estimation with Random Regression 
Forests”, 
G. Fanelli et al., CVPR 2011
10 
System 
Overview
11
12
Flow Chart 
Least Square Error Method 
Inverse Rotation 
Nose Detection 
Sampling 
Iterative Optimization Method 
User Acting Avatar Control 
13 
Depth Data 
Acquisition Real-time Head Pose Estimation
Algorithm 
Method 1 :Least square 
error method 
14
15 
Nose Detection
Z-axis 
16 
Nose Detection 
Zmin 
Zmin+5
17 
Nose Detection
Nose Detection 
18
Inverse Rotation 
19
Nose Tracking Flow Chart 
20 
Nose Detection 
Nose푡 푥, 푦 
Nose푡−1 푥, 푦 , 
휃푡−1, 휓푡−1, 휙푡−1, 
Depth Map푡 
Searching 
Window 
Setting 
Inverse 
Rotation 
z-min 
Searching
Least Square Plane Fitting 
21 
Yaw & Pitch ! 
Ax+By+Cz=1
Least Square Ellipse Fitting 
23 
Roll !
Dynamic weighted average filter 
0.01 
0.1 
1 
10 
퐻 ∙ 푚푎푥푙∈[1,푘] 푡푖 − 푡푖−푙 
∗ = 
푘 푤푗 푡푖−푗 
푗=0 
푘 푤푗 
“Real-time performance-based facial animation” T. Weise et al., Siggraph 2011 
24 
Flickering Issue 
푛표푟푚푎푙푖푧푒푑 
푤푒푖푔ℎ푡 휔 
푓푟푎푚푒 
푖 푖-1 푖-2 푖-3 푖-4 
푤푗 = 푒−푡 푗∙퐻∙푚푎푥푙∈[1,푘] 푡푖−푡푖−푙 푖 
푗=0
Algorithm 
Method 2 :Optimization 
method 
25
26 
User Acting 
Flow Chart 
Least Square Error Method 
Inverse Rotation 
Nose Detection 
Sampling 
Depth Data 
Iterative Optimization Method 
Acquisition 
Real-time Head Pose Estimation 
Avatar Control
27 
Model Point Cloud
28 
Sample Point Cloud
• 푚표푡푖표푛 푣푒푐푡표푟 푎 Θ, 휓, 휙, 푡푥 , 푡푦, 푡푧 
푝∈푃푆 min(푑 푝′, 푃푀 ) 
• 퐸 푎, 푃푆, 푃푀 = 푝 
• 푅푒푠푢푙푡 = 푎∗ = arg 푚푖푛 (퐸) 
29 
Energy Function
30 
Optimization 
• Gradient Decent Algorithm 
푎푘+1 = 푎푘 − 훼푘훻퐸(푎푘 ) 
Non-differentiable 
• 12 Possible Gradients 
훻1 = 
1 
0 
0 
0 
0 
0 
, 훻2 = 
−1 
0 
0 
0 
0 
0 
, 훻3 = 
0 
1 
0 
0 
0 
0 
, 훻4 = 
0 
−1 
0 
0 
0 
0 
, 훻5 = 
0 
0 
1 
0 
0 
0 
, 훻6 = 
0 
0 
−1 
0 
0 
0 
, 
훻7 = 
0 
0 
0 
1 
0 
0 
, 훻8 = 
0 
0 
0 
−1 
0 
0 
, 훻9 = 
0 
0 
0 
0 
1 
0 
, 훻10 = 
0 
0 
0 
0 
−1 
0 
, 훻11 = 
0 
0 
0 
0 
0 
1 
, 훻12 = 
0 
0 
0 
0 
0 
−1
31 
Optimization
Results 
Of Method 1 
32
33
34
35
Results 
Of Method 2 
36
37
38
39
Our Method with 50% Database (Without Bad Initialization) 
40 
Our Method with Whole Database 
Our Method with 70% Database (Without Crash)
41
System 
Demo 
42
Conclusion 
43
• Contribution: 
– Two novel algorithms. 
– Not affected by varying lighting conditions. 
– Real-time responses without GPU acceleration. 
– Outperforms the state-of-the-art approach (G. Fanelli 
et al., CVPR 2011). 
• Future Work: 
– Tracking without nose. 
– Combine with NITE skeleton tracking. 
– Facial Expression Recognition / Retargeting. 
44
Thank You 
Very Much 
45

More Related Content

Similar to Depth-Based Real Time Head Motion Tracking Using 3D Template Matching

Moving object detection in video surveillance
Moving object detection in video surveillanceMoving object detection in video surveillance
Moving object detection in video surveillanceAshfaqul Haque John
 
Wearable Accelerometer Optimal Positions for Human Motion Recognition(LifeTec...
Wearable Accelerometer Optimal Positions for Human Motion Recognition(LifeTec...Wearable Accelerometer Optimal Positions for Human Motion Recognition(LifeTec...
Wearable Accelerometer Optimal Positions for Human Motion Recognition(LifeTec...sugiuralab
 
Position estimation in kalman filter
Position estimation in kalman filterPosition estimation in kalman filter
Position estimation in kalman filterfreeshah2010
 
Human computer interaction_ 23CSM1R19.pptx
Human computer interaction_ 23CSM1R19.pptxHuman computer interaction_ 23CSM1R19.pptx
Human computer interaction_ 23CSM1R19.pptxsp23csm1r19
 
Detection and recognition of face using neural network
Detection and recognition of face using neural networkDetection and recognition of face using neural network
Detection and recognition of face using neural networkSmriti Tikoo
 
Human Action Recognition Using 3D Joint Information and HOOFD Features
Human Action Recognition Using 3D Joint Information and HOOFD FeaturesHuman Action Recognition Using 3D Joint Information and HOOFD Features
Human Action Recognition Using 3D Joint Information and HOOFD FeaturesBarış Üstündağ
 
Google Glass, The META and Co. - How to calibrate your Optical See-Through He...
Google Glass, The META and Co. - How to calibrate your Optical See-Through He...Google Glass, The META and Co. - How to calibrate your Optical See-Through He...
Google Glass, The META and Co. - How to calibrate your Optical See-Through He...Jens Grubert
 
Tracking Robustness and Green View Index Estimation of Augmented and Diminish...
Tracking Robustness and Green View Index Estimation of Augmented and Diminish...Tracking Robustness and Green View Index Estimation of Augmented and Diminish...
Tracking Robustness and Green View Index Estimation of Augmented and Diminish...Tomohiro Fukuda
 
CHI'16 Journal "A Mouse With Two Optical Sensors That Eliminates Coordinate D...
CHI'16 Journal "A Mouse With Two Optical Sensors That Eliminates Coordinate D...CHI'16 Journal "A Mouse With Two Optical Sensors That Eliminates Coordinate D...
CHI'16 Journal "A Mouse With Two Optical Sensors That Eliminates Coordinate D...Byungjoo Lee
 
Efficient architecture to condensate visual information driven by attention ...
Efficient architecture to condensate visual information driven by attention ...Efficient architecture to condensate visual information driven by attention ...
Efficient architecture to condensate visual information driven by attention ...Sara Granados Cabeza
 
AI+ Remote Sensing: Applying Deep Learning to Image Enhancement, Analytics, a...
AI+ Remote Sensing: Applying Deep Learning to Image Enhancement, Analytics, a...AI+ Remote Sensing: Applying Deep Learning to Image Enhancement, Analytics, a...
AI+ Remote Sensing: Applying Deep Learning to Image Enhancement, Analytics, a...Jui-Hsin (Larry) Lai
 
Eva Mohedano, "Investigating EEG for Saliency and Segmentation Applications i...
Eva Mohedano, "Investigating EEG for Saliency and Segmentation Applications i...Eva Mohedano, "Investigating EEG for Saliency and Segmentation Applications i...
Eva Mohedano, "Investigating EEG for Saliency and Segmentation Applications i...Universitat Politècnica de Catalunya
 
Human Behavior Understanding: From Human-Oriented Analysis to Action Recognit...
Human Behavior Understanding: From Human-Oriented Analysis to Action Recognit...Human Behavior Understanding: From Human-Oriented Analysis to Action Recognit...
Human Behavior Understanding: From Human-Oriented Analysis to Action Recognit...Wanjin Yu
 
Face Identification for Humanoid Robot
Face Identification for Humanoid RobotFace Identification for Humanoid Robot
Face Identification for Humanoid Robotthomaswangxin
 
Comparison of Matrix Completion Algorithms for Background Initialization in V...
Comparison of Matrix Completion Algorithms for Background Initialization in V...Comparison of Matrix Completion Algorithms for Background Initialization in V...
Comparison of Matrix Completion Algorithms for Background Initialization in V...ActiveEon
 
IRJET- Face Detection and Tracking Algorithm using Open CV with Raspberry Pi
IRJET- Face Detection and Tracking Algorithm using Open CV with Raspberry PiIRJET- Face Detection and Tracking Algorithm using Open CV with Raspberry Pi
IRJET- Face Detection and Tracking Algorithm using Open CV with Raspberry PiIRJET Journal
 
Deep learning based gaze detection system for automobile drivers using nir ca...
Deep learning based gaze detection system for automobile drivers using nir ca...Deep learning based gaze detection system for automobile drivers using nir ca...
Deep learning based gaze detection system for automobile drivers using nir ca...Jaey Jeong
 
Eye Tracking Based Human - Computer Interaction
Eye Tracking Based Human - Computer InteractionEye Tracking Based Human - Computer Interaction
Eye Tracking Based Human - Computer InteractionSharath Raj
 

Similar to Depth-Based Real Time Head Motion Tracking Using 3D Template Matching (20)

Moving object detection in video surveillance
Moving object detection in video surveillanceMoving object detection in video surveillance
Moving object detection in video surveillance
 
Wearable Accelerometer Optimal Positions for Human Motion Recognition(LifeTec...
Wearable Accelerometer Optimal Positions for Human Motion Recognition(LifeTec...Wearable Accelerometer Optimal Positions for Human Motion Recognition(LifeTec...
Wearable Accelerometer Optimal Positions for Human Motion Recognition(LifeTec...
 
Position estimation in kalman filter
Position estimation in kalman filterPosition estimation in kalman filter
Position estimation in kalman filter
 
Human computer interaction_ 23CSM1R19.pptx
Human computer interaction_ 23CSM1R19.pptxHuman computer interaction_ 23CSM1R19.pptx
Human computer interaction_ 23CSM1R19.pptx
 
Detection and recognition of face using neural network
Detection and recognition of face using neural networkDetection and recognition of face using neural network
Detection and recognition of face using neural network
 
Human Action Recognition Using 3D Joint Information and HOOFD Features
Human Action Recognition Using 3D Joint Information and HOOFD FeaturesHuman Action Recognition Using 3D Joint Information and HOOFD Features
Human Action Recognition Using 3D Joint Information and HOOFD Features
 
Google Glass, The META and Co. - How to calibrate your Optical See-Through He...
Google Glass, The META and Co. - How to calibrate your Optical See-Through He...Google Glass, The META and Co. - How to calibrate your Optical See-Through He...
Google Glass, The META and Co. - How to calibrate your Optical See-Through He...
 
Tracking Robustness and Green View Index Estimation of Augmented and Diminish...
Tracking Robustness and Green View Index Estimation of Augmented and Diminish...Tracking Robustness and Green View Index Estimation of Augmented and Diminish...
Tracking Robustness and Green View Index Estimation of Augmented and Diminish...
 
CHI'16 Journal "A Mouse With Two Optical Sensors That Eliminates Coordinate D...
CHI'16 Journal "A Mouse With Two Optical Sensors That Eliminates Coordinate D...CHI'16 Journal "A Mouse With Two Optical Sensors That Eliminates Coordinate D...
CHI'16 Journal "A Mouse With Two Optical Sensors That Eliminates Coordinate D...
 
final ppt
final pptfinal ppt
final ppt
 
Efficient architecture to condensate visual information driven by attention ...
Efficient architecture to condensate visual information driven by attention ...Efficient architecture to condensate visual information driven by attention ...
Efficient architecture to condensate visual information driven by attention ...
 
AI+ Remote Sensing: Applying Deep Learning to Image Enhancement, Analytics, a...
AI+ Remote Sensing: Applying Deep Learning to Image Enhancement, Analytics, a...AI+ Remote Sensing: Applying Deep Learning to Image Enhancement, Analytics, a...
AI+ Remote Sensing: Applying Deep Learning to Image Enhancement, Analytics, a...
 
Eva Mohedano, "Investigating EEG for Saliency and Segmentation Applications i...
Eva Mohedano, "Investigating EEG for Saliency and Segmentation Applications i...Eva Mohedano, "Investigating EEG for Saliency and Segmentation Applications i...
Eva Mohedano, "Investigating EEG for Saliency and Segmentation Applications i...
 
Human Behavior Understanding: From Human-Oriented Analysis to Action Recognit...
Human Behavior Understanding: From Human-Oriented Analysis to Action Recognit...Human Behavior Understanding: From Human-Oriented Analysis to Action Recognit...
Human Behavior Understanding: From Human-Oriented Analysis to Action Recognit...
 
Face Identification for Humanoid Robot
Face Identification for Humanoid RobotFace Identification for Humanoid Robot
Face Identification for Humanoid Robot
 
20211118 AI+ Remote Sensing
20211118 AI+ Remote Sensing20211118 AI+ Remote Sensing
20211118 AI+ Remote Sensing
 
Comparison of Matrix Completion Algorithms for Background Initialization in V...
Comparison of Matrix Completion Algorithms for Background Initialization in V...Comparison of Matrix Completion Algorithms for Background Initialization in V...
Comparison of Matrix Completion Algorithms for Background Initialization in V...
 
IRJET- Face Detection and Tracking Algorithm using Open CV with Raspberry Pi
IRJET- Face Detection and Tracking Algorithm using Open CV with Raspberry PiIRJET- Face Detection and Tracking Algorithm using Open CV with Raspberry Pi
IRJET- Face Detection and Tracking Algorithm using Open CV with Raspberry Pi
 
Deep learning based gaze detection system for automobile drivers using nir ca...
Deep learning based gaze detection system for automobile drivers using nir ca...Deep learning based gaze detection system for automobile drivers using nir ca...
Deep learning based gaze detection system for automobile drivers using nir ca...
 
Eye Tracking Based Human - Computer Interaction
Eye Tracking Based Human - Computer InteractionEye Tracking Based Human - Computer Interaction
Eye Tracking Based Human - Computer Interaction
 

Recently uploaded

Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 

Recently uploaded (20)

Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 

Depth-Based Real Time Head Motion Tracking Using 3D Template Matching

  • 1. Depth-Based Real Time Head Motion Tracking Using 3D Template Matching 以深度影像及三維樣板比對為基礎之即時頭部 運動軌跡追蹤 學生:屠愚 指導教授:歐陽明博士 1
  • 2. Outline • Introduction • System Overview • Algorithm • Results & System Demo • Conclusion & Future Work 2
  • 4. What’s Head Motion Tracking 4 Y-axis Yaw Pitch Roll Y translation Real-time reconstruction of this 6-DoF motion vector given a stream of video input.
  • 5. 5 Why Head Motion Tracking GVAainzdDeiem oOa anGrtciaieionmngtea tion
  • 6. Related Work • Color Image Based Methods “Head Pose Estimation in Computer Vision : A Survey”, E. Murphy-Chutorian and M. M. Trivedi., PAMI 09 • Appearance Template Methods • Feature Tracking Methods • Detector Arrays • Nonlinear Regression Methods • Manifold Embedding Methods • Flexible Models • Geometric Methods • Hybrid Methods • Too sensitive to illumination variations! 6
  • 7. • SwissRanger SR4000 – Messa Image (2006) – $9000 • CameCube 2.0 – PMD Technology (2002) – $12000 7 Depth Camera
  • 8. • Kinect by Microsoft – $149.99 • Xtion Pro by Asus – $189 . – $300 . 8 Depth Camera
  • 9. 9 Related Work “Real-time performance-based facial animation” T. Weise et al., SIGGRAPH 2011 “Real Time Head Pose Estimation with Random Regression Forests”, G. Fanelli et al., CVPR 2011
  • 11. 11
  • 12. 12
  • 13. Flow Chart Least Square Error Method Inverse Rotation Nose Detection Sampling Iterative Optimization Method User Acting Avatar Control 13 Depth Data Acquisition Real-time Head Pose Estimation
  • 14. Algorithm Method 1 :Least square error method 14
  • 16. Z-axis 16 Nose Detection Zmin Zmin+5
  • 20. Nose Tracking Flow Chart 20 Nose Detection Nose푡 푥, 푦 Nose푡−1 푥, 푦 , 휃푡−1, 휓푡−1, 휙푡−1, Depth Map푡 Searching Window Setting Inverse Rotation z-min Searching
  • 21. Least Square Plane Fitting 21 Yaw & Pitch ! Ax+By+Cz=1
  • 22. Least Square Ellipse Fitting 23 Roll !
  • 23. Dynamic weighted average filter 0.01 0.1 1 10 퐻 ∙ 푚푎푥푙∈[1,푘] 푡푖 − 푡푖−푙 ∗ = 푘 푤푗 푡푖−푗 푗=0 푘 푤푗 “Real-time performance-based facial animation” T. Weise et al., Siggraph 2011 24 Flickering Issue 푛표푟푚푎푙푖푧푒푑 푤푒푖푔ℎ푡 휔 푓푟푎푚푒 푖 푖-1 푖-2 푖-3 푖-4 푤푗 = 푒−푡 푗∙퐻∙푚푎푥푙∈[1,푘] 푡푖−푡푖−푙 푖 푗=0
  • 24. Algorithm Method 2 :Optimization method 25
  • 25. 26 User Acting Flow Chart Least Square Error Method Inverse Rotation Nose Detection Sampling Depth Data Iterative Optimization Method Acquisition Real-time Head Pose Estimation Avatar Control
  • 26. 27 Model Point Cloud
  • 28. • 푚표푡푖표푛 푣푒푐푡표푟 푎 Θ, 휓, 휙, 푡푥 , 푡푦, 푡푧 푝∈푃푆 min(푑 푝′, 푃푀 ) • 퐸 푎, 푃푆, 푃푀 = 푝 • 푅푒푠푢푙푡 = 푎∗ = arg 푚푖푛 (퐸) 29 Energy Function
  • 29. 30 Optimization • Gradient Decent Algorithm 푎푘+1 = 푎푘 − 훼푘훻퐸(푎푘 ) Non-differentiable • 12 Possible Gradients 훻1 = 1 0 0 0 0 0 , 훻2 = −1 0 0 0 0 0 , 훻3 = 0 1 0 0 0 0 , 훻4 = 0 −1 0 0 0 0 , 훻5 = 0 0 1 0 0 0 , 훻6 = 0 0 −1 0 0 0 , 훻7 = 0 0 0 1 0 0 , 훻8 = 0 0 0 −1 0 0 , 훻9 = 0 0 0 0 1 0 , 훻10 = 0 0 0 0 −1 0 , 훻11 = 0 0 0 0 0 1 , 훻12 = 0 0 0 0 0 −1
  • 32. 33
  • 33. 34
  • 34. 35
  • 36. 37
  • 37. 38
  • 38. 39
  • 39. Our Method with 50% Database (Without Bad Initialization) 40 Our Method with Whole Database Our Method with 70% Database (Without Crash)
  • 40. 41
  • 43. • Contribution: – Two novel algorithms. – Not affected by varying lighting conditions. – Real-time responses without GPU acceleration. – Outperforms the state-of-the-art approach (G. Fanelli et al., CVPR 2011). • Future Work: – Tracking without nose. – Combine with NITE skeleton tracking. – Facial Expression Recognition / Retargeting. 44
  • 44. Thank You Very Much 45

Editor's Notes

  1. Messa Image 2006, Swiss Ranger 9000
  2. 3DV 2000 release zcam, 2009/3 brought by M$