GESTURE RECOGNITION:
VIRTUALITY AND REALITY
OLEKSANDR BAIEV
PHD
SR. ENGINEER AT SAMSUNG R&D UKRAINE
AGENDA
• Domain and current devices
• Hardware details
• Data processing tricks
• Hand Localization
• Joint coordinate reconstruction
• Skeleton recovery
WHY WE SHOULD RECONSTRUCT GESTURES
CURRENT SOLUTIONS
Leap Motion
MS Kinect
Intel RealSense
Pebbles Interfaces
MS HoloLens
WHERE IS DATA SCIENCE
Stereo cameras
IR projector/camera
ToF camera
Other sensors
RAW images
Structured-light images
Depth image
Voodoo is here
Hand skeleton
HOW TO GET DEPTH
Stereo images:
1. Capture several images
2. Calculate disparities
3. Get depth
IR projector/camera:
1. Project structured light
2. Evaluate the distortion of the structure
3. Get depth
ToF sensor:
1. Measure the light's time of flight
2. Get depth
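The "get depth" step for the stereo and ToF cases can be sketched with the standard textbook relations: depth from disparity on a rectified stereo pair, Z = f * B / d, and depth from a round-trip time, Z = c * t / 2. This is a minimal illustration, not any device's actual pipeline; the function names and sample numbers are assumptions made up for the example.

```python
# Sketch only: function names and sample values are illustrative assumptions.
SPEED_OF_LIGHT_M_S = 299_792_458.0


def depth_from_disparity(disparity_px, focal_length_px, baseline_m):
    """Rectified stereo pair: Z = f * B / d (f and d in pixels, B in metres)."""
    if disparity_px <= 0:
        # No measurable shift between the two views: treat as infinitely far.
        return float("inf")
    return focal_length_px * baseline_m / disparity_px


def depth_from_time_of_flight(round_trip_s):
    """ToF: the light travels to the surface and back, so halve the path."""
    return SPEED_OF_LIGHT_M_S * round_trip_s / 2.0


# A 50 px disparity with a 500 px focal length and a 6 cm baseline.
stereo_depth_m = depth_from_disparity(50.0, 500.0, 0.06)
# A 4 ns round trip.
tof_depth_m = depth_from_time_of_flight(4e-9)
print(stereo_depth_m, tof_depth_m)
```

Both sample measurements land at roughly 0.6 m, a typical distance for a hand in front of a sensor.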
THE VALUE OF EACH PIXEL IS THE DISTANCE TO THAT POINT IN THE IMAGE
Hand localization
Recovering joint coordinates
Hand skeleton reconstruction
Tompson et al. 2014. Real-Time Continuous Pose Recovery of Human Hands Using Convolutional Networks
TASK 1. LOCALIZATION: LET'S FIND A HAND
Use a random forest for pixel-by-pixel background subtraction
$$ I\left(u + \frac{\Delta u}{I(u,v)},\; v + \frac{\Delta v}{I(u,v)}\right) - I(u,v) \ge d_t $$
Shotton et al. 2011. Real-Time Human Pose Recognition in Parts from Single Depth Images
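The depth-comparison test used at each node of the forest can be sketched as follows. This is a minimal illustration under stated assumptions: the depth image is a row-major list of lists indexed as `depth[v][u]`, every pixel holds a positive depth, probes that fall outside the image read a large `background` value, and the helper names are hypothetical. Training the forest itself is not shown.

```python
def depth_feature(depth, u, v, du, dv, background=10.0):
    """I(u + du / I(u, v), v + dv / I(u, v)) - I(u, v).

    Dividing the offset by the depth at (u, v) makes the probe cover
    roughly the same physical distance whether the hand is near or far.
    """
    d = depth[v][u]
    pu = int(round(u + du / d))
    pv = int(round(v + dv / d))
    inside = 0 <= pv < len(depth) and 0 <= pu < len(depth[0])
    probe = depth[pv][pu] if inside else background
    return probe - d


def passes_split(depth, u, v, du, dv, d_t):
    """Node test: compare the feature against the learned threshold d_t."""
    return depth_feature(depth, u, v, du, dv) >= d_t


# Toy 4x4 depth image: a "hand" at 0.5 m in front of a wall at 2.0 m.
depth = [
    [2.0, 2.0, 2.0, 2.0],
    [2.0, 0.5, 0.5, 2.0],
    [2.0, 0.5, 0.5, 2.0],
    [2.0, 2.0, 2.0, 2.0],
]
print(depth_feature(depth, 1, 1, 1.0, 0.0))  # probe lands on the wall
print(depth_feature(depth, 1, 1, 0.5, 0.0))  # probe stays on the hand
```

A probe that jumps from the hand to the wall gives a large positive difference, while a probe that stays on the hand gives a difference near zero, which is what lets the threshold separate hand pixels from background.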
TASK 2. REGRESSION: LET'S FIND EACH JOINT
Coordinates of each joint as the output of a CNN
Trick #1: multiscale convolutions
Trick #2: heat-maps as output
Tompson et al. 2014. Real-Time Continuous Pose Recovery of Human Hands Using Convolutional Networks
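Reading joint coordinates back out of heat-maps can be sketched as below. This is a simplification that assumes one heat-map per joint stored as a row-major list of lists and takes the plain maximum of each map; the function names and the `scale` parameter (the ratio between input-image and heat-map resolution) are assumptions for the example, and a real system may refine the peak further.

```python
def heatmap_peak(heatmap):
    """Return (u, v, score) of the strongest cell in a row-major heat-map."""
    best = (0, 0, heatmap[0][0])
    for v, row in enumerate(heatmap):
        for u, score in enumerate(row):
            if score > best[2]:
                best = (u, v, score)
    return best


def joints_from_heatmaps(heatmaps, scale=1.0):
    """One heat-map per joint; scale maps heat-map cells to image pixels."""
    return [(u * scale, v * scale) for u, v, _ in map(heatmap_peak, heatmaps)]


# Two toy 3x3 heat-maps, one per joint.
heatmaps = [
    [[0.0, 0.1, 0.0], [0.2, 0.9, 0.1], [0.0, 0.1, 0.0]],
    [[0.0, 0.0, 0.7], [0.0, 0.1, 0.2], [0.0, 0.0, 0.0]],
]
print(joints_from_heatmaps(heatmaps, scale=4.0))
```

Predicting a map per joint instead of a raw coordinate pair keeps a confidence value for every location, which the later skeleton step can use.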
TASK 3. INFERENCE: FIND THE SKELETON
Find the joint positions that maximize the sum of the corresponding heat-map values, using the distances between joints as a set of constraints
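One way to picture this constrained search is an exhaustive check over a few candidate peaks per joint. This is a toy stand-in chosen for clarity, not the optimisation used in the talk or in the cited papers; the candidate lists, the `bones` dictionary of expected joint-to-joint distances, and the `tolerance` parameter are all assumptions made up for the example.

```python
from itertools import product
from math import dist


def best_skeleton(candidates, bones, tolerance):
    """Pick one candidate per joint, maximizing the summed heat-map score.

    candidates[j] is a list of (u, v, score) peaks for joint j.
    bones maps a joint pair (a, b) to the expected distance between them.
    A combination is valid only if every bone length is within tolerance.
    """
    best_score, best_pose = float("-inf"), None
    for combo in product(*candidates):
        valid = all(
            abs(dist(combo[a][:2], combo[b][:2]) - length) <= tolerance
            for (a, b), length in bones.items()
        )
        score = sum(c[2] for c in combo)
        if valid and score > best_score:
            best_score, best_pose = score, [c[:2] for c in combo]
    return best_pose, best_score


# Joint 1 has a stronger peak at (10, 0), but it violates the bone length.
candidates = [
    [(0, 0, 0.9)],
    [(3, 4, 0.6), (10, 0, 0.8)],
]
bones = {(0, 1): 5.0}
pose, score = best_skeleton(candidates, bones, tolerance=0.5)
print(pose, score)
```

The example shows why the constraints matter: the highest heat-map value alone would place the second joint too far from the first, so the weaker but anatomically plausible peak wins. The exhaustive loop grows exponentially with the number of joints, so it only suits a handful of candidates.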
• Accurate prediction
• Real-time solution
• Works on standard equipment