SlideShare a Scribd company logo
VIDEO BASED HUMAN INTERACTION 4D
TOUCH PAD
NAYANA JOSHI,
Dept. of IS
OVERVIEW :
• What is VBHI ?
• Introduction to 4D Touch pad.
• Structure & working of 4D ……
• Using computer vision in human-computer
interaction systems has become a popular
approach to enhance current interfaces.
• The majority of techniques that use vision rely
on global user tracking and modeling.
• But VBHI, watches a region-of-interest (ROI) in
the video stream and waits for recognizable
user-input; i.e. interaction is site-centric.
VIDEO BASED HUMAN INTERACTION ?
VIDEO BASED HUMAN INTERACTION ? Cont…
• User can give the input in the form of gestures
in the perceptual space which is governed by
its interaction model.
• In simplest case, tracked hand motion and
gesture recognition could replace mouse in
traditional applications.
Comparing conventional VBI with the VICs
approach
INTRODUCTION TO 4D
• We live in a world of three dimensions.
• 2D and 3D are very familiar to all which contains
only the spatial dimension (length, width, height).
• But the concept of 4D is new. The fourth
dimension can be:
• Fragnance
• Sense of touch
• Time
Introduction Cont . . .
• Here the concept of 4D combines space and time
within a single coordinate system, typically with three
spatial dimensions and one temporal dimension(time).
• In spacetime, a coordinate grid that spans the 3+1
dimensions locates events i.e you have where and
when something is.
• Example: Consider an ant at a point (1,1,1) which is an
example for 3D. But the presence of an ant at (1,1,1) at
9am i.e (1,1,1,9) is an illustration for 4D.
4D Example: If the inner cube becomes the outer
one and vice-versa instantaneously, i.e
coordinates can be represented only with the
help of a fourth dimention, time.
What is 4D Touchpad?
• A touch pad is a device for pointing on a computer
display screen, originally incorporated in
laptop, computers etc.
• 4D Touchpad (4DT) is a type of touchpad where the
intuitive gestures of users can be used to give the
input.
• It contains a platform for human-machine interfaces
that provides direct interaction with interface
components through intuitive actions and gestures.
.
Example : Here for flipping a coin in the screen
user have to just imitate the action of flipping
in the Region Of Interface(ROI) of the 4DT.
• The 4DT is based on the 3D-2D Projection-
based mode of the VICs framework.
• The visual interaction cues (VICs) paradigm
uses a shared perceptual space between the
user and the computer.
Structure of 4D Touchpad :
• 4DT contains a pair of cameras with a wide-baseline
and a projector that are directed at a table.
• The region above the surface of the table between the
two cameras is the ROI of the 4DT.
• The image of the screen to be displayed to the user is
projected on to the table from the projector through
the mirror.
• Mirror is mounted such that screen is shown clearly in
the table.
• Special adjustment is done in caliberating the camera
so as to obtain the correct sequence of actions by the
user.
Structure of 4DT Cont...d
WORKING OF 4D TOUCHPAD:
The 4DT is based on the 3D-2D Projection-based
mode of the VICs framework. It involves following steps:
a] 3D-2D Projection:
b]Image Rectification:
Homography
c]Analysis of Stereo Properties:
Stereo analysis
Gesture Recognition
A natural Gesture Language
3D-2D Projection:
First the 3D view of the projected camera is
projected on to the plane by representing a
user’s pint of view in this 3D space.
This view is drawn below as the pyramid. All
ray of lights that pass this ”pyramid” will
originate from objects that the user sees.
In the end, the whole 3D to 2D projection
will be about projection the entire 3D world
on that plane in front of the pyramid.
Example : Imagine we want to project a
cube on a plane. If we want to draw the
situation, it would look somewhat like this.
Image Rectification Using
Homography:
• A Homography is an invertible transformation from the
real projective plane to the projective plane that maps
corresponding points.
• Homography has got mainly 2 properties:
1. For a stationary camera with its fixed centre of
projection, it does not depend on the scene
structure (i.e., depth of the scene points).
2. It applies even if the camera pans and
zooms, which means to change the focal length of
the camera while it is rotating about its centre.
In 4D touchpad the projection of the camera and the
actual figure is homographycally analysed.
Projected and Rectified Image
STEREO ANALYSIS :
• For both cameras, we can subtract the current frame
from a stored background frame yielding mask of
modified regions. This can be used for simple stereo
calculation.
• Then, we can take the difference between two
modified masks to find all pixels not on the plane and
use it in more intensive computations like 3D gesture
recognition.
Gesture Recognition:
• The action of a user gesturing over an interface
component presents a sequence of visual cues
gestures spatiotemporal signature.
Consider a standard push button :
1. The user enters the local region: Disturbance
2. The finger moves onto the button: large color blob
3. The finger pushes the button: fixed duration
4. The button is pressed and processing completes.
A natural Gesture Language:
• A natural gesture language implements the visual
interaction model on the 4DT plat-form.
• The gesture language comprises a vocabulary of
individual gestures.
Gesture Description
Press Finger on centre
Press-left Finger on left edge
Press-right Finger on right edge
Stop An open hand
Flips Mimics flipping a coin over
Twist Clockwise twisting
Drop
Pick
Index finger and thumb open
Index finger and thumb half closed
Since the cameras are fixed we lose the ability
to extract the panel orientation.
Set up is expensive as we use 2 cameras to
cover the wide base line of ROI.
Disadvantages of 4D Touch Pad
Conclusion :
• In the local regions of the video images, gesture recognition is solved by
modeling the spatio-temporal pattern of visual cues that correspond to
gestures.
• In the VICs paradigm, we approach gesture recognition without globally
tracking and modeling the user.
• The paradigm is applicable in both conventional 2D interface settings
and unconventional 3D virtual/augmented reality settings.
• The devices like the mouse is replaced with a direct, natural language of
interaction.
• It acts as a coherent, linguistic model that integrates heterogeneous
forms of the low-level gestures into a single framework.
• VICs modeling technique consistently orders image based on
overlapping pixel content and is robust to view point change and
occlusion.
References :
• Jason J. Corso, Guangqi Ye, Darius Burschka, and
Gregory D. Hager, ”A Practical Paradigm and Platform
for Video-Based Human-Computer Interaction” ,2008
• Jason J. Corso, ”The 4D Touchpad: Unencumbered
HCI With VICs”,2005
• A.D. Jepson, ”Computing a homography between 2D
scene and image”,2006
• F.J. Estrada ,A.D. Jepson, D. Fleet, ”Planar
Homographies”,2004
• Johnny Bistrom, Alessandro Cogliati, ”Post-WIMP
User Interface Model for 3D Web Applications”,2005
THANK YOU
.
THANK YOU

More Related Content

What's hot

A Shot Boundary Detection Method for News Video Based Human Skin Region (Face...
A Shot Boundary Detection Method for News Video Based Human Skin Region (Face...A Shot Boundary Detection Method for News Video Based Human Skin Region (Face...
A Shot Boundary Detection Method for News Video Based Human Skin Region (Face...ijsrd.com
 
What Is DSLR?
What Is DSLR?What Is DSLR?
What Is DSLR?stiky
 
Stereoscopic 3D: Generation Methods and Display Technologies for Industry and...
Stereoscopic 3D: Generation Methods and Display Technologies for Industry and...Stereoscopic 3D: Generation Methods and Display Technologies for Industry and...
Stereoscopic 3D: Generation Methods and Display Technologies for Industry and...Ray Phan
 
Hand Gesture Controls for Digital TV using Mobile ARM Platform
Hand Gesture Controls for Digital TV using Mobile ARM PlatformHand Gesture Controls for Digital TV using Mobile ARM Platform
Hand Gesture Controls for Digital TV using Mobile ARM Platformijsrd.com
 
Computer generated holography as a generic display technology
Computer generated holography as a generic display technologyComputer generated holography as a generic display technology
Computer generated holography as a generic display technologyPritam Bhansali
 
Dan Walsh - Undergrad FYP Presentation
Dan Walsh - Undergrad FYP PresentationDan Walsh - Undergrad FYP Presentation
Dan Walsh - Undergrad FYP PresentationDan Walsh
 
presentation on 3D Technology
presentation on 3D Technologypresentation on 3D Technology
presentation on 3D TechnologyMatti Ur Rehman
 
Interactive Projector Screen with Hand Detection Using LED Lights
Interactive Projector Screen with Hand Detection Using LED LightsInteractive Projector Screen with Hand Detection Using LED Lights
Interactive Projector Screen with Hand Detection Using LED LightsCSCJournals
 
3-D TECHNOLOGY
3-D TECHNOLOGY 3-D TECHNOLOGY
3-D TECHNOLOGY dharmawath
 
Goal location prediction based on deep learning using RGB-D camera
Goal location prediction based on deep learning using RGB-D cameraGoal location prediction based on deep learning using RGB-D camera
Goal location prediction based on deep learning using RGB-D camerajournalBEEI
 

What's hot (20)

3d tv
3d tv3d tv
3d tv
 
Broadcast Lens Technology Part 2
Broadcast Lens Technology Part 2Broadcast Lens Technology Part 2
Broadcast Lens Technology Part 2
 
3D Technology
3D Technology 3D Technology
3D Technology
 
A Shot Boundary Detection Method for News Video Based Human Skin Region (Face...
A Shot Boundary Detection Method for News Video Based Human Skin Region (Face...A Shot Boundary Detection Method for News Video Based Human Skin Region (Face...
A Shot Boundary Detection Method for News Video Based Human Skin Region (Face...
 
What Is DSLR?
What Is DSLR?What Is DSLR?
What Is DSLR?
 
Stereoscopic 3D: Generation Methods and Display Technologies for Industry and...
Stereoscopic 3D: Generation Methods and Display Technologies for Industry and...Stereoscopic 3D: Generation Methods and Display Technologies for Industry and...
Stereoscopic 3D: Generation Methods and Display Technologies for Industry and...
 
3 d technology
3 d technology3 d technology
3 d technology
 
Datt 2500 week 11
Datt 2500 week 11Datt 2500 week 11
Datt 2500 week 11
 
Computer Graphics display technologies(Computer graphics tutorials)
Computer Graphics display technologies(Computer graphics tutorials)Computer Graphics display technologies(Computer graphics tutorials)
Computer Graphics display technologies(Computer graphics tutorials)
 
Hand Gesture Controls for Digital TV using Mobile ARM Platform
Hand Gesture Controls for Digital TV using Mobile ARM PlatformHand Gesture Controls for Digital TV using Mobile ARM Platform
Hand Gesture Controls for Digital TV using Mobile ARM Platform
 
Computer generated holography as a generic display technology
Computer generated holography as a generic display technologyComputer generated holography as a generic display technology
Computer generated holography as a generic display technology
 
Dan Walsh - Undergrad FYP Presentation
Dan Walsh - Undergrad FYP PresentationDan Walsh - Undergrad FYP Presentation
Dan Walsh - Undergrad FYP Presentation
 
presentation on 3D Technology
presentation on 3D Technologypresentation on 3D Technology
presentation on 3D Technology
 
Interactive Projector Screen with Hand Detection Using LED Lights
Interactive Projector Screen with Hand Detection Using LED LightsInteractive Projector Screen with Hand Detection Using LED Lights
Interactive Projector Screen with Hand Detection Using LED Lights
 
3-D TECHNOLOGY
3-D TECHNOLOGY 3-D TECHNOLOGY
3-D TECHNOLOGY
 
3D PC GLASS
3D PC GLASS3D PC GLASS
3D PC GLASS
 
3D Stereoscopic Filmmaking
3D Stereoscopic Filmmaking3D Stereoscopic Filmmaking
3D Stereoscopic Filmmaking
 
Compressive DIsplays: SID Keynote by Ramesh Raskar
Compressive DIsplays: SID Keynote by Ramesh RaskarCompressive DIsplays: SID Keynote by Ramesh Raskar
Compressive DIsplays: SID Keynote by Ramesh Raskar
 
BEHIND 3D &4D
BEHIND 3D &4DBEHIND 3D &4D
BEHIND 3D &4D
 
Goal location prediction based on deep learning using RGB-D camera
Goal location prediction based on deep learning using RGB-D cameraGoal location prediction based on deep learning using RGB-D camera
Goal location prediction based on deep learning using RGB-D camera
 

Viewers also liked

Wireless Sensor Network with Mobile Integration
Wireless Sensor Network with Mobile IntegrationWireless Sensor Network with Mobile Integration
Wireless Sensor Network with Mobile Integrationimshashank
 
WIRELWIRELESS INTEGRATED NETWORK SENSORS
WIRELWIRELESS INTEGRATED NETWORK SENSORSWIRELWIRELESS INTEGRATED NETWORK SENSORS
WIRELWIRELESS INTEGRATED NETWORK SENSORSDeepak Kumar Mohapatra
 
What Is Google Wave?
What Is Google Wave?What Is Google Wave?
What Is Google Wave?Mark Fidelman
 
NOSA March 12, 2017 CSE-PPT Region 11 03122017
NOSA March 12, 2017 CSE-PPT Region 11 03122017NOSA March 12, 2017 CSE-PPT Region 11 03122017
NOSA March 12, 2017 CSE-PPT Region 11 03122017Civil Service Exam Info
 
Local multipoint distribution service(lmds)
Local multipoint distribution service(lmds)Local multipoint distribution service(lmds)
Local multipoint distribution service(lmds)Vivek Kumar
 
Wireless sensor network report
Wireless sensor network reportWireless sensor network report
Wireless sensor network reportGanesh Khadsan
 
Silent sound-technology ppt final
Silent sound-technology ppt finalSilent sound-technology ppt final
Silent sound-technology ppt finalLohit Dalal
 
Border security using wireless integrated network sensors(wins)
Border security using wireless integrated network sensors(wins)Border security using wireless integrated network sensors(wins)
Border security using wireless integrated network sensors(wins)PRADEEP Cheekatla
 
PORTABLE CAMERA-BASED ASSISTIVE TEXT AND PRODUCT LABEL READING FROM HAND- H...
PORTABLE CAMERA-BASED  ASSISTIVE TEXT AND PRODUCT  LABEL READING FROM HAND- H...PORTABLE CAMERA-BASED  ASSISTIVE TEXT AND PRODUCT  LABEL READING FROM HAND- H...
PORTABLE CAMERA-BASED ASSISTIVE TEXT AND PRODUCT LABEL READING FROM HAND- H...Sathmica K
 
Presentation on Paper battery
Presentation on Paper battery Presentation on Paper battery
Presentation on Paper battery manish katara
 
Power Saving in Wireless Sensor Networks
Power Saving in Wireless Sensor NetworksPower Saving in Wireless Sensor Networks
Power Saving in Wireless Sensor NetworksMshari Alabdulkarim
 

Viewers also liked (20)

Sohail resume
Sohail resumeSohail resume
Sohail resume
 
Haptics touch the virtual
Haptics touch the virtualHaptics touch the virtual
Haptics touch the virtual
 
Wireless Sensor Network with Mobile Integration
Wireless Sensor Network with Mobile IntegrationWireless Sensor Network with Mobile Integration
Wireless Sensor Network with Mobile Integration
 
Paper battery
Paper batteryPaper battery
Paper battery
 
WIRELWIRELESS INTEGRATED NETWORK SENSORS
WIRELWIRELESS INTEGRATED NETWORK SENSORSWIRELWIRELESS INTEGRATED NETWORK SENSORS
WIRELWIRELESS INTEGRATED NETWORK SENSORS
 
What Is Google Wave?
What Is Google Wave?What Is Google Wave?
What Is Google Wave?
 
Smart dust
Smart dustSmart dust
Smart dust
 
NOSA March 12, 2017 CSE-PPT Region 11 03122017
NOSA March 12, 2017 CSE-PPT Region 11 03122017NOSA March 12, 2017 CSE-PPT Region 11 03122017
NOSA March 12, 2017 CSE-PPT Region 11 03122017
 
Virtual keyboard
Virtual keyboardVirtual keyboard
Virtual keyboard
 
Local multipoint distribution service(lmds)
Local multipoint distribution service(lmds)Local multipoint distribution service(lmds)
Local multipoint distribution service(lmds)
 
Wireless sensor network report
Wireless sensor network reportWireless sensor network report
Wireless sensor network report
 
Paper Battery
Paper BatteryPaper Battery
Paper Battery
 
Silent sound-technology ppt final
Silent sound-technology ppt finalSilent sound-technology ppt final
Silent sound-technology ppt final
 
Border security using wireless integrated network sensors(wins)
Border security using wireless integrated network sensors(wins)Border security using wireless integrated network sensors(wins)
Border security using wireless integrated network sensors(wins)
 
zigbee full ppt
zigbee full pptzigbee full ppt
zigbee full ppt
 
Zigbee Presentation
Zigbee PresentationZigbee Presentation
Zigbee Presentation
 
PORTABLE CAMERA-BASED ASSISTIVE TEXT AND PRODUCT LABEL READING FROM HAND- H...
PORTABLE CAMERA-BASED  ASSISTIVE TEXT AND PRODUCT  LABEL READING FROM HAND- H...PORTABLE CAMERA-BASED  ASSISTIVE TEXT AND PRODUCT  LABEL READING FROM HAND- H...
PORTABLE CAMERA-BASED ASSISTIVE TEXT AND PRODUCT LABEL READING FROM HAND- H...
 
Paper battery
Paper batteryPaper battery
Paper battery
 
Presentation on Paper battery
Presentation on Paper battery Presentation on Paper battery
Presentation on Paper battery
 
Power Saving in Wireless Sensor Networks
Power Saving in Wireless Sensor NetworksPower Saving in Wireless Sensor Networks
Power Saving in Wireless Sensor Networks
 

Similar to Nayana

virtual reality Information-160422181930.pdf
virtual reality Information-160422181930.pdfvirtual reality Information-160422181930.pdf
virtual reality Information-160422181930.pdf21107117
 
sixth sense technology by pranav mistery
sixth sense technology by pranav misterysixth sense technology by pranav mistery
sixth sense technology by pranav misteryAmitGajera
 
eng.pptx
eng.pptxeng.pptx
eng.pptxZuine
 
project presentation on mouse simulation using finger tip detection
project presentation on mouse simulation using finger tip detection project presentation on mouse simulation using finger tip detection
project presentation on mouse simulation using finger tip detection Sumit Varshney
 
Sixth sense techno_ppt
Sixth sense techno_pptSixth sense techno_ppt
Sixth sense techno_pptPALAK_SINGH
 
Synthetic environment
Synthetic environmentSynthetic environment
Synthetic environmentUllas Gupta
 
The Sixth Sense Technology
The Sixth Sense TechnologyThe Sixth Sense Technology
The Sixth Sense TechnologyRonak Dumasia
 
Advance Interaction Techniques
Advance Interaction Techniques Advance Interaction Techniques
Advance Interaction Techniques Waqar_Ali52
 
Hihihihihihihivivivirtual reality.ppt.pptx
Hihihihihihihivivivirtual reality.ppt.pptxHihihihihihihivivivirtual reality.ppt.pptx
Hihihihihihihivivivirtual reality.ppt.pptxfijomiy607
 
Sixth sense techno_ppt
Sixth sense techno_pptSixth sense techno_ppt
Sixth sense techno_pptFeroz Ahmed
 
6th sense technology
6th sense technology6th sense technology
6th sense technologySarbjeet kaur
 

Similar to Nayana (20)

Virtual Reality(full)
Virtual Reality(full)Virtual Reality(full)
Virtual Reality(full)
 
Virtual Reality
Virtual RealityVirtual Reality
Virtual Reality
 
virtual reality Information-160422181930.pdf
virtual reality Information-160422181930.pdfvirtual reality Information-160422181930.pdf
virtual reality Information-160422181930.pdf
 
Virtual reality
Virtual realityVirtual reality
Virtual reality
 
Virtual Mouse
Virtual MouseVirtual Mouse
Virtual Mouse
 
ICS2208 lecture7
ICS2208 lecture7ICS2208 lecture7
ICS2208 lecture7
 
ARI2132 lecture 8
ARI2132 lecture 8ARI2132 lecture 8
ARI2132 lecture 8
 
sixth sense technology by pranav mistery
sixth sense technology by pranav misterysixth sense technology by pranav mistery
sixth sense technology by pranav mistery
 
Virtual reality
Virtual realityVirtual reality
Virtual reality
 
eng.pptx
eng.pptxeng.pptx
eng.pptx
 
Virtual keyboard
Virtual keyboardVirtual keyboard
Virtual keyboard
 
project presentation on mouse simulation using finger tip detection
project presentation on mouse simulation using finger tip detection project presentation on mouse simulation using finger tip detection
project presentation on mouse simulation using finger tip detection
 
Sixth sense techno_ppt
Sixth sense techno_pptSixth sense techno_ppt
Sixth sense techno_ppt
 
Synthetic environment
Synthetic environmentSynthetic environment
Synthetic environment
 
Kinect sensor
Kinect sensorKinect sensor
Kinect sensor
 
The Sixth Sense Technology
The Sixth Sense TechnologyThe Sixth Sense Technology
The Sixth Sense Technology
 
Advance Interaction Techniques
Advance Interaction Techniques Advance Interaction Techniques
Advance Interaction Techniques
 
Hihihihihihihivivivirtual reality.ppt.pptx
Hihihihihihihivivivirtual reality.ppt.pptxHihihihihihihivivivirtual reality.ppt.pptx
Hihihihihihihivivivirtual reality.ppt.pptx
 
Sixth sense techno_ppt
Sixth sense techno_pptSixth sense techno_ppt
Sixth sense techno_ppt
 
6th sense technology
6th sense technology6th sense technology
6th sense technology
 

Recently uploaded

IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024IoTAnalytics
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...Product School
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlPeter Udo Diehl
 
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya HalderCustom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya HalderCzechDreamin
 
Powerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaPowerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaCzechDreamin
 
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeFree and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeCzechDreamin
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Thierry Lestable
 
Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesBhaskar Mitra
 
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...CzechDreamin
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...Product School
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersSafe Software
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Product School
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Tobias Schneck
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...Sri Ambati
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsPaul Groth
 
Optimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through ObservabilityOptimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through ObservabilityScyllaDB
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...CzechDreamin
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...Product School
 
Introduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG EvaluationIntroduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG EvaluationZilliz
 

Recently uploaded (20)

IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
 
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya HalderCustom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
 
Powerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaPowerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara Laskowska
 
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeFree and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
 
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
Optimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through ObservabilityOptimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through Observability
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
Introduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG EvaluationIntroduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG Evaluation
 

Nayana

  • 1. VIDEO BASED HUMAN INTERACTION 4D TOUCH PAD NAYANA JOSHI, Dept. of IS
  • 2. OVERVIEW : • What is VBHI ? • Introduction to 4D Touch pad. • Structure & working of 4D ……
  • 3. • Using computer vision in human-computer interaction systems has become a popular approach to enhance current interfaces. • The majority of techniques that use vision rely on global user tracking and modeling. • But VBHI, watches a region-of-interest (ROI) in the video stream and waits for recognizable user-input; i.e. interaction is site-centric. VIDEO BASED HUMAN INTERACTION ?
  • 4. VIDEO BASED HUMAN INTERACTION ? Cont… • User can give the input in the form of gestures in the perceptual space which is governed by its interaction model. • In simplest case, tracked hand motion and gesture recognition could replace mouse in traditional applications.
  • 5. Comparing conventional VBI with the VICs approach
  • 6. INTRODUCTION TO 4D • We live in a world of three dimensions. • 2D and 3D are very familiar to all which contains only the spatial dimension (length, width, height). • But the concept of 4D is new. The fourth dimension can be: • Fragnance • Sense of touch • Time
  • 7. Introduction Cont . . . • Here the concept of 4D combines space and time within a single coordinate system, typically with three spatial dimensions and one temporal dimension(time). • In spacetime, a coordinate grid that spans the 3+1 dimensions locates events i.e you have where and when something is. • Example: Consider an ant at a point (1,1,1) which is an example for 3D. But the presence of an ant at (1,1,1) at 9am i.e (1,1,1,9) is an illustration for 4D.
  • 8. 4D Example: If the inner cube becomes the outer one and vice-versa instantaneously, i.e coordinates can be represented only with the help of a fourth dimention, time.
  • 9. What is 4D Touchpad? • A touch pad is a device for pointing on a computer display screen, originally incorporated in laptop, computers etc. • 4D Touchpad (4DT) is a type of touchpad where the intuitive gestures of users can be used to give the input. • It contains a platform for human-machine interfaces that provides direct interaction with interface components through intuitive actions and gestures.
  • 10. . Example : Here for flipping a coin in the screen user have to just imitate the action of flipping in the Region Of Interface(ROI) of the 4DT. • The 4DT is based on the 3D-2D Projection- based mode of the VICs framework. • The visual interaction cues (VICs) paradigm uses a shared perceptual space between the user and the computer.
  • 11. Structure of 4D Touchpad :
  • 12. • 4DT contains a pair of cameras with a wide-baseline and a projector that are directed at a table. • The region above the surface of the table between the two cameras is the ROI of the 4DT. • The image of the screen to be displayed to the user is projected on to the table from the projector through the mirror. • Mirror is mounted such that screen is shown clearly in the table. • Special adjustment is done in caliberating the camera so as to obtain the correct sequence of actions by the user. Structure of 4DT Cont...d
  • 13. WORKING OF 4D TOUCHPAD: The 4DT is based on the 3D-2D Projection-based mode of the VICs framework. It involves following steps: a] 3D-2D Projection: b]Image Rectification: Homography c]Analysis of Stereo Properties: Stereo analysis Gesture Recognition A natural Gesture Language
  • 14. 3D-2D Projection: First the 3D view of the projected camera is projected on to the plane by representing a user’s pint of view in this 3D space. This view is drawn below as the pyramid. All ray of lights that pass this ”pyramid” will originate from objects that the user sees. In the end, the whole 3D to 2D projection will be about projection the entire 3D world on that plane in front of the pyramid. Example : Imagine we want to project a cube on a plane. If we want to draw the situation, it would look somewhat like this.
  • 15. Image Rectification Using Homography: • A Homography is an invertible transformation from the real projective plane to the projective plane that maps corresponding points. • Homography has got mainly 2 properties: 1. For a stationary camera with its fixed centre of projection, it does not depend on the scene structure (i.e., depth of the scene points). 2. It applies even if the camera pans and zooms, which means to change the focal length of the camera while it is rotating about its centre.
  • 16. In 4D touchpad the projection of the camera and the actual figure is homographycally analysed. Projected and Rectified Image
  • 17. STEREO ANALYSIS : • For both cameras, we can subtract the current frame from a stored background frame yielding mask of modified regions. This can be used for simple stereo calculation. • Then, we can take the difference between two modified masks to find all pixels not on the plane and use it in more intensive computations like 3D gesture recognition.
  • 18. Gesture Recognition: • The action of a user gesturing over an interface component presents a sequence of visual cues gestures spatiotemporal signature. Consider a standard push button : 1. The user enters the local region: Disturbance 2. The finger moves onto the button: large color blob 3. The finger pushes the button: fixed duration 4. The button is pressed and processing completes.
  • 19. A natural Gesture Language: • A natural gesture language implements the visual interaction model on the 4DT plat-form. • The gesture language comprises a vocabulary of individual gestures. Gesture Description Press Finger on centre Press-left Finger on left edge Press-right Finger on right edge Stop An open hand Flips Mimics flipping a coin over Twist Clockwise twisting Drop Pick Index finger and thumb open Index finger and thumb half closed
  • 20. Since the cameras are fixed we lose the ability to extract the panel orientation. Set up is expensive as we use 2 cameras to cover the wide base line of ROI. Disadvantages of 4D Touch Pad
  • 21. Conclusion : • In the local regions of the video images, gesture recognition is solved by modeling the spatio-temporal pattern of visual cues that correspond to gestures. • In the VICs paradigm, we approach gesture recognition without globally tracking and modeling the user. • The paradigm is applicable in both conventional 2D interface settings and unconventional 3D virtual/augmented reality settings. • The devices like the mouse is replaced with a direct, natural language of interaction. • It acts as a coherent, linguistic model that integrates heterogeneous forms of the low-level gestures into a single framework. • VICs modeling technique consistently orders image based on overlapping pixel content and is robust to view point change and occlusion.
  • 22. References : • Jason J. Corso, Guangqi Ye, Darius Burschka, and Gregory D. Hager, ”A Practical Paradigm and Platform for Video-Based Human-Computer Interaction” ,2008 • Jason J. Corso, ”The 4D Touchpad: Unencumbered HCI With VICs”,2005 • A.D. Jepson, ”Computing a homography between 2D scene and image”,2006 • F.J. Estrada ,A.D. Jepson, D. Fleet, ”Planar Homographies”,2004 • Johnny Bistrom, Alessandro Cogliati, ”Post-WIMP User Interface Model for 3D Web Applications”,2005