SlideShare a Scribd company logo
1 of 24
Download to read offline
Robots that see likeRobots that see like
humanshumans
Real Time Stereo Vision With ZED Camera
on Nvidia Jetson TX1
Walter Lucetti
myzhar@robot-home.it
Summary
21/05/2016Officine Robotiche 2016
2
WalterLucetti–myzhar@robot-home.it
Live Demo
Conclusions
Introduction
Stereo “Binocular” Vision
Robotic Stereo Vision
“Simplified” Stereo Vision
Pin Hole Camera model
Stereo Vision camera model
Stereo Vision process
A Robotic Stereo Vision System
Nvidia Jetson TX1
Stereolabs ZED camera
Stereolabs ZED SDK
Stereolabs ZED engine
Simple Code example
Introduction3
Introduction
4
WalterLucetti–myzhar@robot-home.it
Can a robot replicate the complicated
human binocular vision system?
21/05/2016Officine Robotiche 2016
Stereo “Binocular” Vision
5
[from Wikipedia: Binocular vision]
Vision made with TWO “EYES”
It gives a “creature” a spare eye in case one is damaged
It gives a wider field of view
It can give stereopsis in which binocular disparity (or
parallax) provided by the two eyes' different positions on
the head gives precise depth perception
It allows a creature to see more of, or all of, an object
behind an obstacle
It gives binocular summation in which the ability to detect
faint objects is enhanced
WalterLucetti–myzhar@robot-home.it
21/05/2016Officine Robotiche 2016
Robotic Stereo Vision
6
How to replicate Human Stereo Vision?
Two synchronized camera sensors
A solid mechanical structure
A powerful elaboration system
and if we want to be more precise, but we want to complicate the
system:
Two motors to replicate vergence capability
WalterLucetti–myzhar@robot-home.it
21/05/2016Officine Robotiche 2016
“Simplified” Stereo Vision7
“Simplified” Stereo Vision
8
WalterLucetti–myzhar@robot-home.it
A complete Stereo Vision system has
vergence capability, but is too complicated
We will study a Simplified Stereo Vision
system where cameras cannot rotate
21/05/2016Officine Robotiche 2016
9
WalterLucetti–myzhar@robot-home.it
Pin hole camera model
Camera
Plane
Optical
Axis
Image coordinates
World coordinates
World to image
Image to world
Without “Z” we cannot estimate the real dimension of an object using
a single camera!
“f” can be estimated
calibrating the
camera system
This is a simplified
camera model that
does not take care
of
Optical Distorsion
and
Sensor
Imperfections
21/05/2016Officine Robotiche 2016
u=
f ∗x
z
v=
f ∗y
z
x=
u∗z
f
y=
v∗z
f
10
WalterLucetti–myzhar@robot-home.it
Stereo Vision model
Very important parameter:
●
Baseline: b
From triangles similarity:
Very important value:
● Disparity: D = ul
- ur
From disparity we can calculate how
much a point is far from a camera:
THE “Z” VALUE!!!
21/05/2016Officine Robotiche 2016
f
Z P
=
ul
X P
f
Z P
=
−ur
b−X P
ZP=
b∗ f
ul−ur
Calculate “Disparity” (D)
for each point of the image
11
WalterLucetti–myzhar@robot-home.it
Stereo Vision process
Calculate the depth (Z) of
each point of the image
Calculate X and Y of each
point of the image
Algorithm
MATCHING
REQUIRES A LOT
OF
COMPUTATIONAL
POWER
D = ul
- ur
21/05/2016Officine Robotiche 2016
12
WalterLucetti–myzhar@robot-home.it
Stereo Vision summary
Allows to calculate real world coordinate
of an object
Requires a solid mechanical structure
Requires a precise calibration of the
optical system
Requires a lot of computational power
…
Is more complicated than what seen until
now
21/05/2016Officine Robotiche 2016
A Robotic Stereo Vision system13
14
WalterLucetti–myzhar@robot-home.it
A Robotic Stereo Vision System
Nvidia Jetson TX1
www.nvidia.com/object/jetson-embedded-
systems.html
Stereolabs ZED
www.stereolabs.com
21/05/2016Officine Robotiche 2016
15
WalterLucetti–myzhar@robot-home.it
Nvidia Jetson TK1
Main features
Nvidia Tegra X1 SOC
NVIDIA Maxwell GPU with 256
CUDA cores
NVIDIA 4-Plus-1™ 2.3 Ghz quad-
core ARM® Cortex™-A15 CPU
4 GB LPDDR4 Memory
16 GB eMMC 5.1 Flash Storage
1 full-size SD/MMC connector
1 full-size HDMI port
1 USB 2.0 port, micro AB
1 USB 3.0 port, A
1 RTL8111GS Realtek GigE LAN
GPIO, SPI, I2C, UART, CAN
WiFi & Bluetooth connection
...
 Native OpenCV support with GPU
module for Real Time computer vision
 ROS enabled
 Ubuntu powered
and more...
 Android
 ...
21/05/2016Officine Robotiche 2016
16
WalterLucetti–myzhar@robot-home.it
Stereolabs ZED camera
Main features
Auto calibrating and auto sync
USB 3.0
High Res at high FPS
2208x1242 15 Hz
1920x1080 30 Hz
1280x720 60 Hz
640x480 120 Hz
Simple and well documented C++
SDK
Automatic Depth Map generation
3D Range: 1 m to 20 m
(12 cm baseline)
Outdoor and indoor
Jetson TX1 driver
ROS driver
OpenCV support
21/05/2016Officine Robotiche 2016
17
WalterLucetti–myzhar@robot-home.it
Stereolabs ZED SDK
C++ Classes and APIs for Linux and Windows
2 main classes and 5 support classes
Main classes:
Camera: the main class to use the ZED camera
Mat: the class to store image data in GPU and CPU memory
spaces
Support classes:
CamParameters: calibration params for the two sensors
StereoParameters: calibration params for stereo processing
resolution: dimensions of the images
float3Struct: 3D point
uchar3Struct: RGB pixel
Current version:
0.9.3-beta for Linux and Windows, 0.9.3-alpha for Jetson TX1
21/05/2016Officine Robotiche 2016
18
WalterLucetti–myzhar@robot-home.it
Stereolabs ZED engine
Stereo results are available
in GPU memory
Direct processing by CUDA
or download to CPU memory
for CPU processing
21/05/2016Officine Robotiche 2016
19
WalterLucetti–myzhar@robot-home.it
Stereolabs ZED tools
ZED Explorer ZED Depth Viewer
21/05/2016Officine Robotiche 2016
20
WalterLucetti–myzhar@robot-home.it
Simple code example
//ZED Includes
#include <zed/Camera.hpp>
int main(int argc, char **argv)
{
sl::zed::SENSING_MODE dm_type = sl::zed::RAW;
sl::zed::Camera* zed;
// Camera object creation
zed = new sl::zed::Camera(sl::zed::HD2K);
// We can use different resolutions:
// HD2K HD1080 HD720 VGA
// Sizes of the images
int width = zed->getImageSize().width;
int height = zed->getImageSize().height;
// Initialization of the camera
sl::zed::ERRCODE err =
zed->init( sl::zed::MODE::PERFORMANCE );
if (err != sl::zed::SUCCESS)
{
delete zed;
return 1;
}
// Stereo processing
zed->grab(dm_type);
// CPU Data
sl::zed::Mat depth_cpu;
sl::zed::Mat left_cpu;
sl::zed::Mat right_cpu;
// Data acquiring on CPU memory
depth_cpu =
zed->retrieveMeasure(sl::MEASURE::DEPTH);
left_cpu =
zed->retrieveImage(sl::SIDE::LEFT);
right_cpu =
zed->retrieveImage(sl::SIDE::RIGHT);
// GPU Data
sl::zed::Mat depth_gpu;
sl::zed::Mat left_gpu;
sl::zed::Mat right_gpu;
// Data acquiring on GPU memory
depth_cpu =
zed->retrieveMeasure_gpu(sl::MEASURE::DEPTH);
left_cpu =
zed->retrieveImage_gpu(sl::SIDE::LEFT);
right_cpu =
zed->retrieveImage_gpu(sl::SIDE::RIGHT);
return 0;
}
21/05/2016Officine Robotiche 2016
Live demo21
Conclusions22
23
WalterLucetti–myzhar@robot-home.it
Conclusions
With Stereo Vision
A robot can see obstacles
A robot can measure obstacles
A robot can localize obstacles
Stereo Vision requires
a lot of mathematical elaborations
a lot of computational power
a solid mechanical structure
so
Nvidia Jetson TX1 with Stereolabs ZED
is a really amazing solution for Robotic Stereo Vision!
21/05/2016Officine Robotiche 2016
The End!
OR
THANK YOU FOR YOUR
ATTENTION!
Walter Lucetti
http://myzharbot.robot-home.it
GitHub
@myzhar
Youtube
Myzhar
Twitter
@myzhar
Facebook
robothome
24
21/05/2016Officine Robotiche 2016

More Related Content

What's hot

3 d graphics with opengl part 2
3 d graphics with opengl  part 23 d graphics with opengl  part 2
3 d graphics with opengl part 2Sardar Alam
 
Build Your Own 3D Scanner: Introduction
Build Your Own 3D Scanner: IntroductionBuild Your Own 3D Scanner: Introduction
Build Your Own 3D Scanner: IntroductionDouglas Lanman
 
Build Your Own 3D Scanner: 3D Scanning with Swept-Planes
Build Your Own 3D Scanner: 3D Scanning with Swept-PlanesBuild Your Own 3D Scanner: 3D Scanning with Swept-Planes
Build Your Own 3D Scanner: 3D Scanning with Swept-PlanesDouglas Lanman
 
EENG512FinalPresentation_DanielKuntz
EENG512FinalPresentation_DanielKuntzEENG512FinalPresentation_DanielKuntz
EENG512FinalPresentation_DanielKuntzDaniel K
 
Lecture 02 yasutaka furukawa - 3 d reconstruction with priors
Lecture 02   yasutaka furukawa - 3 d reconstruction with priorsLecture 02   yasutaka furukawa - 3 d reconstruction with priors
Lecture 02 yasutaka furukawa - 3 d reconstruction with priorsmustafa sarac
 
Notes on image processing
Notes on image processingNotes on image processing
Notes on image processingMohammed Kamel
 
Build Your Own 3D Scanner: The Mathematics of 3D Triangulation
Build Your Own 3D Scanner: The Mathematics of 3D TriangulationBuild Your Own 3D Scanner: The Mathematics of 3D Triangulation
Build Your Own 3D Scanner: The Mathematics of 3D TriangulationDouglas Lanman
 
Fingerprint High Level Classification
Fingerprint High Level ClassificationFingerprint High Level Classification
Fingerprint High Level ClassificationReza Rahimi
 
Keynote at Tracking Workshop during ISMAR 2014
Keynote at Tracking Workshop during ISMAR 2014Keynote at Tracking Workshop during ISMAR 2014
Keynote at Tracking Workshop during ISMAR 2014Darius Burschka
 
Depth estimation from stereo image pairs using block-matching
Depth estimation from stereo image pairs using block-matchingDepth estimation from stereo image pairs using block-matching
Depth estimation from stereo image pairs using block-matchingAbhranil Das
 
Build Your Own 3D Scanner: Conclusion
Build Your Own 3D Scanner: ConclusionBuild Your Own 3D Scanner: Conclusion
Build Your Own 3D Scanner: ConclusionDouglas Lanman
 
Saad alsheekh multi view
Saad alsheekh  multi viewSaad alsheekh  multi view
Saad alsheekh multi viewSaadAlSheekh1
 
Wii Sensor Bar Positioning in 3D Space
Wii Sensor Bar Positioning in 3D SpaceWii Sensor Bar Positioning in 3D Space
Wii Sensor Bar Positioning in 3D SpaceYoav Francis
 
Build Your Own 3D Scanner: 3D Scanning with Structured Lighting
Build Your Own 3D Scanner: 3D Scanning with Structured LightingBuild Your Own 3D Scanner: 3D Scanning with Structured Lighting
Build Your Own 3D Scanner: 3D Scanning with Structured LightingDouglas Lanman
 

What's hot (20)

Computer Graphics
Computer GraphicsComputer Graphics
Computer Graphics
 
3 d graphics with opengl part 2
3 d graphics with opengl  part 23 d graphics with opengl  part 2
3 d graphics with opengl part 2
 
Build Your Own 3D Scanner: Introduction
Build Your Own 3D Scanner: IntroductionBuild Your Own 3D Scanner: Introduction
Build Your Own 3D Scanner: Introduction
 
Build Your Own 3D Scanner: 3D Scanning with Swept-Planes
Build Your Own 3D Scanner: 3D Scanning with Swept-PlanesBuild Your Own 3D Scanner: 3D Scanning with Swept-Planes
Build Your Own 3D Scanner: 3D Scanning with Swept-Planes
 
EENG512FinalPresentation_DanielKuntz
EENG512FinalPresentation_DanielKuntzEENG512FinalPresentation_DanielKuntz
EENG512FinalPresentation_DanielKuntz
 
Lecture1
Lecture1Lecture1
Lecture1
 
Lecture 02 yasutaka furukawa - 3 d reconstruction with priors
Lecture 02   yasutaka furukawa - 3 d reconstruction with priorsLecture 02   yasutaka furukawa - 3 d reconstruction with priors
Lecture 02 yasutaka furukawa - 3 d reconstruction with priors
 
Notes on image processing
Notes on image processingNotes on image processing
Notes on image processing
 
CGV 18CS62 VTU CSE
CGV 18CS62 VTU CSECGV 18CS62 VTU CSE
CGV 18CS62 VTU CSE
 
Build Your Own 3D Scanner: The Mathematics of 3D Triangulation
Build Your Own 3D Scanner: The Mathematics of 3D TriangulationBuild Your Own 3D Scanner: The Mathematics of 3D Triangulation
Build Your Own 3D Scanner: The Mathematics of 3D Triangulation
 
Fingerprint High Level Classification
Fingerprint High Level ClassificationFingerprint High Level Classification
Fingerprint High Level Classification
 
sawano-icma2000
sawano-icma2000sawano-icma2000
sawano-icma2000
 
Clipping
ClippingClipping
Clipping
 
Hit and-miss transform
Hit and-miss transformHit and-miss transform
Hit and-miss transform
 
Keynote at Tracking Workshop during ISMAR 2014
Keynote at Tracking Workshop during ISMAR 2014Keynote at Tracking Workshop during ISMAR 2014
Keynote at Tracking Workshop during ISMAR 2014
 
Depth estimation from stereo image pairs using block-matching
Depth estimation from stereo image pairs using block-matchingDepth estimation from stereo image pairs using block-matching
Depth estimation from stereo image pairs using block-matching
 
Build Your Own 3D Scanner: Conclusion
Build Your Own 3D Scanner: ConclusionBuild Your Own 3D Scanner: Conclusion
Build Your Own 3D Scanner: Conclusion
 
Saad alsheekh multi view
Saad alsheekh  multi viewSaad alsheekh  multi view
Saad alsheekh multi view
 
Wii Sensor Bar Positioning in 3D Space
Wii Sensor Bar Positioning in 3D SpaceWii Sensor Bar Positioning in 3D Space
Wii Sensor Bar Positioning in 3D Space
 
Build Your Own 3D Scanner: 3D Scanning with Structured Lighting
Build Your Own 3D Scanner: 3D Scanning with Structured LightingBuild Your Own 3D Scanner: 3D Scanning with Structured Lighting
Build Your Own 3D Scanner: 3D Scanning with Structured Lighting
 

Viewers also liked

Passive stereo vision with deep learning
Passive stereo vision with deep learningPassive stereo vision with deep learning
Passive stereo vision with deep learningYu Huang
 
Stereoscopic Display of Lung PET/CT DICOM Scans using Perspective
Stereoscopic Display of Lung PET/CT DICOM Scans using PerspectiveStereoscopic Display of Lung PET/CT DICOM Scans using Perspective
Stereoscopic Display of Lung PET/CT DICOM Scans using PerspectiveCassidy Chen
 
Stereogram and Stereo Vision
Stereogram and Stereo VisionStereogram and Stereo Vision
Stereogram and Stereo VisionMasayuki Tanaka
 
bag-of-words models
bag-of-words models bag-of-words models
bag-of-words models Xiaotao Zou
 
Programming with kinect v2
Programming with kinect v2Programming with kinect v2
Programming with kinect v2Matteo Valoriani
 
11 X1 T05 07 Angle Between Two Lines
11 X1 T05 07 Angle Between Two Lines11 X1 T05 07 Angle Between Two Lines
11 X1 T05 07 Angle Between Two LinesNigel Simmons
 
Faster R-CNN: Towards real-time object detection with region proposal network...
Faster R-CNN: Towards real-time object detection with region proposal network...Faster R-CNN: Towards real-time object detection with region proposal network...
Faster R-CNN: Towards real-time object detection with region proposal network...Universitat Politècnica de Catalunya
 
MEMS Laser Scanning, the platform for next generation of 3D Depth Sensors
MEMS Laser Scanning, the platform for next generation of 3D Depth SensorsMEMS Laser Scanning, the platform for next generation of 3D Depth Sensors
MEMS Laser Scanning, the platform for next generation of 3D Depth SensorsMicroVision
 

Viewers also liked (9)

Passive stereo vision with deep learning
Passive stereo vision with deep learningPassive stereo vision with deep learning
Passive stereo vision with deep learning
 
Stereoscopic Display of Lung PET/CT DICOM Scans using Perspective
Stereoscopic Display of Lung PET/CT DICOM Scans using PerspectiveStereoscopic Display of Lung PET/CT DICOM Scans using Perspective
Stereoscopic Display of Lung PET/CT DICOM Scans using Perspective
 
News
NewsNews
News
 
Stereogram and Stereo Vision
Stereogram and Stereo VisionStereogram and Stereo Vision
Stereogram and Stereo Vision
 
bag-of-words models
bag-of-words models bag-of-words models
bag-of-words models
 
Programming with kinect v2
Programming with kinect v2Programming with kinect v2
Programming with kinect v2
 
11 X1 T05 07 Angle Between Two Lines
11 X1 T05 07 Angle Between Two Lines11 X1 T05 07 Angle Between Two Lines
11 X1 T05 07 Angle Between Two Lines
 
Faster R-CNN: Towards real-time object detection with region proposal network...
Faster R-CNN: Towards real-time object detection with region proposal network...Faster R-CNN: Towards real-time object detection with region proposal network...
Faster R-CNN: Towards real-time object detection with region proposal network...
 
MEMS Laser Scanning, the platform for next generation of 3D Depth Sensors
MEMS Laser Scanning, the platform for next generation of 3D Depth SensorsMEMS Laser Scanning, the platform for next generation of 3D Depth Sensors
MEMS Laser Scanning, the platform for next generation of 3D Depth Sensors
 

Similar to Robots that see like humans

Creating next-gen VR and MR experiences using Varjo VR-1 and XR-1 - Unite Cop...
Creating next-gen VR and MR experiences using Varjo VR-1 and XR-1 - Unite Cop...Creating next-gen VR and MR experiences using Varjo VR-1 and XR-1 - Unite Cop...
Creating next-gen VR and MR experiences using Varjo VR-1 and XR-1 - Unite Cop...Unity Technologies
 
OTOY GTC17 Presentation Slides: "The Future of GPU Rendering"
OTOY GTC17 Presentation Slides: "The Future of GPU Rendering"OTOY GTC17 Presentation Slides: "The Future of GPU Rendering"
OTOY GTC17 Presentation Slides: "The Future of GPU Rendering"OTOY Inc.
 
[Paper Presentation] FoveAR: Combining an Optically See-Through Near-Eye Disp...
[Paper Presentation] FoveAR: Combining an Optically See-Through Near-Eye Disp...[Paper Presentation] FoveAR: Combining an Optically See-Through Near-Eye Disp...
[Paper Presentation] FoveAR: Combining an Optically See-Through Near-Eye Disp...Pei-Hsuan (Ike) Tsai
 
Visual Mapping and Collision Avoidance Dynamic Environments in Dynamic Enviro...
Visual Mapping and Collision Avoidance Dynamic Environments in Dynamic Enviro...Visual Mapping and Collision Avoidance Dynamic Environments in Dynamic Enviro...
Visual Mapping and Collision Avoidance Dynamic Environments in Dynamic Enviro...Darius Burschka
 
Recent Trends in Signal and Image Processing - Applications
Recent Trends in Signal and Image Processing - ApplicationsRecent Trends in Signal and Image Processing - Applications
Recent Trends in Signal and Image Processing - ApplicationsAnand Muglikar
 
VR digest. June 2018
VR digest. June 2018VR digest. June 2018
VR digest. June 2018ElifTech
 
Design the implementation of Robotic Simulator: Goalkeeper.
Design the implementation of Robotic Simulator: Goalkeeper.Design the implementation of Robotic Simulator: Goalkeeper.
Design the implementation of Robotic Simulator: Goalkeeper.Ankita Tiwari
 
Virtual_Reality_New.pptx
Virtual_Reality_New.pptxVirtual_Reality_New.pptx
Virtual_Reality_New.pptxBapuPradhan1
 
OTOY Presentation - 2015 NVIDIA GPU Technology Conference - March 17 2015
OTOY Presentation - 2015 NVIDIA GPU Technology Conference - March 17 2015OTOY Presentation - 2015 NVIDIA GPU Technology Conference - March 17 2015
OTOY Presentation - 2015 NVIDIA GPU Technology Conference - March 17 2015otoyinc
 
The mag pi-issue-28-en
The mag pi-issue-28-enThe mag pi-issue-28-en
The mag pi-issue-28-enNguyen Nam
 
THE THIRD EYE-Presentation
THE THIRD EYE-PresentationTHE THIRD EYE-Presentation
THE THIRD EYE-PresentationRomil Shah
 
"Image Sensor Formats and Interfaces for IoT Applications," a Presentation fr...
"Image Sensor Formats and Interfaces for IoT Applications," a Presentation fr..."Image Sensor Formats and Interfaces for IoT Applications," a Presentation fr...
"Image Sensor Formats and Interfaces for IoT Applications," a Presentation fr...Edge AI and Vision Alliance
 
Discovering The Unknown Aspects Of Nuke
Discovering The Unknown Aspects Of NukeDiscovering The Unknown Aspects Of Nuke
Discovering The Unknown Aspects Of NukeAnimation Kolkata
 

Similar to Robots that see like humans (20)

Virtual Techniques: VDC - Trend Report 2018
Virtual Techniques: VDC - Trend Report 2018Virtual Techniques: VDC - Trend Report 2018
Virtual Techniques: VDC - Trend Report 2018
 
Creating next-gen VR and MR experiences using Varjo VR-1 and XR-1 - Unite Cop...
Creating next-gen VR and MR experiences using Varjo VR-1 and XR-1 - Unite Cop...Creating next-gen VR and MR experiences using Varjo VR-1 and XR-1 - Unite Cop...
Creating next-gen VR and MR experiences using Varjo VR-1 and XR-1 - Unite Cop...
 
3D Technology
3D Technology 3D Technology
3D Technology
 
OTOY GTC17 Presentation Slides: "The Future of GPU Rendering"
OTOY GTC17 Presentation Slides: "The Future of GPU Rendering"OTOY GTC17 Presentation Slides: "The Future of GPU Rendering"
OTOY GTC17 Presentation Slides: "The Future of GPU Rendering"
 
[Paper Presentation] FoveAR: Combining an Optically See-Through Near-Eye Disp...
[Paper Presentation] FoveAR: Combining an Optically See-Through Near-Eye Disp...[Paper Presentation] FoveAR: Combining an Optically See-Through Near-Eye Disp...
[Paper Presentation] FoveAR: Combining an Optically See-Through Near-Eye Disp...
 
Visual Mapping and Collision Avoidance Dynamic Environments in Dynamic Enviro...
Visual Mapping and Collision Avoidance Dynamic Environments in Dynamic Enviro...Visual Mapping and Collision Avoidance Dynamic Environments in Dynamic Enviro...
Visual Mapping and Collision Avoidance Dynamic Environments in Dynamic Enviro...
 
3D Workshop
3D Workshop3D Workshop
3D Workshop
 
Recent Trends in Signal and Image Processing - Applications
Recent Trends in Signal and Image Processing - ApplicationsRecent Trends in Signal and Image Processing - Applications
Recent Trends in Signal and Image Processing - Applications
 
VR digest. June 2018
VR digest. June 2018VR digest. June 2018
VR digest. June 2018
 
Hacking for salone: drone races
Hacking for salone: drone racesHacking for salone: drone races
Hacking for salone: drone races
 
Design the implementation of Robotic Simulator: Goalkeeper.
Design the implementation of Robotic Simulator: Goalkeeper.Design the implementation of Robotic Simulator: Goalkeeper.
Design the implementation of Robotic Simulator: Goalkeeper.
 
Virtual_Reality_New.pptx
Virtual_Reality_New.pptxVirtual_Reality_New.pptx
Virtual_Reality_New.pptx
 
OTOY Presentation - 2015 NVIDIA GPU Technology Conference - March 17 2015
OTOY Presentation - 2015 NVIDIA GPU Technology Conference - March 17 2015OTOY Presentation - 2015 NVIDIA GPU Technology Conference - March 17 2015
OTOY Presentation - 2015 NVIDIA GPU Technology Conference - March 17 2015
 
The mag pi-issue-28-en
The mag pi-issue-28-enThe mag pi-issue-28-en
The mag pi-issue-28-en
 
THE THIRD EYE-Presentation
THE THIRD EYE-PresentationTHE THIRD EYE-Presentation
THE THIRD EYE-Presentation
 
"Image Sensor Formats and Interfaces for IoT Applications," a Presentation fr...
"Image Sensor Formats and Interfaces for IoT Applications," a Presentation fr..."Image Sensor Formats and Interfaces for IoT Applications," a Presentation fr...
"Image Sensor Formats and Interfaces for IoT Applications," a Presentation fr...
 
Discovering The Unknown Aspects Of Nuke
Discovering The Unknown Aspects Of NukeDiscovering The Unknown Aspects Of Nuke
Discovering The Unknown Aspects Of Nuke
 
Raskar Computational Camera Fall 2009 Lecture 01
Raskar Computational Camera Fall 2009 Lecture 01Raskar Computational Camera Fall 2009 Lecture 01
Raskar Computational Camera Fall 2009 Lecture 01
 
Datt 2501 week 11
Datt 2501 week 11Datt 2501 week 11
Datt 2501 week 11
 
15793804.ppt
15793804.ppt15793804.ppt
15793804.ppt
 

Recently uploaded

Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbuapidays
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 

Recently uploaded (20)

Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 

Robots that see like humans

  • 1. Robots that see likeRobots that see like humanshumans Real Time Stereo Vision With ZED Camera on Nvidia Jetson TX1 Walter Lucetti myzhar@robot-home.it
  • 2. Summary 21/05/2016Officine Robotiche 2016 2 WalterLucetti–myzhar@robot-home.it Live Demo Conclusions Introduction Stereo “Binocular” Vision Robotic Stereo Vision “Simplified” Stereo Vision Pin Hole Camera model Stereo Vision camera model Stereo Vision process A Robotic Stereo Vision System Nvidia Jetson TX1 Stereolabs ZED camera Stereolabs ZED SDK Stereolabs ZED engine Simple Code example
  • 4. Introduction 4 WalterLucetti–myzhar@robot-home.it Can a robot replicate the complicated human binocular vision system? 21/05/2016Officine Robotiche 2016
  • 5. Stereo “Binocular” Vision 5 [from Wikipedia: Binocular vision] Vision made with TWO “EYES” It gives a “creature” a spare eye in case one is damaged It gives a wider field of view It can give stereopsis in which binocular disparity (or parallax) provided by the two eyes' different positions on the head gives precise depth perception It allows a creature to see more of, or all of, an object behind an obstacle It gives binocular summation in which the ability to detect faint objects is enhanced WalterLucetti–myzhar@robot-home.it 21/05/2016Officine Robotiche 2016
  • 6. Robotic Stereo Vision 6 How to replicate Human Stereo Vision? Two synchronized camera sensors A solid mechanical structure A powerful elaboration system and if we want to be more precise, but we want to complicate the system: Two motors to replicate vergence capability WalterLucetti–myzhar@robot-home.it 21/05/2016Officine Robotiche 2016
  • 8. “Simplified” Stereo Vision 8 WalterLucetti–myzhar@robot-home.it A complete Stereo Vision system has vergence capability, but is too complicated We will study a Simplified Stereo Vision system where cameras cannot rotate 21/05/2016Officine Robotiche 2016
  • 9. 9 WalterLucetti–myzhar@robot-home.it Pin hole camera model Camera Plane Optical Axis Image coordinates World coordinates World to image Image to world Without “Z” we cannot estimate the real dimension of an object using a single camera! “f” can be estimated calibrating the camera system This is a simplified camera model that does not take care of Optical Distorsion and Sensor Imperfections 21/05/2016Officine Robotiche 2016 u= f ∗x z v= f ∗y z x= u∗z f y= v∗z f
  • 10. 10 WalterLucetti–myzhar@robot-home.it Stereo Vision model Very important parameter: ● Baseline: b From triangles similarity: Very important value: ● Disparity: D = ul - ur From disparity we can calculate how much a point is far from a camera: THE “Z” VALUE!!! 21/05/2016Officine Robotiche 2016 f Z P = ul X P f Z P = −ur b−X P ZP= b∗ f ul−ur
  • 11. Calculate “Disparity” (D) for each point of the image 11 WalterLucetti–myzhar@robot-home.it Stereo Vision process Calculate the depth (Z) of each point of the image Calculate X and Y of each point of the image Algorithm MATCHING REQUIRES A LOT OF COMPUTATIONAL POWER D = ul - ur 21/05/2016Officine Robotiche 2016
  • 12. 12 WalterLucetti–myzhar@robot-home.it Stereo Vision summary Allows to calculate real world coordinate of an object Requires a solid mechanical structure Requires a precise calibration of the optical system Requires a lot of computational power … Is more complicated than what seen until now 21/05/2016Officine Robotiche 2016
  • 13. A Robotic Stereo Vision system13
  • 14. 14 WalterLucetti–myzhar@robot-home.it A Robotic Stereo Vision System Nvidia Jetson TX1 www.nvidia.com/object/jetson-embedded- systems.html Stereolabs ZED www.stereolabs.com 21/05/2016Officine Robotiche 2016
  • 15. 15 WalterLucetti–myzhar@robot-home.it Nvidia Jetson TK1 Main features Nvidia Tegra X1 SOC NVIDIA Maxwell GPU with 256 CUDA cores NVIDIA 4-Plus-1™ 2.3 Ghz quad- core ARM® Cortex™-A15 CPU 4 GB LPDDR4 Memory 16 GB eMMC 5.1 Flash Storage 1 full-size SD/MMC connector 1 full-size HDMI port 1 USB 2.0 port, micro AB 1 USB 3.0 port, A 1 RTL8111GS Realtek GigE LAN GPIO, SPI, I2C, UART, CAN WiFi & Bluetooth connection ...  Native OpenCV support with GPU module for Real Time computer vision  ROS enabled  Ubuntu powered and more...  Android  ... 21/05/2016Officine Robotiche 2016
  • 16. 16 WalterLucetti–myzhar@robot-home.it Stereolabs ZED camera Main features Auto calibrating and auto sync USB 3.0 High Res at high FPS 2208x1242 15 Hz 1920x1080 30 Hz 1280x720 60 Hz 640x480 120 Hz Simple and well documented C++ SDK Automatic Depth Map generation 3D Range: 1 m to 20 m (12 cm baseline) Outdoor and indoor Jetson TX1 driver ROS driver OpenCV support 21/05/2016Officine Robotiche 2016
  • 17. 17 WalterLucetti–myzhar@robot-home.it Stereolabs ZED SDK C++ Classes and APIs for Linux and Windows 2 main classes and 5 support classes Main classes: Camera: the main class to use the ZED camera Mat: the class to store image data in GPU and CPU memory spaces Support classes: CamParameters: calibration params for the two sensors StereoParameters: calibration params for stereo processing resolution: dimensions of the images float3Struct: 3D point uchar3Struct: RGB pixel Current version: 0.9.3-beta for Linux and Windows, 0.9.3-alpha for Jetson TX1 21/05/2016Officine Robotiche 2016
  • 18. 18 WalterLucetti–myzhar@robot-home.it Stereolabs ZED engine Stereo results are available in GPU memory Direct processing by CUDA or download to CPU memory for CPU processing 21/05/2016Officine Robotiche 2016
  • 19. 19 WalterLucetti–myzhar@robot-home.it Stereolabs ZED tools ZED Explorer ZED Depth Viewer 21/05/2016Officine Robotiche 2016
  • 20. 20 WalterLucetti–myzhar@robot-home.it Simple code example //ZED Includes #include <zed/Camera.hpp> int main(int argc, char **argv) { sl::zed::SENSING_MODE dm_type = sl::zed::RAW; sl::zed::Camera* zed; // Camera object creation zed = new sl::zed::Camera(sl::zed::HD2K); // We can use different resolutions: // HD2K HD1080 HD720 VGA // Sizes of the images int width = zed->getImageSize().width; int height = zed->getImageSize().height; // Initialization of the camera sl::zed::ERRCODE err = zed->init( sl::zed::MODE::PERFORMANCE ); if (err != sl::zed::SUCCESS) { delete zed; return 1; } // Stereo processing zed->grab(dm_type); // CPU Data sl::zed::Mat depth_cpu; sl::zed::Mat left_cpu; sl::zed::Mat right_cpu; // Data acquiring on CPU memory depth_cpu = zed->retrieveMeasure(sl::MEASURE::DEPTH); left_cpu = zed->retrieveImage(sl::SIDE::LEFT); right_cpu = zed->retrieveImage(sl::SIDE::RIGHT); // GPU Data sl::zed::Mat depth_gpu; sl::zed::Mat left_gpu; sl::zed::Mat right_gpu; // Data acquiring on GPU memory depth_cpu = zed->retrieveMeasure_gpu(sl::MEASURE::DEPTH); left_cpu = zed->retrieveImage_gpu(sl::SIDE::LEFT); right_cpu = zed->retrieveImage_gpu(sl::SIDE::RIGHT); return 0; } 21/05/2016Officine Robotiche 2016
  • 23. 23 WalterLucetti–myzhar@robot-home.it Conclusions With Stereo Vision A robot can see obstacles A robot can measure obstacles A robot can localize obstacles Stereo Vision requires a lot of mathematical elaborations a lot of computational power a solid mechanical structure so Nvidia Jetson TX1 with Stereolabs ZED is a really amazing solution for Robotic Stereo Vision! 21/05/2016Officine Robotiche 2016
  • 24. The End! OR THANK YOU FOR YOUR ATTENTION! Walter Lucetti http://myzharbot.robot-home.it GitHub @myzhar Youtube Myzhar Twitter @myzhar Facebook robothome 24 21/05/2016Officine Robotiche 2016