1
High-resolution 3D Reconstruction
on a Mobile Processor
Michael Mangan
Senior Product Manager
Qualcomm Technologies, Inc.
May 3, 2016
2
30
years of driving
the evolution
of wireless
#1
in 3G/4G
LTE modem
#1
in RF
Source: Qualcomm Incorporated data. Currently, Qualcomm semiconductors are products of Qualcomm Technologies, Inc. or its subsidiaries
IHS, Jan. ’16 (RF); Strategy Analytics, Dec. ’15 (modem, AP)
3
Qualcomm® Snapdragon™ Chipsets
drive new experiences
Context aware
computing
Machine learning
Computing
performance
VR / AR - beyond small screen
360 degree camera
3D and low-light photography
Security
Biometric sensor
Virtual SIM/Multiple devices
Ultra HD VoLTE / audio quality
4G+
Wi-Fi
Superior converged connectivity
Qualcomm Snapdragon is a product of Qualcomm Technologies, Inc.
Gaming
4
What is Active Depth Capture?
Depth provides z-dimension to a scene; a photograph provides only x-y information.
Two ways to capture depth information from a scene or object:
Passive Depth Capture:
(No IR Transmitter)
• Stereo RGB cameras can passively
generate a depth map of a scene.
• Baseline separation between the cameras
causes parallax between the two received
images.
• Parallax can be used to infer a disparity
estimate, which in turn is used to
generate a depth map.
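The disparity-to-depth step can be sketched with the standard triangulation relation for a rectified stereo pair, Z = f · B / d. This is a minimal illustration, not the presenter's implementation: the function name, the 500 px focal length, and the 5 cm baseline are assumed values chosen for the example.

```python
def depth_from_disparity(disparity_px, focal_px, baseline_m):
    """Depth in metres of one pixel in a rectified stereo pair: Z = f * B / d.

    Returns None where the disparity is zero or negative, i.e. there is
    no valid match or the point is effectively at infinity.
    """
    if disparity_px <= 0:
        return None
    return focal_px * baseline_m / disparity_px


# Hypothetical camera: 500 px focal length, 5 cm baseline (assumed values).
disparity_row = [0.0, 5.0, 10.0, 25.0]
depth_row = [depth_from_disparity(d, 500.0, 0.05) for d in disparity_row]
print(depth_row)
```

Depth is inversely proportional to disparity, so a wider baseline or a longer focal length gives finer depth resolution at a given distance.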
Active Depth Capture:
(IR Transmitter)
• An IR laser illuminates the scene; various
techniques are used to infer depth
from the reflected light.
» Time of Flight
» Active Stereo
» Structured Light
5
Depth from Structured Light—
Technology Overview
Depth information is generated
using a structured light sensor
• Coded pattern is projected onto the scene
using near infrared (NIR) light
• NIR camera receives the reflected,
distorted pattern
• Codes in the received image are matched
against known codes in the transmitted
pattern
• Depth at each code location estimated
from the disparity between original and
received code positions, leading to
a dense depth map
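The matching step can be illustrated with a toy one-dimensional version. The sketch below assumes a binary pattern in which every 3-symbol window is unique, exact (noise-free) matching, and illustrative focal length and baseline values. A real sensor uses a two-dimensional code and must tolerate distortion, so every name and number here is hypothetical.

```python
def build_code_index(pattern, window):
    """Map each length-`window` code in the transmitted pattern to its position."""
    index = {}
    for pos in range(len(pattern) - window + 1):
        code = tuple(pattern[pos:pos + window])
        if code in index:
            raise ValueError("every code in the pattern must be unique")
        index[code] = pos
    return index


def decode_row(received, index, window, focal_px, baseline_m):
    """Return {received_position: depth_m} for every window matching a known code."""
    depths = {}
    for pos in range(len(received) - window + 1):
        code = tuple(received[pos:pos + window])
        if code not in index:
            continue  # not a known code (distorted, or outside the pattern)
        disparity = pos - index[code]  # shift between transmitted and received
        if disparity > 0:
            depths[pos] = focal_px * baseline_m / disparity
    return depths


# Assumed pattern: all six 3-symbol windows are distinct.
pattern = [0, 0, 0, 1, 0, 1, 1, 1]
index = build_code_index(pattern, window=3)

# Simulated received row: the pattern appears shifted by two pixels.
received = [1, 1] + pattern
depths = decode_row(received, index, window=3, focal_px=500.0, baseline_m=0.05)
print(depths)
```

Because each code is unique, a single dictionary lookup identifies where a received code originated in the transmitted pattern, and the positional shift is the disparity used for triangulation.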
NIR image
Depth map
coded pattern
transmitter receiver
6
Scanner Flow in Action
7
Scanner Block Diagram
Scan
Starts
Color + Depth
(Structured Light Based
Depth Generation)
Live 3D
Renderer/Viewer
USER MOVES USER STOPS
Scan
Finishes
USE CASE:
3D Printing, Social
Networking, Gaming
Avatars, etc.
Computer Vision Based
Initial Pose Estimation
Inertial Motion
Sensor Fusion
Bundle
Adjustment
HD Texture
Generation
3D Mesh
Generation
Color
Correction
TRACKING/ALIGNMENT
8
Scanner System Architecture
3D Scanner Application
RGBD Image Grabber
Camera2 API
Depth JNI
3D Scanner JNI
Depth Engine
(DSP/HVX)
RGB
Grabber
NIR
Grabber
3D Scanner Engine
(CPU/GPU)
SysFS Camera HAL Camera HAL
Raw
RGB Data
Raw
NIR Data
Driver
Laser NIR Camera
RGB
Camera
Active Sensing Module
Note: Arrows indicate
dependency, not dataflow
Layers: Apps (Java) / Middleware (C++) / Drivers (C) / Hardware
9
3DR Workload Summary—
Running on Snapdragon 820
3D Reconstruction requires running
several computationally demanding
processes simultaneously:
1. Camera Pose Tracking
2. Sensor Fusion
3. Bundle Adjustment
4. Rendering
5. Mesh Generation
6. Texture Mapping
7. Structured Light Sensor Decoding
Thanks to the heterogeneous compute
architecture of the Snapdragon 820, all of
this runs at 15 FPS:
Kryo (CPU/NEON):
• Pose Tracking
• Bundle Adjustment
• Sensor Fusion
• Mesh Generation
Adreno (GPU):
• Rendering
• Texture Mapping
Hexagon (DSP/HVX):
• Depth from Structured Light
Spectra ISP:
• RGB sensor processing
• Depth sensor interface
3DR powered by
Snapdragon 820
10
Lessons Learned
The highest-quality 3DR requires
great HW and SW. Efficient CV
SW algorithms, operating with
accurate depth sensors and
power-efficient processors,
bring commercial-grade 3DR
to mobile platforms.
Running 3DR on mobile
requires tuning algorithms for
power as well as performance.
Power-efficient heterogeneous
processors are mandatory for
3DR to run within mobile power
and thermal envelopes.
The heterogeneous
processing cores on
Snapdragon 820 enable a
high-quality 3DR experience
on mobile platforms.
11
3DR Algorithmic Details
13
Based on the Iterative Closest Point (ICP) concept: minimize the sum of squared pixel
intensity differences (errors) plus the weighted sum of squared depth errors to align images
$$\text{cost} = \sum (\text{Pixel Intensity Error})^2 + \lambda \sum (\text{Pixel Depth Error})^2$$
Pixel Intensity Error Depth Error
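As a concrete reading of the cost above, the sketch below evaluates it for given per-pixel residuals. It assumes the residuals have already been computed after warping one frame into the other. The function name and sample values are illustrative, and λ (`lam`) is a tuning weight whose actual value the slides do not state.

```python
def alignment_cost(intensity_errors, depth_errors, lam):
    """Sum of squared intensity errors plus lam times the sum of squared depth errors."""
    photometric = sum(e * e for e in intensity_errors)
    geometric = sum(e * e for e in depth_errors)
    return photometric + lam * geometric


# Illustrative residuals for two pixels (assumed values).
cost = alignment_cost([1.0, -2.0], [0.5, 0.5], lam=4.0)
print(cost)
```

A larger λ makes the alignment rely more on geometry, which helps on poorly textured surfaces; a smaller λ favours appearance, which helps on flat but textured ones.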
• F. Steinbruecker et al., “Real-Time Visual Odometry from Dense RGB-D Images”, ICCV 2011
• C. Kerl et al., “Dense Continuous-Time Tracking and Mapping with Rolling Shutter RGB-D Cameras”, ICCV 2015
Computer Vision Based
Pose Estimation (6-DOF)
14
Flow
Reference Image
Current Image
Warp
subtract
Repeat to
Minimize Error
– =
Warped Image Error Image
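The warp, subtract, repeat loop in this flow can be shown on a deliberately simplified problem. The sketch below assumes one-dimensional images and a warp that is only an integer pixel shift, and it replaces iterative optimisation over a full 6-DOF pose with an exhaustive search over candidate shifts. All names are hypothetical.

```python
def warp_shift(image, dx):
    """Warp a 1-D image by shifting it dx pixels to the right.

    Pixels shifted in from outside the image are None (no data).
    """
    n = len(image)
    return [image[i - dx] if 0 <= i - dx < n else None for i in range(n)]


def mean_squared_error(reference, warped):
    """Mean squared intensity error over pixels where the warped image has data."""
    diffs = [(r - w) ** 2 for r, w in zip(reference, warped) if w is not None]
    return sum(diffs) / len(diffs) if diffs else float("inf")


def estimate_shift(reference, current, max_shift):
    """Try each candidate warp, subtract, and keep the warp with the smallest error."""
    best_dx, best_err = 0, float("inf")
    for dx in range(-max_shift, max_shift + 1):
        err = mean_squared_error(reference, warp_shift(current, dx))
        if err < best_err:
            best_dx, best_err = dx, err
    return best_dx, best_err


reference = [0, 0, 1, 5, 9, 5, 1, 0, 0, 0]
current = [1, 5, 9, 5, 1, 0, 0, 0, 0, 0]  # same scene, seen two pixels to the left
print(estimate_shift(reference, current, max_shift=3))
```

In the full problem the warp depends on the depth map and the camera pose, and the search is typically replaced by a gradient-based update of the pose parameters.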
Computer Vision Based
Pose Estimation (6-DOF)
15
Example
Computer Vision Based
Pose Estimation (6-DOF)
16
The vision pose will likely contain some errors.
• One cause is a lack of geometric and textural structure in the scene
This can be overcome by fusing the vision pose with data from the tablet's Inertial Measurement Unit (IMU)
Using the Extended Kalman Filter (EKF) concept, poses are predicted from the IMU (gyro and accelerometer) readings.
These predictions are then fused with the vision pose in the EKF update step to obtain the fused pose estimate
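The predict and update steps can be sketched with a scalar Kalman filter that tracks a single rotation angle. This is a simplification: with a linear model no linearisation is needed, whereas an EKF linearises a nonlinear motion and measurement model around the current estimate. The noise values, time step, and sample readings below are assumptions for illustration only.

```python
def predict(angle, variance, gyro_rate, dt, process_noise):
    """Predict step: integrate the gyro rate; uncertainty grows."""
    return angle + gyro_rate * dt, variance + process_noise


def update(angle, variance, vision_angle, vision_noise):
    """Update step: blend in the vision-based angle; uncertainty shrinks."""
    gain = variance / (variance + vision_noise)
    fused = angle + gain * (vision_angle - angle)
    return fused, (1.0 - gain) * variance


# Assumed readings: (gyro rate in rad/s, vision-based angle in rad) per frame.
angle, variance = 0.0, 1.0
for gyro_rate, vision_angle in [(0.10, 0.012), (0.10, 0.019), (0.10, 0.031)]:
    angle, variance = predict(angle, variance, gyro_rate, dt=0.1, process_noise=0.01)
    angle, variance = update(angle, variance, vision_angle, vision_noise=0.04)
print(angle, variance)
```

The gain decides how far the estimate moves toward the vision measurement: when the vision pose is unreliable (large `vision_noise`), the filter leans on the IMU prediction instead.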
Motion Sensor Fusion
• M. Li et al., “3-D motion estimation and online temporal calibration for camera-IMU systems”, ICRA 2013
• S. Weiss et al., “Real-Time Metric State Estimation for Modular Vision-Inertial Systems”, ICRA 2011
Extended
Kalman Filter
(Predict)
Vision Based
Pose
Estimation
Extended
Kalman Filter
(Update)
Gyro
Accelerometer
17
Fused Poses need to be refined in order
to reduce the visual errors.
• Reason: Poses are being computed locally,
“between consecutive frames”
We use bundle adjustment to find optimal
global or semi-global poses
• Construct links (red lines) between captured frames
(blue nodes). Links are established if the re-projection
between captured images is above a certain threshold
• Jointly optimize the connected nodes
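Joint optimisation over linked frames can be illustrated with a one-dimensional pose graph. The sketch below assumes each pose is a single number, each link is a measured offset between two poses, and the least-squares problem is solved by repeated coordinate updates. Full bundle adjustment optimises 6-DOF poses against re-projection error, so this is only an analogy with hypothetical names and data.

```python
def optimize_poses(num_nodes, links, sweeps=200):
    """Jointly refine 1-D poses so that all links agree as well as possible.

    links: list of (i, j, offset), meaning pose[j] - pose[i] was measured as offset.
    Node 0 is held at 0.0 to fix the global reference.
    """
    poses = [0.0] * num_nodes
    for _ in range(sweeps):
        for k in range(1, num_nodes):
            implied = []  # values for pose k suggested by each attached link
            for i, j, offset in links:
                if j == k:
                    implied.append(poses[i] + offset)
                elif i == k:
                    implied.append(poses[j] - offset)
            if implied:
                # The mean of the implied values minimises the squared error for node k.
                poses[k] = sum(implied) / len(implied)
    return poses


# Two frame-to-frame links plus one longer link that disagrees with them (drift).
links = [(0, 1, 1.0), (1, 2, 1.0), (0, 2, 2.3)]
poses = optimize_poses(3, links)
print(poses)
```

The local links alone would place the frames at 1.0 and 2.0; adding the longer link spreads the 0.3 disagreement across all three constraints instead of leaving it in one place.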
Bundle Adjustment
• V. Indelman et al., “Incremental Light Bundle Adjustment for Robotics Navigation”, IROS 2013
• R. Newcombe et al., “KinectFusion: Real-Time Dense Surface Mapping and Tracking”, IEEE ISMAR 2011
• K. Konolige et al., “FrameSLAM: From Bundle Adjustment to Real-Time Visual Mapping”, IEEE Transactions on Robotics 2008
18
Having computed the 3D points, we need to generate the 3D surface mesh that best describes the
scene while reducing the noise
Many surface reconstruction methods are available in the literature: Moving Least Squares
(MLS), Truncated Signed Distance Function (TSDF), and Poisson. Any can be used in theory. TSDF is the least computationally demanding;
MLS and Poisson are more demanding
The chosen method is then followed by marching cubes to generate the mesh
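To make the TSDF idea concrete, the sketch below fuses noisy depth readings along a single ray and then finds the surface as the zero crossing of the fused values, which is the one-dimensional analogue of what marching cubes does in a 3D voxel grid. The voxel spacing, truncation distance, and measurements are assumed values, and the function names are hypothetical.

```python
def integrate(tsdf, weights, voxel_positions, measured_depth, trunc):
    """Fuse one depth measurement into a 1-D TSDF by running weighted average."""
    for k, pos in enumerate(voxel_positions):
        sdf = measured_depth - pos       # positive in front of the surface
        if sdf < -trunc:
            continue                     # far behind the surface: not observed
        value = min(sdf, trunc) / trunc  # truncate and normalise to [-1, 1]
        tsdf[k] = (tsdf[k] * weights[k] + value) / (weights[k] + 1)
        weights[k] += 1


def extract_surface(tsdf, weights, voxel_positions):
    """Locate the zero crossing between two observed voxels by linear interpolation."""
    for k in range(len(tsdf) - 1):
        if weights[k] and weights[k + 1] and tsdf[k] > 0 >= tsdf[k + 1]:
            t = tsdf[k] / (tsdf[k] - tsdf[k + 1])
            return voxel_positions[k] + t * (voxel_positions[k + 1] - voxel_positions[k])
    return None


# Assumed setup: voxels every 0.1 m along the ray, 0.3 m truncation distance.
voxel_positions = [i * 0.1 for i in range(11)]
tsdf = [0.0] * len(voxel_positions)
weights = [0] * len(voxel_positions)
for depth in (0.55, 0.51, 0.53):         # three noisy readings of one surface
    integrate(tsdf, weights, voxel_positions, depth, trunc=0.3)
print(extract_surface(tsdf, weights, voxel_positions))
```

Each update is a constant amount of work per voxel, which is one reason TSDF fusion is cheaper than fitting-based methods such as MLS or Poisson reconstruction. The averaging also suppresses measurement noise.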
Surface Reconstruction / Mesh Generation
• S. Fleishman et al., “Robust Moving Least-squares Fitting with Sharp Features”, ACM SIGGRAPH 2005
• M. Kazhdan et al., “Poisson Surface Reconstruction”, Symposium on Geometry Processing 2006
• R. Newcombe et al., “KinectFusion: Real-Time Dense Surface Mapping and Tracking”, IEEE ISMAR 2011
19
Captured color images can suffer from color casts for many reasons, such as differing light
sources. These casts need to be corrected so that the overall color of the 3D model is consistent
Solution: Estimate Color Casts & Remove them
• Gray points provide best estimate about color
• Estimate gray pixels & shift the appropriate channel gain to bring them to neutral gray
• Repeat until convergence
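The estimate, shift, repeat loop can be sketched in RGB space as follows. This is a simplified illustration, not the cited algorithm: the gray-pixel test, the tolerance, and the choice to keep the green gain fixed are all assumptions made for the example, and the pixel values are synthetic.

```python
def is_near_gray(pixel, tolerance):
    """True if all three channels lie within `tolerance` (relative) of their mean."""
    r, g, b = pixel
    mean = (r + g + b) / 3.0
    return mean > 0 and max(abs(r - mean), abs(g - mean), abs(b - mean)) <= tolerance * mean


def white_balance(pixels, tolerance=0.3, max_iters=20, eps=1e-4):
    """Iteratively estimate red and blue gains that pull near-gray pixels to neutral."""
    gain_r, gain_b = 1.0, 1.0
    for _ in range(max_iters):
        balanced = [(r * gain_r, g, b * gain_b) for r, g, b in pixels]
        gray = [p for p in balanced if is_near_gray(p, tolerance)]
        if not gray:
            break
        mean_r = sum(p[0] for p in gray) / len(gray)
        mean_g = sum(p[1] for p in gray) / len(gray)
        mean_b = sum(p[2] for p in gray) / len(gray)
        if mean_r == 0 or mean_b == 0:
            break
        step_r, step_b = mean_g / mean_r, mean_g / mean_b
        gain_r *= step_r
        gain_b *= step_b
        if abs(step_r - 1.0) < eps and abs(step_b - 1.0) < eps:
            break  # gains have converged
    return gain_r, gain_b


# Synthetic data: three gray patches with a warm cast (R x1.2, B x0.8) plus one red pixel.
pixels = [(0.24, 0.2, 0.16), (0.6, 0.5, 0.4), (0.96, 0.8, 0.64), (0.9, 0.1, 0.1)]
gain_r, gain_b = white_balance(pixels)
print(gain_r, gain_b)
```

The strongly coloured pixel never passes the gray test, so it does not bias the estimate; only pixels that are plausibly gray vote on the cast.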
Color Correction
• J. Huo et al., “Robust Automatic White Balance Algorithm Using Gray Color Points in Images”, IEEE Trans. Consumer Electronics, 2006
BEFORE
AFTER
20
The captured images need to be joined in one or more images called Texture Maps
Texture mapping can be thought of as “3D stitching of the images on the 3D model”
Obtaining the Texture Map generally consists of two steps:
• Determine where each pixel goes on the 3D model (texture coordinates)
• Determine the color of each pixel given a sequence of input images
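The second step, choosing a color from several input images, can be sketched as follows. The sketch assumes pinhole cameras with no rotation looking along +z, and it picks the image whose camera faces the surface most directly. That selection rule is a common heuristic and an assumption here, not necessarily the method used in the presentation, and all names and values are hypothetical.

```python
import math


def project(point, cam_pos, focal_px, cx, cy):
    """Pinhole projection for a camera at cam_pos looking along +z (no rotation)."""
    x, y, z = (point[i] - cam_pos[i] for i in range(3))
    if z <= 0:
        return None  # point is behind the camera
    return (focal_px * x / z + cx, focal_px * y / z + cy)


def view_score(point, normal, cam_pos):
    """Cosine of the angle between the surface normal and the direction to the camera."""
    to_cam = [cam_pos[i] - point[i] for i in range(3)]
    length = math.sqrt(sum(c * c for c in to_cam))
    return sum(n * c for n, c in zip(normal, to_cam)) / length


def pick_color(point, normal, views, focal_px, cx, cy):
    """views: list of (cam_pos, image), where image[row][col] is an (r, g, b) tuple."""
    best = None
    for cam_pos, image in views:
        uv = project(point, cam_pos, focal_px, cx, cy)
        if uv is None:
            continue
        col, row = int(round(uv[0])), int(round(uv[1]))
        if not (0 <= row < len(image) and 0 <= col < len(image[0])):
            continue  # the point projects outside this image
        score = view_score(point, normal, cam_pos)
        if best is None or score > best[0]:
            best = (score, image[row][col])
    return None if best is None else best[1]


# Two assumed 3x3 images: camera A faces the surface head-on, camera B sees it obliquely.
red = [[(255, 0, 0)] * 3 for _ in range(3)]
blue = [[(0, 0, 255)] * 3 for _ in range(3)]
views = [((0.0, 0.0, 0.0), red), ((1.0, 0.0, 0.0), blue)]
color = pick_color((0.0, 0.0, 2.0), (0.0, 0.0, -1.0), views, focal_px=1.0, cx=1.0, cy=1.0)
print(color)
```

Head-on views suffer less foreshortening, so they tend to give sharper texture; practical systems usually also blend across views to hide seams.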
Texture Mapping
• P. Debevec et al., “Efficient View-Dependent Image-Based Rendering with Projective Texture-Mapping”, Eurographics Rendering Workshop 1998
• M. Waechter et al., “Let There Be Color! Large-Scale Texturing of 3D Reconstructions”, ECCV 2015
Input Camera Images Output Texture Map Colored 3D Model
Using the Texture Map
21
Some 3DR Examples
22
Using our system we can scan
a small toy, human face/body
or an object
All of this can happen easily
on the Snapdragon 820, thanks
to its powerful heterogeneous
computational framework
Some Results
Thank you
For more information, visit us at:
www.qualcomm.com & www.qualcomm.com/blog
Nothing in these materials is an offer to sell any of the components or devices referenced herein.
©2016 Qualcomm Technologies, Inc. and/or its affiliated companies. All Rights Reserved.
Qualcomm and Snapdragon are trademarks of Qualcomm Incorporated, registered in the United States and other countries. Why Wait is a trademark of Qualcomm
Incorporated. Other products and brand names may be trademarks or registered trademarks of their respective owners.
References in this presentation to “Qualcomm” may mean Qualcomm Incorporated, Qualcomm Technologies, Inc., and/or other subsidiaries or business units within
the Qualcomm corporate structure, as applicable.

Qualcomm Incorporated includes Qualcomm’s licensing business, QTL, and the vast majority of its patent
portfolio. Qualcomm Technologies, Inc., a wholly-owned subsidiary of Qualcomm Incorporated, operates, along with its subsidiaries, substantially all of Qualcomm’s
engineering, research and development functions, and substantially all of its product and services businesses, including its semiconductor business, QCT.
23

More Related Content

What's hot

“CMOS Image Sensors: A Guide to Building the Eyes of a Vision System,” a Pres...
“CMOS Image Sensors: A Guide to Building the Eyes of a Vision System,” a Pres...“CMOS Image Sensors: A Guide to Building the Eyes of a Vision System,” a Pres...
“CMOS Image Sensors: A Guide to Building the Eyes of a Vision System,” a Pres...
Edge AI and Vision Alliance
 
“Alternative Image Sensors for Intelligent In-Cabin Monitoring, Home Security...
“Alternative Image Sensors for Intelligent In-Cabin Monitoring, Home Security...“Alternative Image Sensors for Intelligent In-Cabin Monitoring, Home Security...
“Alternative Image Sensors for Intelligent In-Cabin Monitoring, Home Security...
Edge AI and Vision Alliance
 
3D Perception for Autonomous Driving - Datasets and Algorithms -
3D Perception for Autonomous Driving - Datasets and Algorithms -3D Perception for Autonomous Driving - Datasets and Algorithms -
3D Perception for Autonomous Driving - Datasets and Algorithms -
Kazuyuki Miyazawa
 
Lidar in the adverse weather: dust, fog, snow and rain
Lidar in the adverse weather: dust, fog, snow and rainLidar in the adverse weather: dust, fog, snow and rain
Lidar in the adverse weather: dust, fog, snow and rain
Yu Huang
 
Raskar, Rank and Sparsity in Computational Photography and Computational Ligh...
Raskar, Rank and Sparsity in Computational Photography and Computational Ligh...Raskar, Rank and Sparsity in Computational Photography and Computational Ligh...
Raskar, Rank and Sparsity in Computational Photography and Computational Ligh...
Camera Culture Group, MIT Media Lab
 
Computer Vision with Deep Learning
Computer Vision with Deep LearningComputer Vision with Deep Learning
Computer Vision with Deep Learning
Capgemini
 
Virtual Retinal Display: their falling cost and rising performance
Virtual Retinal Display: their falling cost and rising performanceVirtual Retinal Display: their falling cost and rising performance
Virtual Retinal Display: their falling cost and rising performance
Jeffrey Funk
 
SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)
SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)
SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)
Matthew O'Toole
 
Recent Progress on Single-Image Super-Resolution
Recent Progress on Single-Image Super-ResolutionRecent Progress on Single-Image Super-Resolution
Recent Progress on Single-Image Super-Resolution
Hiroto Honda
 
Kinect krishna kumar-itkan
Kinect krishna kumar-itkanKinect krishna kumar-itkan
Kinect krishna kumar-itkan
Pat Maher
 
iMinds insights - 3D Visualization Technologies
iMinds insights - 3D Visualization TechnologiesiMinds insights - 3D Visualization Technologies
iMinds insights - 3D Visualization Technologies
iMindsinsights
 
"The Evolution of Depth Sensing: From Exotic to Ubiquitous," a Presentation f...
"The Evolution of Depth Sensing: From Exotic to Ubiquitous," a Presentation f..."The Evolution of Depth Sensing: From Exotic to Ubiquitous," a Presentation f...
"The Evolution of Depth Sensing: From Exotic to Ubiquitous," a Presentation f...
Edge AI and Vision Alliance
 
“Modern Machine Vision from Basics to Advanced Deep Learning,” a Presentation...
“Modern Machine Vision from Basics to Advanced Deep Learning,” a Presentation...“Modern Machine Vision from Basics to Advanced Deep Learning,” a Presentation...
“Modern Machine Vision from Basics to Advanced Deep Learning,” a Presentation...
Edge AI and Vision Alliance
 
Sensors On 3d Digitization
Sensors On 3d DigitizationSensors On 3d Digitization
Sensors On 3d DigitizationRajan Kumar
 
Tobias Rothermel (pmd technologies): pmd ToF – the Swiss Army Knife of 3D Sen...
Tobias Rothermel (pmd technologies): pmd ToF – the Swiss Army Knife of 3D Sen...Tobias Rothermel (pmd technologies): pmd ToF – the Swiss Army Knife of 3D Sen...
Tobias Rothermel (pmd technologies): pmd ToF – the Swiss Army Knife of 3D Sen...
AugmentedWorldExpo
 
A Fast Single-Pixel Laser Imager for VR/AR Headset Tracking
A Fast Single-Pixel Laser Imager for VR/AR Headset TrackingA Fast Single-Pixel Laser Imager for VR/AR Headset Tracking
A Fast Single-Pixel Laser Imager for VR/AR Headset Tracking
Ping Hsu
 
Arindam batabyal literature reviewpresentation
Arindam batabyal literature reviewpresentationArindam batabyal literature reviewpresentation
Arindam batabyal literature reviewpresentationArindam Batabyal
 
“Building an Autonomous Detect-and-Avoid System for Commercial Drones,” a Pre...
“Building an Autonomous Detect-and-Avoid System for Commercial Drones,” a Pre...“Building an Autonomous Detect-and-Avoid System for Commercial Drones,” a Pre...
“Building an Autonomous Detect-and-Avoid System for Commercial Drones,” a Pre...
Edge AI and Vision Alliance
 
Laser Beam Scanning LiDAR: MEMS-Driven 3D Sensing Automotive Applications fro...
Laser Beam Scanning LiDAR: MEMS-Driven 3D Sensing Automotive Applications fro...Laser Beam Scanning LiDAR: MEMS-Driven 3D Sensing Automotive Applications fro...
Laser Beam Scanning LiDAR: MEMS-Driven 3D Sensing Automotive Applications fro...
Jari Honkanen
 

What's hot (20)

“CMOS Image Sensors: A Guide to Building the Eyes of a Vision System,” a Pres...
“CMOS Image Sensors: A Guide to Building the Eyes of a Vision System,” a Pres...“CMOS Image Sensors: A Guide to Building the Eyes of a Vision System,” a Pres...
“CMOS Image Sensors: A Guide to Building the Eyes of a Vision System,” a Pres...
 
“Alternative Image Sensors for Intelligent In-Cabin Monitoring, Home Security...
“Alternative Image Sensors for Intelligent In-Cabin Monitoring, Home Security...“Alternative Image Sensors for Intelligent In-Cabin Monitoring, Home Security...
“Alternative Image Sensors for Intelligent In-Cabin Monitoring, Home Security...
 
3D Perception for Autonomous Driving - Datasets and Algorithms -
3D Perception for Autonomous Driving - Datasets and Algorithms -3D Perception for Autonomous Driving - Datasets and Algorithms -
3D Perception for Autonomous Driving - Datasets and Algorithms -
 
Lidar in the adverse weather: dust, fog, snow and rain
Lidar in the adverse weather: dust, fog, snow and rainLidar in the adverse weather: dust, fog, snow and rain
Lidar in the adverse weather: dust, fog, snow and rain
 
Raskar, Rank and Sparsity in Computational Photography and Computational Ligh...
Raskar, Rank and Sparsity in Computational Photography and Computational Ligh...Raskar, Rank and Sparsity in Computational Photography and Computational Ligh...
Raskar, Rank and Sparsity in Computational Photography and Computational Ligh...
 
Computer Vision with Deep Learning
Computer Vision with Deep LearningComputer Vision with Deep Learning
Computer Vision with Deep Learning
 
Virtual Retinal Display: their falling cost and rising performance
Virtual Retinal Display: their falling cost and rising performanceVirtual Retinal Display: their falling cost and rising performance
Virtual Retinal Display: their falling cost and rising performance
 
SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)
SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)
SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)
 
Recent Progress on Single-Image Super-Resolution
Recent Progress on Single-Image Super-ResolutionRecent Progress on Single-Image Super-Resolution
Recent Progress on Single-Image Super-Resolution
 
3d scanning techniques
3d scanning techniques3d scanning techniques
3d scanning techniques
 
Kinect krishna kumar-itkan
Kinect krishna kumar-itkanKinect krishna kumar-itkan
Kinect krishna kumar-itkan
 
iMinds insights - 3D Visualization Technologies
iMinds insights - 3D Visualization TechnologiesiMinds insights - 3D Visualization Technologies
iMinds insights - 3D Visualization Technologies
 
"The Evolution of Depth Sensing: From Exotic to Ubiquitous," a Presentation f...
"The Evolution of Depth Sensing: From Exotic to Ubiquitous," a Presentation f..."The Evolution of Depth Sensing: From Exotic to Ubiquitous," a Presentation f...
"The Evolution of Depth Sensing: From Exotic to Ubiquitous," a Presentation f...
 
“Modern Machine Vision from Basics to Advanced Deep Learning,” a Presentation...
“Modern Machine Vision from Basics to Advanced Deep Learning,” a Presentation...“Modern Machine Vision from Basics to Advanced Deep Learning,” a Presentation...
“Modern Machine Vision from Basics to Advanced Deep Learning,” a Presentation...
 
Sensors On 3d Digitization
Sensors On 3d DigitizationSensors On 3d Digitization
Sensors On 3d Digitization
 
Tobias Rothermel (pmd technologies): pmd ToF – the Swiss Army Knife of 3D Sen...
Tobias Rothermel (pmd technologies): pmd ToF – the Swiss Army Knife of 3D Sen...Tobias Rothermel (pmd technologies): pmd ToF – the Swiss Army Knife of 3D Sen...
Tobias Rothermel (pmd technologies): pmd ToF – the Swiss Army Knife of 3D Sen...
 
A Fast Single-Pixel Laser Imager for VR/AR Headset Tracking
A Fast Single-Pixel Laser Imager for VR/AR Headset TrackingA Fast Single-Pixel Laser Imager for VR/AR Headset Tracking
A Fast Single-Pixel Laser Imager for VR/AR Headset Tracking
 
Arindam batabyal literature reviewpresentation
Arindam batabyal literature reviewpresentationArindam batabyal literature reviewpresentation
Arindam batabyal literature reviewpresentation
 
“Building an Autonomous Detect-and-Avoid System for Commercial Drones,” a Pre...
“Building an Autonomous Detect-and-Avoid System for Commercial Drones,” a Pre...“Building an Autonomous Detect-and-Avoid System for Commercial Drones,” a Pre...
“Building an Autonomous Detect-and-Avoid System for Commercial Drones,” a Pre...
 
Laser Beam Scanning LiDAR: MEMS-Driven 3D Sensing Automotive Applications fro...
Laser Beam Scanning LiDAR: MEMS-Driven 3D Sensing Automotive Applications fro...Laser Beam Scanning LiDAR: MEMS-Driven 3D Sensing Automotive Applications fro...
Laser Beam Scanning LiDAR: MEMS-Driven 3D Sensing Automotive Applications fro...
 

Viewers also liked

[Project report]digital speedometer with password enabled speed controlling(1...
[Project report]digital speedometer with password enabled speed controlling(1...[Project report]digital speedometer with password enabled speed controlling(1...
[Project report]digital speedometer with password enabled speed controlling(1...
Shivam Patel
 
Wujanz_Error_Projection_2011
Wujanz_Error_Projection_2011Wujanz_Error_Projection_2011
Wujanz_Error_Projection_2011Jacob Collstrup
 
Build Your Own 3D Scanner: Surface Reconstruction
Build Your Own 3D Scanner: Surface ReconstructionBuild Your Own 3D Scanner: Surface Reconstruction
Build Your Own 3D Scanner: Surface Reconstruction
Douglas Lanman
 
Registration 3
Registration 3Registration 3
Registration 3ngaybuonte
 
Dmitrii Tihonkih - The Iterative Closest Points Algorithm and Affine Transfo...
Dmitrii Tihonkih - The Iterative Closest Points Algorithm and  Affine Transfo...Dmitrii Tihonkih - The Iterative Closest Points Algorithm and  Affine Transfo...
Dmitrii Tihonkih - The Iterative Closest Points Algorithm and Affine Transfo...
AIST
 
Softassign and EM-ICP on GPU
Softassign and EM-ICP on GPUSoftassign and EM-ICP on GPU
Softassign and EM-ICP on GPU
Toru Tamaki
 

Viewers also liked (6)

[Project report]digital speedometer with password enabled speed controlling(1...
[Project report]digital speedometer with password enabled speed controlling(1...[Project report]digital speedometer with password enabled speed controlling(1...
[Project report]digital speedometer with password enabled speed controlling(1...
 
Wujanz_Error_Projection_2011
Wujanz_Error_Projection_2011Wujanz_Error_Projection_2011
Wujanz_Error_Projection_2011
 
Build Your Own 3D Scanner: Surface Reconstruction
Build Your Own 3D Scanner: Surface ReconstructionBuild Your Own 3D Scanner: Surface Reconstruction
Build Your Own 3D Scanner: Surface Reconstruction
 
Registration 3
Registration 3Registration 3
Registration 3
 
Dmitrii Tihonkih - The Iterative Closest Points Algorithm and Affine Transfo...
Dmitrii Tihonkih - The Iterative Closest Points Algorithm and  Affine Transfo...Dmitrii Tihonkih - The Iterative Closest Points Algorithm and  Affine Transfo...
Dmitrii Tihonkih - The Iterative Closest Points Algorithm and Affine Transfo...
 
Softassign and EM-ICP on GPU
Softassign and EM-ICP on GPUSoftassign and EM-ICP on GPU
Softassign and EM-ICP on GPU
 

Similar to "High-resolution 3D Reconstruction on a Mobile Processor," a Presentation from Qualcomm

Understanding the world in 3D with AI.pdf
Understanding the world in 3D with AI.pdfUnderstanding the world in 3D with AI.pdf
Understanding the world in 3D with AI.pdf
Qualcomm Research
 
Dataset creation for Deep Learning-based Geometric Computer Vision problems
Dataset creation for Deep Learning-based Geometric Computer Vision problemsDataset creation for Deep Learning-based Geometric Computer Vision problems
Dataset creation for Deep Learning-based Geometric Computer Vision problems
PetteriTeikariPhD
 
Visual pattern recognition in robotics
Visual pattern recognition in roboticsVisual pattern recognition in robotics
Visual pattern recognition in robotics
IAEME Publication
 
Lecture No. 1 introduction.pptx
Lecture No. 1 introduction.pptxLecture No. 1 introduction.pptx
Lecture No. 1 introduction.pptx
AlifahadHussain
 
Computer Graphics Notes
Computer Graphics NotesComputer Graphics Notes
Computer Graphics Notes
Gurpreet singh
 
FPGA Based Pattern Generation and Synchonization for High Speed Structured Li...
FPGA Based Pattern Generation and Synchonization for High Speed Structured Li...FPGA Based Pattern Generation and Synchonization for High Speed Structured Li...
FPGA Based Pattern Generation and Synchonization for High Speed Structured Li...
TELKOMNIKA JOURNAL
 
Tracking Chessboard Corners Using Projective Transformation for Augmented Rea...
Tracking Chessboard Corners Using Projective Transformation for Augmented Rea...Tracking Chessboard Corners Using Projective Transformation for Augmented Rea...
Tracking Chessboard Corners Using Projective Transformation for Augmented Rea...
CSCJournals
 
Visual pattern recognition in robotics
Visual pattern recognition in roboticsVisual pattern recognition in robotics
Visual pattern recognition in robotics
IAEME Publication
 
A Wireless Network Infrastructure Architecture for Rural Communities
A Wireless Network Infrastructure Architecture for Rural CommunitiesA Wireless Network Infrastructure Architecture for Rural Communities
A Wireless Network Infrastructure Architecture for Rural Communities
AIRCC Publishing Corporation
 
Complete End-to-End Low Cost Solution to a 3D Scanning System with Integrate...
 Complete End-to-End Low Cost Solution to a 3D Scanning System with Integrate... Complete End-to-End Low Cost Solution to a 3D Scanning System with Integrate...
Complete End-to-End Low Cost Solution to a 3D Scanning System with Integrate...
AIRCC Publishing Corporation
 
Complete End-to-End Low Cost Solution to a 3D Scanning System with Integrated...
Complete End-to-End Low Cost Solution to a 3D Scanning System with Integrated...Complete End-to-End Low Cost Solution to a 3D Scanning System with Integrated...
Complete End-to-End Low Cost Solution to a 3D Scanning System with Integrated...
AIRCC Publishing Corporation
 
COMPLETE END-TO-END LOW COST SOLUTION TO A 3D SCANNING SYSTEM WITH INTEGRATED...
COMPLETE END-TO-END LOW COST SOLUTION TO A 3D SCANNING SYSTEM WITH INTEGRATED...COMPLETE END-TO-END LOW COST SOLUTION TO A 3D SCANNING SYSTEM WITH INTEGRATED...
COMPLETE END-TO-END LOW COST SOLUTION TO A 3D SCANNING SYSTEM WITH INTEGRATED...
ijcsit
 
Interactive Full-Body Motion Capture Using Infrared Sensor Network
Interactive Full-Body Motion Capture Using Infrared Sensor Network  Interactive Full-Body Motion Capture Using Infrared Sensor Network
Interactive Full-Body Motion Capture Using Infrared Sensor Network
ijcga
 
Interactive full body motion capture using infrared sensor network
Interactive full body motion capture using infrared sensor networkInteractive full body motion capture using infrared sensor network
Interactive full body motion capture using infrared sensor network
ijcga
 
10.1109@ICCMC48092.2020.ICCMC-000167.pdf
10.1109@ICCMC48092.2020.ICCMC-000167.pdf10.1109@ICCMC48092.2020.ICCMC-000167.pdf
10.1109@ICCMC48092.2020.ICCMC-000167.pdf
mokamojah
 
Emerging vision technologies
Emerging vision technologiesEmerging vision technologies
Emerging vision technologies
Qualcomm Research
 
3D_Exploitation
3D_Exploitation3D_Exploitation
3D_ExploitationPeter Cho
 
IRJET- 3D Object Recognition of Car Image Detection
IRJET-  	  3D Object Recognition of Car Image DetectionIRJET-  	  3D Object Recognition of Car Image Detection
IRJET- 3D Object Recognition of Car Image Detection
IRJET Journal
 

Similar to "High-resolution 3D Reconstruction on a Mobile Processor," a Presentation from Qualcomm (20)

Understanding the world in 3D with AI.pdf
Understanding the world in 3D with AI.pdfUnderstanding the world in 3D with AI.pdf
Understanding the world in 3D with AI.pdf
 
Dataset creation for Deep Learning-based Geometric Computer Vision problems
Dataset creation for Deep Learning-based Geometric Computer Vision problemsDataset creation for Deep Learning-based Geometric Computer Vision problems
Dataset creation for Deep Learning-based Geometric Computer Vision problems
 
Visual pattern recognition in robotics
Visual pattern recognition in roboticsVisual pattern recognition in robotics
Visual pattern recognition in robotics
 
1604.08848v1
1604.08848v11604.08848v1
1604.08848v1
 
Lecture No. 1 introduction.pptx
Lecture No. 1 introduction.pptxLecture No. 1 introduction.pptx
Lecture No. 1 introduction.pptx
 
Computer Graphics Notes
Computer Graphics NotesComputer Graphics Notes
Computer Graphics Notes
 
FPGA Based Pattern Generation and Synchonization for High Speed Structured Li...
FPGA Based Pattern Generation and Synchonization for High Speed Structured Li...FPGA Based Pattern Generation and Synchonization for High Speed Structured Li...
FPGA Based Pattern Generation and Synchonization for High Speed Structured Li...
 
Tracking Chessboard Corners Using Projective Transformation for Augmented Rea...
Tracking Chessboard Corners Using Projective Transformation for Augmented Rea...Tracking Chessboard Corners Using Projective Transformation for Augmented Rea...
Tracking Chessboard Corners Using Projective Transformation for Augmented Rea...
 
Visual pattern recognition in robotics
Visual pattern recognition in roboticsVisual pattern recognition in robotics
Visual pattern recognition in robotics
 
A Wireless Network Infrastructure Architecture for Rural Communities
A Wireless Network Infrastructure Architecture for Rural CommunitiesA Wireless Network Infrastructure Architecture for Rural Communities
A Wireless Network Infrastructure Architecture for Rural Communities
 
Complete End-to-End Low Cost Solution to a 3D Scanning System with Integrate...
 Complete End-to-End Low Cost Solution to a 3D Scanning System with Integrate... Complete End-to-End Low Cost Solution to a 3D Scanning System with Integrate...
Complete End-to-End Low Cost Solution to a 3D Scanning System with Integrate...
 
Complete End-to-End Low Cost Solution to a 3D Scanning System with Integrated...
Complete End-to-End Low Cost Solution to a 3D Scanning System with Integrated...Complete End-to-End Low Cost Solution to a 3D Scanning System with Integrated...
Complete End-to-End Low Cost Solution to a 3D Scanning System with Integrated...
 
COMPLETE END-TO-END LOW COST SOLUTION TO A 3D SCANNING SYSTEM WITH INTEGRATED...
COMPLETE END-TO-END LOW COST SOLUTION TO A 3D SCANNING SYSTEM WITH INTEGRATED...COMPLETE END-TO-END LOW COST SOLUTION TO A 3D SCANNING SYSTEM WITH INTEGRATED...
COMPLETE END-TO-END LOW COST SOLUTION TO A 3D SCANNING SYSTEM WITH INTEGRATED...
 
Interactive Full-Body Motion Capture Using Infrared Sensor Network
Interactive Full-Body Motion Capture Using Infrared Sensor Network  Interactive Full-Body Motion Capture Using Infrared Sensor Network
Interactive Full-Body Motion Capture Using Infrared Sensor Network
 
Interactive full body motion capture using infrared sensor network
Interactive full body motion capture using infrared sensor networkInteractive full body motion capture using infrared sensor network
Interactive full body motion capture using infrared sensor network
 
10.1109@ICCMC48092.2020.ICCMC-000167.pdf
10.1109@ICCMC48092.2020.ICCMC-000167.pdf10.1109@ICCMC48092.2020.ICCMC-000167.pdf
10.1109@ICCMC48092.2020.ICCMC-000167.pdf
 
Portfolio
PortfolioPortfolio
Portfolio
 
Emerging vision technologies
Emerging vision technologiesEmerging vision technologies
Emerging vision technologies
 
3D_Exploitation
3D_Exploitation3D_Exploitation
3D_Exploitation
 
IRJET- 3D Object Recognition of Car Image Detection
IRJET-  	  3D Object Recognition of Car Image DetectionIRJET-  	  3D Object Recognition of Car Image Detection
IRJET- 3D Object Recognition of Car Image Detection
 

"High-resolution 3D Reconstruction on a Mobile Processor," a Presentation from Qualcomm

  • 1. High-resolution 3D Reconstruction on a Mobile Processor. Michael Mangan, Senior Product Manager, Qualcomm Technologies, Inc. May 3, 2016.
  • 2. 30 years of driving the evolution of wireless. #1 in 3G/4G LTE modem. #1 in RF. Source: Qualcomm Incorporated data. Currently, Qualcomm semiconductors are products of Qualcomm Technologies, Inc. or its subsidiaries. IHS, Jan. ’16 (RF); Strategy Analytics, Dec. ’15 (modem, AP).
  • 3. Qualcomm® Snapdragon™ chipsets drive new experiences: context-aware computing; machine learning; computing performance; VR / AR beyond the small screen; 360-degree camera; 3D and low-light photography; security; biometric sensor; virtual SIM / multiple devices; Ultra HD VoLTE / audio quality; 4G+; Wi-Fi; superior converged connectivity; gaming. Qualcomm Snapdragon is a product of Qualcomm Technologies, Inc.
  • 4. What is Active Depth Capture? Depth provides the z-dimension of a scene; a photograph provides only x-y information. There are two ways to capture depth information from a scene or object. Passive Depth Capture (no IR transmitter): • Stereo RGB cameras can passively generate a depth map of a scene. • Baseline separation between the cameras causes parallax between the two received images. • Parallax can be used to infer a disparity estimate, which in turn is used to generate a depth map. Active Depth Capture (IR transmitter): • An IR laser transmits, and various techniques are used to infer depth from the reflected light: » Time of Flight » Active Stereo » Structured Light
  • 5. Depth from Structured Light: Technology Overview. Depth information is generated using a structured light sensor. • A coded pattern is projected onto the scene using near-infrared (NIR) light. • An NIR camera receives the reflected, distorted pattern. • Codes in the received image are matched against known codes in the transmitted pattern. • Depth at each code location is estimated from the disparity between the original and received code positions, leading to a dense depth map. Figure labels: transmitter, coded pattern, receiver, NIR image, depth map.
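The disparity-to-depth step mentioned on slides 4 and 5 can be sketched as follows. This is a minimal illustration, not the depth engine described in the talk: it assumes a rectified pinhole geometry in which the transmitter and receiver are separated by a horizontal baseline, so the standard triangulation relation Z = f · B / d applies. The function names and parameters (`focal_px`, `baseline_m`) are hypothetical and do not come from the slides.

```python
def depth_from_disparity(disparity_px, focal_px, baseline_m):
    """Triangulated depth in metres for one matched code position.

    Assumes a rectified transmitter/receiver pair (an assumption of this
    sketch). Returns None when the disparity is not positive, i.e. when no
    valid match was found.
    """
    if disparity_px <= 0:
        return None
    return focal_px * baseline_m / disparity_px


def depth_map(disparities, focal_px, baseline_m):
    """Apply the triangulation to a 2-D list of per-pixel disparities."""
    return [[depth_from_disparity(d, focal_px, baseline_m) for d in row]
            for row in disparities]
```

Larger disparities map to nearer surfaces, which is why depth resolution degrades with distance for a fixed baseline.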
  • 7. Scanner Block Diagram. Blocks: Scan Starts; Color + Depth (Structured Light Based Depth Generation); Live 3D Renderer/Viewer; USER MOVES; USER STOPS; Scan Finishes; USE CASE: 3D Printing, Social Networking, Gaming Avatars, etc.; Computer Vision Based Initial Pose Estimation; Inertial Motion Sensor Fusion; Bundle Adjustment; HD Texture Generation; 3D Mesh Generation; Color Correction; TRACKING/ALIGNMENT.
  • 8. Scanner System Architecture. Layers: Apps (Java), Middleware (C++), Drivers (C), Hardware. Components: 3D Scanner Application; RGBD Image Grabber; Camera 2 API; Depth JNI; 3D Scanner JNI; Depth Engine (DSP/HVX); RGB Grabber; NIR Grabber; 3D Scanner Engine (CPU/GPU); SysFS; Camera HAL (two instances); Raw RGB Data; Raw NIR Data; Driver; Laser; NIR Camera; RGB Camera; Active Sensing Module. Note: Arrows indicate dependency, not dataflow.
  • 9. 3DR Workload Summary: Running on Snapdragon 820. 3D reconstruction (3DR) requires running several computationally demanding processes simultaneously: 1. Camera Pose Tracking 2. Sensor Fusion 3. Bundle Adjustment 4. Rendering 5. Mesh Generation 6. Texture Mapping 7. Structured Light Sensor Decoding. Thanks to the heterogeneous computational framework of the Snapdragon 820, we are able to do all of this at 15 FPS. Kryo (CPU/NEON): • Pose Tracking • Bundle Adjustment • Sensor Fusion • Mesh Generation. Adreno (GPU): • Rendering • Texture Mapping. Hexagon (DSP/HVX): • Depth from Structured Light. Spectra ISP: • RGB sensor processing • Depth sensor interface. 3DR powered by Snapdragon 820.
  • 10. Lessons Learned. Highest-quality 3DR requires great HW and SW. Efficient CV SW algorithms, operating with accurate depth sensors and power-efficient processors, bring commercial-grade 3DR to mobile platforms. Running 3DR on mobile requires tuning algorithms for power as well as performance. Power-efficient heterogeneous processors are mandatory for 3DR to run within mobile power and thermal envelopes. The heterogeneous processing cores on Snapdragon 820 enable a high-quality 3DR experience on mobile platforms.
  • 12. Scanner Block Diagram. Blocks: Scan Starts; Color + Depth (Structured Light Based Depth Generation); Live 3D Renderer/Viewer; USER MOVES; USER STOPS; Scan Finishes; USE CASE: 3D Printing, Social Networking, Gaming Avatars, etc.; Computer Vision Based Initial Pose Estimation; Inertial Motion Sensor Fusion; Bundle Adjustment; HD Texture Generation; 3D Mesh Generation; Color Correction; TRACKING/ALIGNMENT.
  • 13. Computer Vision Based Pose Estimation (6-DOF). Based on the Iterative Closest Point (ICP) concept, minimize the sum of squared pixel intensity differences (errors) and the sum of squared depth errors to align images: $\text{cost} = \sum (\text{Pixel Intensity Error})^2 + \lambda \sum (\text{Pixel Depth Error})^2$. Figure panels: Pixel Intensity Error, Depth Error. References: • F. Steinbruecker et al., “Real-Time Visual Odometry from Dense RGB-D Images”, ICCV 2011 • C. Kerl et al., “Dense Continuous-Time Tracking and Mapping with Rolling Shutter RGB-D Cameras”, ICCV 2015
  • 14. Computer Vision Based Pose Estimation (6-DOF): Flow. Current Image → Warp → Warped Image; Reference Image – Warped Image = Error Image; repeat to minimize the error.
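The warp, subtract, and repeat loop from slides 13 and 14 can be illustrated with a deliberately reduced sketch. The assumptions here are loud ones: the "pose" is a single integer shift along a 1-D scanline instead of a 6-DOF transform, and the minimization is a brute-force search rather than the iterative solvers used in the cited papers. The names `warp`, `alignment_cost`, and `best_shift` are hypothetical; only the cost structure (squared intensity error plus λ times squared depth error) follows the slide.

```python
def warp(signal, shift):
    """Shift a 1-D signal by an integer number of samples, clamping at the edges."""
    n = len(signal)
    return [signal[min(max(i - shift, 0), n - 1)] for i in range(n)]


def alignment_cost(ref_i, ref_d, cur_i, cur_d, shift, lam):
    """Sum of squared intensity errors plus lam times sum of squared depth errors."""
    warped_i = warp(cur_i, shift)
    warped_d = warp(cur_d, shift)
    e_int = sum((a - b) ** 2 for a, b in zip(ref_i, warped_i))
    e_dep = sum((a - b) ** 2 for a, b in zip(ref_d, warped_d))
    return e_int + lam * e_dep


def best_shift(ref_i, ref_d, cur_i, cur_d, lam=1.0, max_shift=3):
    """Try every candidate shift and keep the one with the lowest cost."""
    candidates = range(-max_shift, max_shift + 1)
    return min(candidates,
               key=lambda s: alignment_cost(ref_i, ref_d, cur_i, cur_d, s, lam))
```

The weight `lam` plays the role of λ on the slide: it trades off how much the depth channel influences the alignment relative to intensity.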
  • 16. Motion Sensor Fusion. The vision pose will likely contain some errors. • One cause is a lack of geometric and textural structure in the scene. This can be overcome by fusing the vision pose with data from the tablet's Inertial Measurement Unit (IMU). Using the Extended Kalman Filter (EKF) concept, one can predict poses from the IMU. These are then fused with the vision pose in the update step of the EKF to obtain the fused pose estimate. Diagram: Gyro and Accelerometer → Extended Kalman Filter (Predict); Vision Based Pose Estimation → Extended Kalman Filter (Update). References: • M. Li et al., “3-D motion estimation and online temporal calibration for camera-IMU systems”, ICRA 2013 • S. Weiss et al., “Real-Time Metric State Estimation for Modular Vision-Inertial Systems”, ICRA 2011
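The predict/update split on slide 16 can be sketched with a scalar filter. This is an assumption-heavy simplification: the state is a single 1-D position with a linear motion model, so the code is an ordinary Kalman filter; a real EKF would track a full pose and linearize nonlinear motion and measurement models at each step. The names and the noise parameters `q` and `r` are hypothetical.

```python
def predict(x, p, imu_delta, q):
    """Predict step: propagate the pose with the IMU-derived motion.

    x is the pose estimate, p its variance, q the process noise variance.
    """
    return x + imu_delta, p + q


def update(x, p, z, r):
    """Update step: fuse the vision-based pose z (variance r) into the prediction."""
    k = p / (p + r)                      # Kalman gain
    return x + k * (z - x), (1.0 - k) * p
```

For example, `predict(0.0, 1.0, 1.0, 0.5)` yields a predicted pose of 1.0 with variance 1.5; fusing a vision pose of 2.0 with variance 1.5 then gives a gain of 0.5, so the fused pose lands halfway between the two at 1.5 with a reduced variance of 0.75.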
  • 17. Bundle Adjustment. Fused poses need to be refined in order to reduce the visual errors. • Reason: poses are computed locally, between consecutive frames. We use bundle adjustment to find optimal global or semi-global poses. • Construct links (red lines) between captured frames (blue nodes). Links are established if the re-projection between captured images is above a certain threshold. • Jointly optimize the connected nodes. References: • V. Indelman et al., “Incremental Light Bundle Adjustment for Robotics Navigation”, IROS 2013 • R. Newcombe et al., “KinectFusion: Real-Time Dense Surface Mapping and Tracking”, IEEE ISMAR 2011 • K. Konolige et al., “FrameSLAM: from Bundle Adjustment to Realtime Visual Mapping”, IEEE Transactions on Robotics 2008
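The idea of linking frames and jointly optimizing the connected nodes (slide 17) can be sketched with a toy pose graph. This is not full bundle adjustment: the assumptions are that each pose is a single number on a line, each link stores a measured offset between two frames, all links are weighted equally, and node 0 is held fixed as the anchor. The solver is plain coordinate descent on the least-squares cost, chosen for brevity. All names are hypothetical.

```python
def optimize_poses(initial, links, iters=200):
    """Minimize the sum of (x[j] - x[i] - m)^2 over links (i, j, m).

    Each sweep moves every free node to the average of the positions its
    neighbours predict for it, which is the exact minimizer of the cost with
    respect to that single node.
    """
    x = list(initial)
    for _ in range(iters):
        for k in range(1, len(x)):       # node 0 stays fixed as the anchor
            predictions = []
            for i, j, m in links:
                if j == k:
                    predictions.append(x[i] + m)
                elif i == k:
                    predictions.append(x[j] - m)
            if predictions:
                x[k] = sum(predictions) / len(predictions)
    return x
```

With three consecutive-frame links that each over-estimate the motion (1.1 instead of 1.0) and one link from frame 0 to frame 3 measuring 3.0, the 0.3 of accumulated drift is spread evenly over the four links, which is the kind of global correction that frame-to-frame tracking alone cannot make.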
  • 18. Surface Reconstruction / Mesh Generation. Having computed the 3D points, we need to generate the 3D surface mesh that best describes the scene while reducing the noise. Many methods are available in the literature for surface reconstruction: Moving Least Squares (MLS), Truncated Signed Distance Function (TSDF) and Poisson. Any can be used in theory. TSDF is the least computationally demanding; MLS and Poisson are more demanding. These are then followed by the marching cubes algorithm to generate the mesh. References: • S. Fleishman et al., “Robust Moving Least-squares Fitting with Sharp Features”, ACM SIGGRAPH 2005 • M. Kazhdan et al., “Poisson Surface Reconstruction”, Symposium on Geometry Processing 2006 • R. Newcombe et al., “KinectFusion: Real-Time Dense Surface Mapping and Tracking”, IEEE ISMAR 2011
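How a TSDF reduces noise can be sketched along a single camera ray. The assumptions of this sketch: voxels lie at known depths along one ray, every measurement has weight 1, and the surface is recovered as the zero crossing of the fused values, which is the 1-D analogue of what marching cubes extracts in 3-D. The function names and the truncation distance `trunc` are hypothetical; the running weighted average follows the general KinectFusion-style scheme rather than any implementation detail from the talk.

```python
def tsdf_update(tsdf, weight, voxel_depths, measured_depth, trunc):
    """Fuse one depth measurement along a single ray into a 1-D TSDF (in place)."""
    for i, z in enumerate(voxel_depths):
        sdf = measured_depth - z         # positive in front of the surface
        if sdf < -trunc:
            continue                     # far behind the surface: not observed
        d = min(1.0, sdf / trunc)        # truncate and normalize
        tsdf[i] = (tsdf[i] * weight[i] + d) / (weight[i] + 1)
        weight[i] += 1


def surface_depth(tsdf, weight, voxel_depths):
    """Locate the zero crossing by linear interpolation between observed voxels."""
    for i in range(len(tsdf) - 1):
        a, b = tsdf[i], tsdf[i + 1]
        if weight[i] > 0 and weight[i + 1] > 0 and a > 0 >= b:
            t = a / (a - b)
            return voxel_depths[i] + t * (voxel_depths[i + 1] - voxel_depths[i])
    return None
```

Fusing two noisy readings of the same surface, for instance 0.52 m and 0.48 m, moves the zero crossing to roughly their average, which is the noise-reduction effect the slide refers to.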
  • 19. Color Correction. Captured color images can suffer from color casts for many reasons, such as different lighting sources. We need to correct that so that the overall color of the 3D model is in harmony. Solution: estimate the color casts and remove them. • Gray points provide the best estimate of the color cast. • Estimate gray pixels and shift the appropriate channel gain to bring them to neutral gray. • Repeat until convergence. Figure panels: BEFORE, AFTER. Reference: • J. Huo et al., “Robust Automatic White Balance Algorithm Using Gray Color Points in Images”, IEEE Trans. Consumer Electronics, 2006
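The estimate, shift, and repeat loop on slide 19 can be sketched as below. The gray-point test used here (channel spread small relative to the pixel mean) and the damped gain step are assumptions of this sketch, not the criterion from the cited Huo et al. paper, which works in a luminance/chrominance space. Pixels are assumed to be `(r, g, b)` tuples of floats, the green channel is kept as the reference, and all names and default thresholds are hypothetical.

```python
def is_near_gray(pixel, tol):
    """Treat a pixel as a gray candidate if its channel spread is small."""
    mean = sum(pixel) / 3.0
    return mean > 0 and (max(pixel) - min(pixel)) <= tol * mean


def estimate_gains(pixels, gray_tol=0.35, step=0.5, iters=50, eps=1e-4):
    """Iteratively adjust red and blue gains until gray candidates are neutral."""
    gain_r, gain_b = 1.0, 1.0
    for _ in range(iters):
        corrected = [(r * gain_r, g, b * gain_b) for r, g, b in pixels]
        gray = [p for p in corrected if is_near_gray(p, gray_tol)]
        if not gray:
            break
        n = len(gray)
        mean_r = sum(p[0] for p in gray) / n
        mean_g = sum(p[1] for p in gray) / n
        mean_b = sum(p[2] for p in gray) / n
        if mean_r <= 0 or mean_b <= 0:
            break
        err_r = mean_g / mean_r - 1.0
        err_b = mean_g / mean_b - 1.0
        if abs(err_r) < eps and abs(err_b) < eps:
            break                        # converged: gray points are neutral
        gain_r *= 1.0 + step * err_r     # damped move toward the green level
        gain_b *= 1.0 + step * err_b
    return gain_r, gain_b
```

Applying the returned gains to every pixel of a captured frame before texturing would remove the estimated cast from that frame.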
  • 20. Texture Mapping. The captured images need to be joined into one or more images called texture maps. Texture mapping can be thought of as “3D stitching of the images on the 3D model”. Obtaining the texture map generally consists of two steps: • Determine where to put the pixels on the 3D model (texture coordinates). • Determine the color of each pixel given a sequence of input images. Figure panels: Input Camera Images, Output Texture Map, Colored 3D Model Using the Texture Map. References: • P. Debevec et al., “Efficient View-Dependent Image-Based Rendering with Projective Texture-Mapping”, Eurographics Rendering Workshop 1998 • M. Waechter et al., “Let There Be Color! Large-Scale Texturing of 3D Reconstructions”, ECCV 2015
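The second step on slide 20, choosing a color for a point on the model from several input images, can be sketched with projective lookups. The assumptions: a pinhole camera with intrinsics `fx, fy, cx, cy`, each view given as a `((R, t), image)` pair where `R` and `t` map world coordinates into that camera, nearest-neighbour sampling, plain averaging across views, and no occlusion test. These choices are for illustration only; the cited papers use more careful view selection and blending. All names are hypothetical.

```python
def project(point, pose, fx, fy, cx, cy):
    """Project a 3-D world point into pixel coordinates (u, v), or None if behind the camera."""
    R, t = pose
    xc = [sum(R[i][j] * point[j] for j in range(3)) + t[i] for i in range(3)]
    if xc[2] <= 0:
        return None
    return (fx * xc[0] / xc[2] + cx, fy * xc[1] / xc[2] + cy)


def sample_color(point, views, fx, fy, cx, cy):
    """Average the colors of all views in which the point lands inside the image."""
    samples = []
    for pose, image in views:
        uv = project(point, pose, fx, fy, cx, cy)
        if uv is None:
            continue
        u, v = int(round(uv[0])), int(round(uv[1]))
        if 0 <= v < len(image) and 0 <= u < len(image[0]):
            samples.append(image[v][u])
    if not samples:
        return None
    n = len(samples)
    return tuple(sum(s[c] for s in samples) / n for c in range(3))
```

Running `sample_color` for the 3-D position behind every texel of the texture map would fill in the map; averaging is the simplest choice and tends to blur detail when the poses are slightly off, which is one reason pose refinement comes earlier in the pipeline.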
  • 22. Some Results. Using our system we can scan a small toy, a human face or body, or another object. All of this can happen easily on the Snapdragon 820, thanks to its powerful heterogeneous computational framework.
  • 23. Thank you. For more information, visit us at: www.qualcomm.com & www.qualcomm.com/blog. Nothing in these materials is an offer to sell any of the components or devices referenced herein. ©2016 Qualcomm Technologies, Inc. and/or its affiliated companies. All Rights Reserved. Qualcomm and Snapdragon are trademarks of Qualcomm Incorporated, registered in the United States and other countries. Why Wait is a trademark of Qualcomm Incorporated. Other products and brand names may be trademarks or registered trademarks of their respective owners. References in this presentation to “Qualcomm” may mean Qualcomm Incorporated, Qualcomm Technologies, Inc., and/or other subsidiaries or business units within the Qualcomm corporate structure, as applicable.

Qualcomm Incorporated includes Qualcomm’s licensing business, QTL, and the vast majority of its patent portfolio. Qualcomm Technologies, Inc., a wholly-owned subsidiary of Qualcomm Incorporated, operates, along with its subsidiaries, substantially all of Qualcomm’s engineering, research and development functions, and substantially all of its product and services businesses, including its semiconductor business, QCT.