SlideShare a Scribd company logo
1 of 18
Download to read offline
© 2019 Qualcomm
Qualcomm® Snapdragon™ Hybrid
Computer Vision/ Deep Learning
Architecture for Imaging Applications
Robert Lay, Product Manager, Staff
Qualcomm Technologies, Inc (QTI)
May 2019
Qualcomm Snapdragon is a product of Qualcomm Technologies, Inc. and or its subsidiaries.
© 2019 Qualcomm 2
Today and tomorrow’s imaging challenges
The challenges of
Camera workloads
Very compute intensive
Complex algorithms
and Data flows
Many concurrencies
Low latency,
high bandwidths
Must be thermally efficient
for sleek, ultra-light designs
Requires long battery
life for all-day use
Storage / Memory Compute/
Bandwidth limitations
© 2019 Qualcomm
Extreme
Speed Boost
Up to 4x*
Power Savings
Multi-object Classification
Multi-Object Tracking
Object Segmentation
Depth Sensing at 60fps
6DoF XR Body Tracking
CV Stabilization
QTI has enabled the world’s first announced CV-ISP
Computer Vision HW Acceleration
A Powerful Heterogeneous Platform
for Advanced Imaging Experiences
*vs running on cDSP, Q6
Qualcomm Adreno, Qualcomm Hexagon, Qualcomm Kryo, Qualcomm Spectra and Qualcomm Snapdragon are products of Qualcomm Technologies Inc., and/or its
subsidiaries.
© 2019 Qualcomm 4
Heterogeneous Approach at QTI to Computer Vision
ISP
Algorithmic
advancements
Neural network algorithm design
optimized for hardware
Optimization for space and
runtime efficiency
Software
tools
PC and target based simulators
Flexible SW frameworks
Sample code
Efficient
hardware
Efficient architecture design
Selecting the right compute
engine for the right task
© 2019 Qualcomm
Qualcomm® SpectraTM CV HW Processor - CVP
CVP Acceleration Applications and Benefits
Preprocessing
& Detection
• Pyramidal scaler
• Downscaler
• Feature descriptor
Tracking/
Motion processing
• Optical flow
• Feature point extraction
• Binary feature descriptor and inline descriptor matching
• Normalized cross correlation (NCC)
Classification • Non-linear classifier
Geometric Correction
Engine
• Low-power high-quality image warping and alignment engine
Depth Processing • Depth from Stereo
Embedded Processor • Scheduling and concurrency handling
© 2019 Qualcomm
CVP
Accelerated
Examples
© 2019 Qualcomm
Example for CVP accelerated real time video
bokeh/segmentation
Blue blocks accelerated using CVP
R+L
Images
RT
Bokeh
DL Post-Proc
Depth from
Stereo
DFS
Rectification
GCE, Scalers
DL Post Proc
Blur Map
DL
Segmentation
RT
Segmented
RT
Recalibration
Descriptors, OF
Scalers
Factory
Calibration
© 2019 Qualcomm
Depth sensing at 60fps
World’s First announced 4K HDR
with portrait mode
Example for CVP accelerated
real time video bokeh
© 2019 Qualcomm
XR Case:
CVP acceleration for 3D scene understanding
Book:
Ireland, ISBN,
<url@amazon>
Table: IKEA #,
dimensions W x H x D
unknown:
3D model
Unknown:
cube W x H x D
Known 3D coordinates
of everything
Room floor plan
and furniture layout
© 2019 Qualcomm
Blue blocks accelerated using CVP
3D reconstruction
Camera pose
Geometry
Semantic
information
…
Viewer perspective
6DOF
Depth
Image
Data
Planes
Per object
extraction
Room
layout
3D
Segmentation
DL
fusion
DL semantic
segmentation Obstacle
warningIR
RGB
Mono
© 2019 Qualcomm 11
XR 6DoF accelerated with CVP
Feature Detector and Tracker
(Major tasks offloaded to CVP)
Extended Kalman Filter
(Done in cDSP and optionally mDSP)
Predicted
2D feature
positions
Measured
2D feature
position
Pose
New
Feature
Inertial
Sensor
calibration
Pose
3D
Map Gravity
Camera
Frames
Camera intrinsic &
extrinsic calibration
Accelerometers
Gyroscopes
© 2019 Qualcomm
XR 6DoF accelerated with CVP
CVP
Left
Camera
Frames
Right
Camera
Frames
DSP
Pyramid Scaler
3 Levels
Feature
Tracking
3 Levels NCC
Keypoint
Detection
Predict
features
location
Camera
Pose
EKF
Update
Key
Frame?
Stereo
Matching
NCC
Coordinates Coordinates
© 2019 Qualcomm
6DoF Body Tracking Object Detection and Tracking
CPU+DSP+GPU CVP
2xPower
Savings
4xPower
Savings
Power savings with CVP*
*Qualcomm R&D characterization on Snapdragon 855 mobile platform
© 2019 Qualcomm
6DoF Body Tracking Object Detection and Tracking
CPU+DSP+GPU CVP
39%Compute
Savings
96%Compute
Savings
Compute savings with CVP *
*Qualcomm R&D characterization on Snapdragon 855 mobile platform
© 2019 Qualcomm 15
Conclusions
Computer Vision
underpins cutting edge
imaging experiences
Classic camera and video
experiences enhanced with:
• Motion compensated temporal filtering
• Multi-frame techniques
• Depth enabled features and algorithms
• Video superresolution
• Video frame rate upconversion
Emerging industries enabled by vision:
QTI is taking a heterogeneous
approach to computer vision enhanced
camera experiences
Meeting KPIs by combining general purpose
compute engines with dedicated accelerators:
IVI/ADASXR IoT
Power Latency Processor
and memory
capacity
© 2019 Qualcomm 16
Resources
Snapdragon Computer Vision SDK
Please contact Robert Lay: rlay@qti.qualcomm.com
FastCV Computer Vision SDK
https://developer.qualcomm.com/software/fastcv-sdk
Snapdragon On-Device AI
https://www.qualcomm.com/invention/artificial-intelligence
https://www.qualcomm.com/snapdragon/artificial-intelligence
Qualcomm® Neural Processing SDK
https://developer.qualcomm.com/software/qualcomm-neural-processing-sdk
Hexagon SDK
https://developer.qualcomm.com/software/hexagon-dsp-sdk
Qualcomm Neural Processing SDK is a product of Qualcomm Technologies, Inc. and/or its subsidiaries.
© 2019 Qualcomm
CVP SDK for Customer Enablement
• Documentation:
• CVP 1.0 (SD855) API Document: Available now
• Computer Vision Processor (CVP) v 1.x Programming Guide
• CVP 2.0 (SD865) API Document: May 2019
• Computer Vision Processor (CVP) v 2.x Programming Guide
• Bit-exact PC simulator (Windows and Linux) for algorithm development:
• CVP 1.0 simulator: Available now
• CVP 2.0 simulator: Q2/2019)
• Reference examples code for each of CVP blocks: CVP1.0 now, CVP2.0 Q2/2019
• Offline training tool for Object Detection: CVP1.0 now, CVP2.0 Q3/2019
• This tool works with annotations same as used for training DL networks
© 2019 Qualcomm
Thank you
Nothing in these materials is an offer to sell any of the components or devices referenced herein.
Qualcomm, Snapdragon, Adreno, Hexagon, Kryo and Qualcomm Spectra are trademarks of Qualcomm
Incorporated, registered in the United States and other countries. Other products and brand names may be a
trademark or registered trademark of their respective owners.
References in this presentation to “Qualcomm” may mean Qualcomm Incorporated, Qualcomm Technologies,
Inc., and/or other subsidiaries or business units within the Qualcomm corporate structure, as applicable.
Qualcomm Incorporated includes Qualcomm’s licensing business, QTL, and the vast majority of its patent
portfolio. Qualcomm Technologies, Inc., a wholly-owned subsidiary of Qualcomm Incorporated, operates, along
with its subsidiaries, substantially all of Qualcomm’s engineering, research and development functions, and
substantially all of its product and services businesses, including its semiconductor business, QCT.

More Related Content

What's hot

Insights on the Performance and Configuration of AVB and TSN in Automotive Ap...
Insights on the Performance and Configuration of AVB and TSN in Automotive Ap...Insights on the Performance and Configuration of AVB and TSN in Automotive Ap...
Insights on the Performance and Configuration of AVB and TSN in Automotive Ap...
RealTime-at-Work (RTaW)
 
PyKinect: Body Iteration Application Development Using Python
PyKinect: Body Iteration Application Development Using PythonPyKinect: Body Iteration Application Development Using Python
PyKinect: Body Iteration Application Development Using Python
pycontw
 

What's hot (20)

7nm "Navi" GPU - A GPU Built For Performance
7nm "Navi" GPU - A GPU Built For Performance 7nm "Navi" GPU - A GPU Built For Performance
7nm "Navi" GPU - A GPU Built For Performance
 
XPDDS17: PVH Dom0: The Road so Far - Roger Pau Monné, Citrix
XPDDS17: PVH Dom0: The Road so Far - Roger Pau Monné, CitrixXPDDS17: PVH Dom0: The Road so Far - Roger Pau Monné, Citrix
XPDDS17: PVH Dom0: The Road so Far - Roger Pau Monné, Citrix
 
CNN Quantization
CNN QuantizationCNN Quantization
CNN Quantization
 
FPGA Hardware Accelerator for Machine Learning
FPGA Hardware Accelerator for Machine Learning FPGA Hardware Accelerator for Machine Learning
FPGA Hardware Accelerator for Machine Learning
 
“Seamless Deployment of Multimedia and Machine Learning Applications at the E...
“Seamless Deployment of Multimedia and Machine Learning Applications at the E...“Seamless Deployment of Multimedia and Machine Learning Applications at the E...
“Seamless Deployment of Multimedia and Machine Learning Applications at the E...
 
GPU
GPUGPU
GPU
 
Introduction to Deep Learning (NVIDIA)
Introduction to Deep Learning (NVIDIA)Introduction to Deep Learning (NVIDIA)
Introduction to Deep Learning (NVIDIA)
 
Insights on the Performance and Configuration of AVB and TSN in Automotive Ap...
Insights on the Performance and Configuration of AVB and TSN in Automotive Ap...Insights on the Performance and Configuration of AVB and TSN in Automotive Ap...
Insights on the Performance and Configuration of AVB and TSN in Automotive Ap...
 
Wix's ML Platform
Wix's ML PlatformWix's ML Platform
Wix's ML Platform
 
Riscv 20160507-patterson
Riscv 20160507-pattersonRiscv 20160507-patterson
Riscv 20160507-patterson
 
IBM Rational Rhapsody and Qt Integration
IBM Rational Rhapsody and Qt IntegrationIBM Rational Rhapsody and Qt Integration
IBM Rational Rhapsody and Qt Integration
 
RISC-V Introduction
RISC-V IntroductionRISC-V Introduction
RISC-V Introduction
 
Hardware Acceleration for Machine Learning
Hardware Acceleration for Machine LearningHardware Acceleration for Machine Learning
Hardware Acceleration for Machine Learning
 
FrameGraph: Extensible Rendering Architecture in Frostbite
FrameGraph: Extensible Rendering Architecture in FrostbiteFrameGraph: Extensible Rendering Architecture in Frostbite
FrameGraph: Extensible Rendering Architecture in Frostbite
 
PR-043: HyperNetworks
PR-043: HyperNetworksPR-043: HyperNetworks
PR-043: HyperNetworks
 
Using FMI (Functional Mock-up Interface) for MBSE at all steps of System Design
Using FMI (Functional Mock-up Interface) for MBSE at all steps of System DesignUsing FMI (Functional Mock-up Interface) for MBSE at all steps of System Design
Using FMI (Functional Mock-up Interface) for MBSE at all steps of System Design
 
Enabling on-device learning at scale
Enabling on-device learning at scaleEnabling on-device learning at scale
Enabling on-device learning at scale
 
PyKinect: Body Iteration Application Development Using Python
PyKinect: Body Iteration Application Development Using PythonPyKinect: Body Iteration Application Development Using Python
PyKinect: Body Iteration Application Development Using Python
 
HPG 2018 - Game Ray Tracing: State-of-the-Art and Open Problems
HPG 2018 - Game Ray Tracing: State-of-the-Art and Open ProblemsHPG 2018 - Game Ray Tracing: State-of-the-Art and Open Problems
HPG 2018 - Game Ray Tracing: State-of-the-Art and Open Problems
 
BKK16-315 Graphics Stack Update
BKK16-315 Graphics Stack UpdateBKK16-315 Graphics Stack Update
BKK16-315 Graphics Stack Update
 

Similar to "Snapdragon Hybrid Computer Vision/Deep Learning Architecture for Imaging Applications," a Presentation from Qualcomm

"Efficient Deployment of Quantized ML Models at the Edge Using Snapdragon SoC...
"Efficient Deployment of Quantized ML Models at the Edge Using Snapdragon SoC..."Efficient Deployment of Quantized ML Models at the Edge Using Snapdragon SoC...
"Efficient Deployment of Quantized ML Models at the Edge Using Snapdragon SoC...
Edge AI and Vision Alliance
 
"Portable Performance via the OpenVX Computer Vision Library: Case Studies," ...
"Portable Performance via the OpenVX Computer Vision Library: Case Studies," ..."Portable Performance via the OpenVX Computer Vision Library: Case Studies," ...
"Portable Performance via the OpenVX Computer Vision Library: Case Studies," ...
Edge AI and Vision Alliance
 

Similar to "Snapdragon Hybrid Computer Vision/Deep Learning Architecture for Imaging Applications," a Presentation from Qualcomm (20)

"Efficient Deployment of Quantized ML Models at the Edge Using Snapdragon SoC...
"Efficient Deployment of Quantized ML Models at the Edge Using Snapdragon SoC..."Efficient Deployment of Quantized ML Models at the Edge Using Snapdragon SoC...
"Efficient Deployment of Quantized ML Models at the Edge Using Snapdragon SoC...
 
SYCL 2020 Specification
SYCL 2020 SpecificationSYCL 2020 Specification
SYCL 2020 Specification
 
AWS re:Invent 2016: Powering the Next Generation of Virtual Reality with Veri...
AWS re:Invent 2016: Powering the Next Generation of Virtual Reality with Veri...AWS re:Invent 2016: Powering the Next Generation of Virtual Reality with Veri...
AWS re:Invent 2016: Powering the Next Generation of Virtual Reality with Veri...
 
“Open Standards: Powering the Future of Embedded Vision,” a Presentation from...
“Open Standards: Powering the Future of Embedded Vision,” a Presentation from...“Open Standards: Powering the Future of Embedded Vision,” a Presentation from...
“Open Standards: Powering the Future of Embedded Vision,” a Presentation from...
 
Imaging automotive 2015 addfor v002
Imaging automotive 2015   addfor v002Imaging automotive 2015   addfor v002
Imaging automotive 2015 addfor v002
 
Imaging automotive 2015 addfor v002
Imaging automotive 2015   addfor v002Imaging automotive 2015   addfor v002
Imaging automotive 2015 addfor v002
 
Achieving AI @scale on Mobile Devices
Achieving AI @scale on Mobile DevicesAchieving AI @scale on Mobile Devices
Achieving AI @scale on Mobile Devices
 
GCP Meetup #3 - Approaches to Cloud Native Architectures
GCP Meetup #3 - Approaches to Cloud Native ArchitecturesGCP Meetup #3 - Approaches to Cloud Native Architectures
GCP Meetup #3 - Approaches to Cloud Native Architectures
 
"Portable Performance via the OpenVX Computer Vision Library: Case Studies," ...
"Portable Performance via the OpenVX Computer Vision Library: Case Studies," ..."Portable Performance via the OpenVX Computer Vision Library: Case Studies," ...
"Portable Performance via the OpenVX Computer Vision Library: Case Studies," ...
 
Computer Vision Technology and Expertise
Computer Vision Technology and ExpertiseComputer Vision Technology and Expertise
Computer Vision Technology and Expertise
 
“Machine Learning for the Real World: What is Acceptable Accuracy, and How Ca...
“Machine Learning for the Real World: What is Acceptable Accuracy, and How Ca...“Machine Learning for the Real World: What is Acceptable Accuracy, and How Ca...
“Machine Learning for the Real World: What is Acceptable Accuracy, and How Ca...
 
C++ for Marine Streamer Positioning and Navigation - ACCU 2011
C++ for Marine Streamer Positioning and Navigation - ACCU 2011C++ for Marine Streamer Positioning and Navigation - ACCU 2011
C++ for Marine Streamer Positioning and Navigation - ACCU 2011
 
DCC Labs Company Presentation
DCC Labs Company PresentationDCC Labs Company Presentation
DCC Labs Company Presentation
 
Resume_suresh_final
Resume_suresh_finalResume_suresh_final
Resume_suresh_final
 
Accelerating Edge Computing Adoption
Accelerating Edge Computing Adoption Accelerating Edge Computing Adoption
Accelerating Edge Computing Adoption
 
Snapdragon 845 Benchmarking UX Workshop
Snapdragon 845 Benchmarking UX WorkshopSnapdragon 845 Benchmarking UX Workshop
Snapdragon 845 Benchmarking UX Workshop
 
"Highly Efficient, Scalable Vision and AI Processors IP for the Edge," a Pres...
"Highly Efficient, Scalable Vision and AI Processors IP for the Edge," a Pres..."Highly Efficient, Scalable Vision and AI Processors IP for the Edge," a Pres...
"Highly Efficient, Scalable Vision and AI Processors IP for the Edge," a Pres...
 
xmobots at the Scade User Group Conference 2013
xmobots at the Scade User Group Conference 2013xmobots at the Scade User Group Conference 2013
xmobots at the Scade User Group Conference 2013
 
Where should I run my code? Serverless, Containers, Virtual Machines and more
Where should I run my code? Serverless, Containers, Virtual Machines and moreWhere should I run my code? Serverless, Containers, Virtual Machines and more
Where should I run my code? Serverless, Containers, Virtual Machines and more
 
"Computer Vision and Machine Learning at the Edge," a Presentation from Qualc...
"Computer Vision and Machine Learning at the Edge," a Presentation from Qualc..."Computer Vision and Machine Learning at the Edge," a Presentation from Qualc...
"Computer Vision and Machine Learning at the Edge," a Presentation from Qualc...
 

More from Edge AI and Vision Alliance

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
Edge AI and Vision Alliance
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
Edge AI and Vision Alliance
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...
Edge AI and Vision Alliance
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
Edge AI and Vision Alliance
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
Edge AI and Vision Alliance
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
Edge AI and Vision Alliance
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara
Edge AI and Vision Alliance
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
Edge AI and Vision Alliance
 

More from Edge AI and Vision Alliance (20)

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
 

Recently uploaded

Recently uploaded (20)

Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 

"Snapdragon Hybrid Computer Vision/Deep Learning Architecture for Imaging Applications," a Presentation from Qualcomm

  • 1. © 2019 Qualcomm Qualcomm® Snapdragon™ Hybrid Computer Vision/ Deep Learning Architecture for Imaging Applications Robert Lay, Product Manager, Staff Qualcomm Technologies, Inc (QTI) May 2019 Qualcomm Snapdragon is a product of Qualcomm Technologies, Inc. and or its subsidiaries.
  • 2. © 2019 Qualcomm 2 Today and tomorrow’s imaging challenges The challenges of Camera workloads Very compute intensive Complex algorithms and Data flows Many concurrencies Low latency, high bandwidths Must be thermally efficient for sleek, ultra-light designs Requires long battery life for all-day use Storage / Memory Compute/ Bandwidth limitations
  • 3. © 2019 Qualcomm Extreme Speed Boost Up to 4x* Power Savings Multi-object Classification Multi-Object Tracking Object Segmentation Depth Sensing at 60fps 6DoF XR Body Tracking CV Stabilization QTI has enabled the world’s first announced CV-ISP Computer Vision HW Acceleration A Powerful Heterogeneous Platform for Advanced Imaging Experiences *vs running on cDSP, Q6 Qualcomm Adreno, Qualcomm Hexagon, Qualcomm Kryo, Qualcomm Spectra and Qualcomm Snapdragon are products of Qualcomm Technologies Inc., and/or its subsidiaries.
  • 4. © 2019 Qualcomm 4 Heterogeneous Approach at QTI to Computer Vision ISP Algorithmic advancements Neural network algorithm design optimized for hardware Optimization for space and runtime efficiency Software tools PC and target based simulators Flexible SW frameworks Sample code Efficient hardware Efficient architecture design Selecting the right compute engine for the right task
  • 5. © 2019 Qualcomm Qualcomm® SpectraTM CV HW Processor - CVP CVP Acceleration Applications and Benefits Preprocessing & Detection • Pyramidal scaler • Downscaler • Feature descriptor Tracking/ Motion processing • Optical flow • Feature point extraction • Binary feature descriptor and inline descriptor matching • Normalized cross correlation (NCC) Classification • Non-linear classifier Geometric Correction Engine • Low-power high-quality image warping and alignment engine Depth Processing • Depth from Stereo Embedded Processor • Scheduling and concurrency handling
  • 7. © 2019 Qualcomm Example for CVP accelerated real time video bokeh/segmentation Blue blocks accelerated using CVP R+L Images RT Bokeh DL Post-Proc Depth from Stereo DFS Rectification GCE, Scalers DL Post Proc Blur Map DL Segmentation RT Segmented RT Recalibration Descriptors, OF Scalers Factory Calibration
  • 8. © 2019 Qualcomm Depth sensing at 60fps World’s First announced 4K HDR with portrait mode Example for CVP accelerated real time video bokeh
  • 9. © 2019 Qualcomm XR Case: CVP acceleration for 3D scene understanding Book: Ireland, ISBN, <url@amazon> Table: IKEA #, dimensions W x H x D unknown: 3D model Unknown: cube W x H x D Known 3D coordinates of everything Room floor plan and furniture layout
  • 10. © 2019 Qualcomm Blue blocks accelerated using CVP 3D reconstruction Camera pose Geometry Semantic information … Viewer perspective 6DOF Depth Image Data Planes Per object extraction Room layout 3D Segmentation DL fusion DL semantic segmentation Obstacle warningIR RGB Mono
  • 11. © 2019 Qualcomm 11 XR 6DoF accelerated with CVP Feature Detector and Tracker (Major tasks offloaded to CVP) Extended Kalman Filter (Done in cDSP and optionally mDSP) Predicted 2D feature positions Measured 2D feature position Pose New Feature Inertial Sensor calibration Pose 3D Map Gravity Camera Frames Camera intrinsic & extrinsic calibration Accelerometers Gyroscopes
  • 12. © 2019 Qualcomm XR 6DoF accelerated with CVP CVP Left Camera Frames Right Camera Frames DSP Pyramid Scaler 3 Levels Feature Tracking 3 Levels NCC Keypoint Detection Predict features location Camera Pose EKF Update Key Frame? Stereo Matching NCC Coordinates Coordinates
  • 13. © 2019 Qualcomm 6DoF Body Tracking Object Detection and Tracking CPU+DSP+GPU CVP 2xPower Savings 4xPower Savings Power savings with CVP* *Qualcomm R&D characterization on Snapdragon 855 mobile platform
  • 14. © 2019 Qualcomm 6DoF Body Tracking Object Detection and Tracking CPU+DSP+GPU CVP 39%Compute Savings 96%Compute Savings Compute savings with CVP * *Qualcomm R&D characterization on Snapdragon 855 mobile platform
  • 15. © 2019 Qualcomm 15 Conclusions Computer Vision underpins cutting edge imaging experiences Classic camera and video experiences enhanced with: • Motion compensated temporal filtering • Multi-frame techniques • Depth enabled features and algorithms • Video superresolution • Video frame rate upconversion Emerging industries enabled by vision: QTI is taking a heterogeneous approach to computer vision enhanced camera experiences Meeting KPIs by combining general purpose compute engines with dedicated accelerators: IVI/ADASXR IoT Power Latency Processor and memory capacity
  • 16. © 2019 Qualcomm 16 Resources Snapdragon Computer Vision SDK Please contact Robert Lay: rlay@qti.qualcomm.com FastCV Computer Vision SDK https://developer.qualcomm.com/software/fastcv-sdk Snapdragon On-Device AI https://www.qualcomm.com/invention/artificial-intelligence https://www.qualcomm.com/snapdragon/artificial-intelligence Qualcomm® Neural Processing SDK https://developer.qualcomm.com/software/qualcomm-neural-processing-sdk Hexagon SDK https://developer.qualcomm.com/software/hexagon-dsp-sdk Qualcomm Neural Processing SDK is a product of Qualcomm Technologies, Inc. and/or its subsidiaries.
  • 17. © 2019 Qualcomm CVP SDK for Customer Enablement • Documentation: • CVP 1.0 (SD855) API Document: Available now • Computer Vision Processor (CVP) v 1.x Programming Guide • CVP 2.0 (SD865) API Document: May 2019 • Computer Vision Processor (CVP) v 2.x Programming Guide • Bit-exact PC simulator (Windows and Linux) for algorithm development: • CVP 1.0 simulator: Available now • CVP 2.0 simulator: Q2/2019) • Reference examples code for each of CVP blocks: CVP1.0 now, CVP2.0 Q2/2019 • Offline training tool for Object Detection: CVP1.0 now, CVP2.0 Q3/2019 • This tool works with annotations same as used for training DL networks
  • 18. © 2019 Qualcomm Thank you Nothing in these materials is an offer to sell any of the components or devices referenced herein. Qualcomm, Snapdragon, Adreno, Hexagon, Kryo and Qualcomm Spectra are trademarks of Qualcomm Incorporated, registered in the United States and other countries. Other products and brand names may be a trademark or registered trademark of their respective owners. References in this presentation to “Qualcomm” may mean Qualcomm Incorporated, Qualcomm Technologies, Inc., and/or other subsidiaries or business units within the Qualcomm corporate structure, as applicable. Qualcomm Incorporated includes Qualcomm’s licensing business, QTL, and the vast majority of its patent portfolio. Qualcomm Technologies, Inc., a wholly-owned subsidiary of Qualcomm Incorporated, operates, along with its subsidiaries, substantially all of Qualcomm’s engineering, research and development functions, and substantially all of its product and services businesses, including its semiconductor business, QCT.