SlideShare a Scribd company logo
Copyright © 2017 CEVA Inc. 1
Fast Inference in Low Power Systems
via CEVA’s Deep Neural Network Solution
Yair Siegel, Director of Strategic Marketing
May 2017
Copyright © 2017 CEVA Inc. 2
The Need for Embedded Inferencing & AI
Security & Surveillance
Automotive
Visual Perception & Analytics
AR and VR Drones
Smartphones & Tablets
Copyright © 2017 CEVA Inc. 3
• Semantic Segmentation
• Automotive free space detection
• Bokeh camera effect (single cam)
• Object Detection & Recognition
• Video search & tagging
• Object overlay for AR
• Human-Machine Interaction
• Driver drowsiness & driver distraction
• Foveated rendering / Gaze Detection
• Authentication (Face, Iris, Palm)
Example Use-Cases for Inferencing
Copyright © 2017 CEVA Inc. 4
• Neural Networks vs. “Traditional Vision”
• Robustness, accuracy, proven, combination
• Cloud vs. edge processing
• Latency, computation power, data privacy
• Growth in network size
• Alexnet – 0.6 GMACs per frame
• YOLO Tiny – 2.3 GMACs per frame
• Battery life and device cost
• Opposite requirement to trend
• Need for efficient engines and networks
Neural Networks Inferencing Considerations
Copyright © 2017 CEVA Inc. 5
• Performance & Energy
State of the art CNN extremely demanding in computation and memory
bandwidth, battery envelope steady and limited
• Flexibility
Deep Neural Networks research keeps evolving
• New layer types, topologies frequently added
Inferencing Solution Motivation
Neural Networks Evolution
• R-CNN, LE, Alexnet
• Object Identification
• Linear Networks
• Faster R-CNN
• Semantic segmentation
• Separate networks for
detection and
classification
• Yolo, SSD, Mask
R-CNN
• Single stage networks
for Identification and
Classification
Copyright © 2017 CEVA Inc. 6
CEVA Deep Neural Network (CDNN) Flow
OEM / Partner (offline) CEVA (offline + real-time)
Copyright © 2017 CEVA Inc. 7
CDNN Toolkit
CNN
CEVA-CV
CEVA-VX
Comprehensive toolkit simplifying dev. & deployment of deep learning
systems for mass-market inferencing
Copyright © 2017 CEVA Inc. 8
• Comprehensive inferencing solution
• Input: Trained network
• Output: Optimal inference network on vision DSP + HW accelerator (optional)
CDNN Toolkit – Solution Overview
CDNN2 Runtime library
CNN HWA Driver
CDNN2 Application API
CEVA-XMHost CPU CNN HWA
CEVA
CNN Hardware
Accelerator
CDNN2
ApplicationAPI
Application
CDNN2 Network
Generator
Offline - PC
Copyright © 2017 CEVA Inc. 9
• CEVA Network Generator
• Offline auto converts floating to fixed-point nets
• Adapts for embedded constraints
• Application API
• High level interface for easy integration on host application
• Responsible for connectivity between host and DSP
• Runtime Library
• Executes network inference
• Optimized for CEVA-XM vision DSP
• On-the-fly bandwidth optimizations
• HWA Driver
• Controls the CNN HWA
CDNN Solution Overview (cont.)
CDNN2 Runtime library
CNN HWA Driver
CDNN2 Application API
CEVA-XMHost CPU CNN HWA
CEVA
CNN Hardware
Accelerator
CDNN2
ApplicationAPI
Application
CDNN2 Network
Generator
Offline - PC
Copyright © 2017 CEVA Inc. 10
CNN Development Scene
• Flexible and future proof, while
providing top performance
• Combines programmable DSP
with hardware acceleration
• Hardware accelerator focuses on
demanding layers, boosting
inference performance up to 12x*
• DNN rapidly evolving,
yet convolution layers
consistently handling
main function
• Convolution layers are
most demanding
portion of inference
CEVA Deep Neural Network Concept
CDNN Solution Concept
* vs. CEVA-XM4
Copyright © 2017 CEVA Inc. 11
Flexible Embedded CNN Solution
Flexible embedded solution and 16-bit support required
to cope with evolving lead neural networks
CEVA-XM
Vision DSP
CNN
Hardware Accelerator
• Controls full network execution
• Invokes CNN HWA
• Executes all other layers:
Normalization,
Pooling,
Deconvolution,
Etc.
• Supports multiple CNN HWAs
CDNN2 Real-Time SW Library
• Up to 512 MAC units
• 16 b x 16 b support
• Executes convolutions
• Internal memories
• Internal DMA units
• Autonomous execution
CNN HWA V1
Copyright © 2017 CEVA Inc. 12
Caffe (32-bit PC) vs. CDNN2 (16-bit Fixed)
https://youtu.be/VnbCVFyuWYk
Legend
Running Deepglint
proprietary
network
Original version
CDNN version
Copyright © 2017 CEVA Inc. 13
Example: Faster R-CNN Challenge
Automatic network analysis & optimization via CDNN without user involvement
Before
High Bandwidth
After
Low Bandwidth
Keeping Bit Accuracy
Copyright © 2017 CEVA Inc. 14
• Network generator converts network to
fixed point
• Reduces size and bandwidth by
combining layers and other methods
• Auto partitions Yolo layers, balanced
across multiple HWAs and DSP
• Convolutions handled by HWAs
• Utilizes local memories, avoids tiling,
lowers latency
• Yolo Tiny performance
• Standalone DSP – 10s frames / sec
• Hardware acc. – 100s frames / sec
Example: YOLO Network on CDNN
Solution utilizes DSP coupled with HWA, controlled via CDNN tools
Copyright © 2017 CEVA Inc. 15
• 2nd gen SW framework support
• Caffe and TensorFlow Frameworks
• Various networks*
• All network topologies
• All the leading layers
• Variable ROIs
• “Push-button” conversion from pre-trained
networks to optimized real-time
• Accelerates AI deployment for embedded
• Optimized for CEVA-XM DSPs and CNN
HW accelerator
CEVA Deep Neural Network (CDNN2)
(*) Including AlexNet, GoogLeNet, ResNet, SegNet, VGG, NIN and others
CDNN2 Wins
China Electronics Market
(CEM) Editor's Choice Award
Copyright © 2017 CEVA Inc. 16
• An Analysis of Deep Neural Network Models for Practical Applications
• Mask R-CNN + example of Inferencing for Mobile AI (Facebook F8)
• CDNN on CEVA web page
Come view CDNN live
Running Alexnet, Googlenet, Yolo and more,
@CEVA booth
Additional Resources
Live CDNN2 demo
Copyright © 2017 CEVA Inc. 17
Thank You!
Contact: Yairs@ceva-dsp.com
Copyright © 2017 CEVA Inc. 18
Backup Materials
Copyright © 2017 CEVA Inc. 19
Corporate Facts Worldwide Operations
World Leading IP Supplier
Corporate Introduction
• Headquartered in Mountain View, Calif.
• Publicly traded
• NASDAQ:CEVA
• Profitable, strong business
and cash positive – over $150M cash
• Licensing IP since 1991
• Low Power Signal Processing IP for a
range of high volume applications:
Communications, Wi-Fi, Bluetooth,
Audio, Voice, Imaging & Vision,
Storage
• U.S. (Mountain View, CA, Austin, TX &
Detroit, MI), Israel, Ireland, France, U.K,
Sweden, China, Taiwan, Korea & Japan
Copyright © 2017 CEVA Inc. 20
CEVA By Numbers
More than 300 licensees to date
>8 Billion
CEVA-powered devices
shipped worldwide to date
100
licensees of Wi-Fi & Bluetooth
IP – and more than 1 billion
chips shipped
3X
the market share in DSP over
any other DSP IP vendor
1 in 3
handsets worldwide are
powered by CEVA DSP
>5 billion
DSP cores in audio/voice
devices shipped to date
>30
licensees for imaging and
vision – shipping broadly in
2017
Copyright © 2017 CEVA Inc. 21
CEVA IP Platforms
Wireless
(Short Range)
Wireless
(Long Range)
Sensing Intelligence
CEVA-XC
LTE / 5G SDR
CEVA-X
LTE / 5G
PHY Control
LTE-IoT
RivieraWaves Wi-Fi
802.11n/ac AP and
Client
RivieraWaves Bluetooth
Dual mode, BLE and
Audio over BLE
CEVA-X
Sense & Connect
Hub
CEVA-TeakLite-4
Voice Activation
CEVA-XM
Machine Vision and
Deep Learning
Copyright © 2017 CEVA Inc. 22
• Comprehensive vision IP platform
• Centered on CEVA-XM6 Vision DSP
• Enables mass market intelligent vision
applications
• Simplifies deep learning inferencing
for low-power embedded devices
CEVA’s 5th Generation Imaging
& Vision Technology

More Related Content

More from Edge AI and Vision Alliance

“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
Edge AI and Vision Alliance
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
Edge AI and Vision Alliance
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
Edge AI and Vision Alliance
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
Edge AI and Vision Alliance
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
Edge AI and Vision Alliance
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...
Edge AI and Vision Alliance
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
Edge AI and Vision Alliance
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
Edge AI and Vision Alliance
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
Edge AI and Vision Alliance
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
Edge AI and Vision Alliance
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
Edge AI and Vision Alliance
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
Edge AI and Vision Alliance
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
Edge AI and Vision Alliance
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
Edge AI and Vision Alliance
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara
Edge AI and Vision Alliance
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
Edge AI and Vision Alliance
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
Edge AI and Vision Alliance
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
Edge AI and Vision Alliance
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
Edge AI and Vision Alliance
 
“Tracking and Fusing Diverse Risk Factors to Drive a SAFER Future,” a Present...
“Tracking and Fusing Diverse Risk Factors to Drive a SAFER Future,” a Present...“Tracking and Fusing Diverse Risk Factors to Drive a SAFER Future,” a Present...
“Tracking and Fusing Diverse Risk Factors to Drive a SAFER Future,” a Present...
Edge AI and Vision Alliance
 

More from Edge AI and Vision Alliance (20)

“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
 
“Tracking and Fusing Diverse Risk Factors to Drive a SAFER Future,” a Present...
“Tracking and Fusing Diverse Risk Factors to Drive a SAFER Future,” a Present...“Tracking and Fusing Diverse Risk Factors to Drive a SAFER Future,” a Present...
“Tracking and Fusing Diverse Risk Factors to Drive a SAFER Future,” a Present...
 

Recently uploaded

Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Large Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial ApplicationsLarge Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial Applications
Rohit Gautam
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Zilliz
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
Alex Pruden
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Nexer Digital
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 

Recently uploaded (20)

Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Large Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial ApplicationsLarge Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial Applications
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 

"Fast Inference in Low Power Systems via CEVA's Deep Neural Network Solution," a Presentation from CEVA

  • 1. Copyright © 2017 CEVA Inc. 1 Fast Inference in Low Power Systems via CEVA’s Deep Neural Network Solution Yair Siegel, Director of Strategic Marketing May 2017
  • 2. Copyright © 2017 CEVA Inc. 2 The Need for Embedded Inferencing & AI Security & Surveillance Automotive Visual Perception & Analytics AR and VR Drones Smartphones & Tablets
  • 3. Copyright © 2017 CEVA Inc. 3 • Semantic Segmentation • Automotive free space detection • Bokeh camera effect (single cam) • Object Detection & Recognition • Video search & tagging • Object overlay for AR • Human-Machine Interaction • Driver drowsiness & driver distraction • Foveated rendering / Gaze Detection • Authentication (Face, Iris, Palm) Example Use-Cases for Inferencing
  • 4. Copyright © 2017 CEVA Inc. 4 • Neural Networks vs. “Traditional Vision” • Robustness, accuracy, proven, combination • Cloud vs. edge processing • Latency, computation power, data privacy • Growth in network size • Alexnet – 0.6 GMACs per frame • YOLO Tiny – 2.3 GMACs per frame • Battery life and device cost • Opposite requirement to trend • Need for efficient engines and networks Neural Networks Inferencing Considerations
  • 5. Copyright © 2017 CEVA Inc. 5 • Performance & Energy State of the art CNN extremely demanding in computation and memory bandwidth, battery envelope steady and limited • Flexibility Deep Neural Networks research keeps evolving • New layer types, topologies frequently added Inferencing Solution Motivation Neural Networks Evolution • R-CNN, LE, Alexnet • Object Identification • Linear Networks • Faster R-CNN • Semantic segmentation • Separate networks for detection and classification • Yolo, SSD, Mask R-CNN • Single stage networks for Identification and Classification
  • 6. Copyright © 2017 CEVA Inc. 6 CEVA Deep Neural Network (CDNN) Flow OEM / Partner (offline) CEVA (offline + real-time)
  • 7. Copyright © 2017 CEVA Inc. 7 CDNN Toolkit CNN CEVA-CV CEVA-VX Comprehensive toolkit simplifying dev. & deployment of deep learning systems for mass-market inferencing
  • 8. Copyright © 2017 CEVA Inc. 8 • Comprehensive inferencing solution • Input: Trained network • Output: Optimal inference network on vision DSP + HW accelerator (optional) CDNN Toolkit – Solution Overview CDNN2 Runtime library CNN HWA Driver CDNN2 Application API CEVA-XMHost CPU CNN HWA CEVA CNN Hardware Accelerator CDNN2 ApplicationAPI Application CDNN2 Network Generator Offline - PC
  • 9. Copyright © 2017 CEVA Inc. 9 • CEVA Network Generator • Offline auto converts floating to fixed-point nets • Adapts for embedded constraints • Application API • High level interface for easy integration on host application • Responsible for connectivity between host and DSP • Runtime Library • Executes network inference • Optimized for CEVA-XM vision DSP • On-the-fly bandwidth optimizations • HWA Driver • Controls the CNN HWA CDNN Solution Overview (cont.) CDNN2 Runtime library CNN HWA Driver CDNN2 Application API CEVA-XMHost CPU CNN HWA CEVA CNN Hardware Accelerator CDNN2 ApplicationAPI Application CDNN2 Network Generator Offline - PC
  • 10. Copyright © 2017 CEVA Inc. 10 CNN Development Scene • Flexible and future proof, while providing top performance • Combines programmable DSP with hardware acceleration • Hardware accelerator focuses on demanding layers, boosting inference performance up to 12x* • DNN rapidly evolving, yet convolution layers consistently handling main function • Convolution layers are most demanding portion of inference CEVA Deep Neural Network Concept CDNN Solution Concept * vs. CEVA-XM4
  • 11. Copyright © 2017 CEVA Inc. 11 Flexible Embedded CNN Solution Flexible embedded solution and 16-bit support required to cope with evolving lead neural networks CEVA-XM Vision DSP CNN Hardware Accelerator • Controls full network execution • Invokes CNN HWA • Executes all other layers: Normalization, Pooling, Deconvolution, Etc. • Supports multiple CNN HWAs CDNN2 Real-Time SW Library • Up to 512 MAC units • 16 b x 16 b support • Executes convolutions • Internal memories • Internal DMA units • Autonomous execution CNN HWA V1
  • 12. Copyright © 2017 CEVA Inc. 12 Caffe (32-bit PC) vs. CDNN2 (16-bit Fixed) https://youtu.be/VnbCVFyuWYk Legend Running Deepglint proprietary network Original version CDNN version
  • 13. Copyright © 2017 CEVA Inc. 13 Example: Faster R-CNN Challenge Automatic network analysis & optimization via CDNN without user involvement Before High Bandwidth After Low Bandwidth Keeping Bit Accuracy
  • 14. Copyright © 2017 CEVA Inc. 14 • Network generator converts network to fixed point • Reduces size and bandwidth by combining layers and other methods • Auto partitions Yolo layers, balanced across multiple HWAs and DSP • Convolutions handled by HWAs • Utilizes local memories, avoids tiling, lowers latency • Yolo Tiny performance • Standalone DSP – 10s frames / sec • Hardware acc. – 100s frames / sec Example: YOLO Network on CDNN Solution utilizes DSP coupled with HWA, controlled via CDNN tools
  • 15. Copyright © 2017 CEVA Inc. 15 • 2nd gen SW framework support • Caffe and TensorFlow Frameworks • Various networks* • All network topologies • All the leading layers • Variable ROIs • “Push-button” conversion from pre-trained networks to optimized real-time • Accelerates AI deployment for embedded • Optimized for CEVA-XM DSPs and CNN HW accelerator CEVA Deep Neural Network (CDNN2) (*) Including AlexNet, GoogLeNet, ResNet, SegNet, VGG, NIN and others CDNN2 Wins China Electronics Market (CEM) Editor's Choice Award
  • 16. Copyright © 2017 CEVA Inc. 16 • An Analysis of Deep Neural Network Models for Practical Applications • Mask R-CNN + example of Inferencing for Mobile AI (Facebook F8) • CDNN on CEVA web page Come view CDNN live Running Alexnet, Googlenet, Yolo and more, @CEVA booth Additional Resources Live CDNN2 demo
  • 17. Copyright © 2017 CEVA Inc. 17 Thank You! Contact: Yairs@ceva-dsp.com
  • 18. Copyright © 2017 CEVA Inc. 18 Backup Materials
  • 19. Copyright © 2017 CEVA Inc. 19 Corporate Facts Worldwide Operations World Leading IP Supplier Corporate Introduction • Headquartered in Mountain View, Calif. • Publicly traded • NASDAQ:CEVA • Profitable, strong business and cash positive – over $150M cash • Licensing IP since 1991 • Low Power Signal Processing IP for a range of high volume applications: Communications, Wi-Fi, Bluetooth, Audio, Voice, Imaging & Vision, Storage • U.S. (Mountain View, CA, Austin, TX & Detroit, MI), Israel, Ireland, France, U.K, Sweden, China, Taiwan, Korea & Japan
  • 20. Copyright © 2017 CEVA Inc. 20 CEVA By Numbers More than 300 licensees to date >8 Billion CEVA-powered devices shipped worldwide to date 100 licensees of Wi-Fi & Bluetooth IP – and more than 1 billion chips shipped 3X the market share in DSP over any other DSP IP vendor 1 in 3 handsets worldwide are powered by CEVA DSP >5 billion DSP cores in audio/voice devices shipped to date >30 licensees for imaging and vision – shipping broadly in 2017
  • 21. Copyright © 2017 CEVA Inc. 21 CEVA IP Platforms Wireless (Short Range) Wireless (Long Range) Sensing Intelligence CEVA-XC LTE / 5G SDR CEVA-X LTE / 5G PHY Control LTE-IoT RivieraWaves Wi-Fi 802.11n/ac AP and Client RivieraWaves Bluetooth Dual mode, BLE and Audio over BLE CEVA-X Sense & Connect Hub CEVA-TeakLite-4 Voice Activation CEVA-XM Machine Vision and Deep Learning
  • 22. Copyright © 2017 CEVA Inc. 22 • Comprehensive vision IP platform • Centered on CEVA-XM6 Vision DSP • Enables mass market intelligent vision applications • Simplifies deep learning inferencing for low-power embedded devices CEVA’s 5th Generation Imaging & Vision Technology