SlideShare a Scribd company logo
A New AI Platform Architecture
for the Smart Toys of the
Future
Gabriel Costache
Senior R&D Director
XPERI
40+
offices worldwide
headquarters in San
Jose, CA
$1.5B
+
market cap
public company,
trading under XPER
1,600
+
employees
worldwide
1,500
+
engineers
11,000
+
patent assets
100B+
devices worldwide
empowered by
technologies
delivered via Xperi
brands
• Safe
• Secure
• Private
• Enhances child development
• Uses natural interaction
• Monitors child cognitive load
• Develops with the child
• Long battery life
• Re-usable
Ideal Smart Toy
3
© 2022 XPERI
Smart Toy Examples
4
© 2022 XPERI
Privacy Issues
5
2022 XPERI
• Data privacy
• Safety
• Battery life
• Fast response
• AI technologies for children
• Data bias in AI
• Natural interaction with children
• Multimodal: audio, imaging, sensing
Smart Toy Challenges
6
© 2022 XPERI
DTIF (Disruptive Technology Innovation Fund)
D.A.V.I.D
DAVID will develop a “privacy by design” AI platform, capable of multi-modal, ultra-low power
consumption, “data center” level processing of audio and vision data on-device, without the need to
transmit any personal data to the cloud.
What DAVID will deliver to the smart toy market:
• A platform for a wide range of learning and interactive applications in the toy market
• A smart, trusted proof-of-concept toy using this platform that helps children learn and develop, using XPERI imaging
technology, Perceive® Ergo® chip and SoapBox Labs speech technology capabilities in collaboration with the National
University of Ireland, Galway.
• Cloud-free capabilities to ensure privacy and wonderfully immersive user experiences for children of all abilities.
DAVID – Data-center Audio/Video Intelligence
on Device
7
© 2022 XPERI
All-in-one Chip/Platform
Designed for Privacy
Multi-modal Platform Communication
Speech, Expressions, Emotions, Gesture, Context and
others..
• Perception
• Imaging/Vision
• Face Analytics
• Body Analytics
• Hand Analytics
• Video Compression
• Thermal Imaging
• Audio
• Wake Words / VAD
• Speech2Text / ASR
• Voice Analytics / Biometrics
• Sensing
AI Technologies to be Considered
8
© 2022 XPERI
• Interaction
• Visual
• Audio
• Text2Speech
• Sound Generation
• Others
• Language Models / Conversational Models
• Multi Modal Intent
• Cognitive and Behaviour Analysis
• Personalization
• Interactive Games
Perceive® Ergo® AI Processor
9
Source: A Reuther et al. MIT Lincoln Laboratory Supercomputing Center-arXiv:2009.00993
Ergo*
*Note: Ergo uses a proprietary representation. Ergo is not INT8.
© 2022 XPERI
DAVID Platform Design
10
© 2022 XPERI
• Interfaces:
- I2S (Tx, Rx), I2C (Tx, Rx) – (HUB and Ergo)
- MIPI and Parallel (Ergo)
- SPI & QSPI (HUB & Ergo)
- GPIO (HUB and Ergo)
- FTDI (JTAG, UART) (HUB)
- WiFi/BT (HUB)
- USB OTG (HUB)
• Computation Units:
- 3 x Ergo (55 TOPS/Watt + Arc CPUDSP)
- HUB STM32 MCU (Arm M7)
- ESP32 (2x Xtensa LX6)
• Memory:
- 16MB QSPI Flash (Ergo)
- 128MB QSPI Flash + 32MB SRAM (HUB)
- 448 KB ROM + 520 KB SRAM (ESP32)
- SDCard (HUB)
DAVID Platform Specifications
11
© 2022 XPERI
DAVID Toy PoC
12
© 2022 XPERI
microphones
camera
Thermal
LCDs
PIR
Speaker
Contacts
Wireless
charging
Boards, battery
& sensors
Current Ergo Vision Application
13
© 2022 XPERI
Face, Body & Hand
Detection
Facial Analytics FR CNN
Face Alignment
ERGO
x, y, w, h, confidence,
trackID
Facial Landmarks
Face Orientation
Face Expression
Face Embedding FR
x1,y1,
x2,y2
….
Tx, Ty, Rot, Scale
x, y, w, h
Body Analytics
Body Landmarks/Skeleton
Hand Analytics
Hand Gestures
Video Encoder
Encoded stream
1 2 3
4
5
6
Example Ergo Application
• Frame rate 30 fps
• Resolution 320x320
• Power ~100 mW
Fully neural video encoder (Ergo) and decoder (generic)
• Trained end-to-end
• Custom stream – data privacy
• Extra security can be added
• Y only currently but can be easily extended to color
• Enabler of other image enhancement technologies: colorization, super resolution
• Can enable smart monitoring
Video Encoding
14
© 2022 XPERI
ERGO
Video Encoder
Camera
MIPI/Parallel Stream Packing
Hub
Streaming App
Video Decoder
ONNX, TFLite, NNAPI
Mobile App
Decoded Frame
Hub
• Current Ergo board 3 application Text2Speech -> spectrogram generation +
vocoder
• Focus on comprehension, less on naturalness
• Next focus on: voice adaptation, voice cloning
• Extend to sound/music generation
Speech/Audio Neural Synthesis
15
© 2022 XPERI
powers magical and joyful
experiences for kids using speech technology
that is engaging, fun, and frictionless.
PLAY
DAVID Partners
NUIG C3I - Center for Computational,
Cognitive & Connected Imaging
© 2022 XPERI 16
• Smart Toy requirements:
• Privacy
• Battery life
• Multimodal interaction
• Platform requirements:
• Dedicated NN unit with very high OPs/W
• Communication unit
• Multiple sensor support
• Generic processing unit
• DAVID platform and toy PoC
• Available Q3/Q4 2022 for selected partners
Conclusions
17
© 2022 XPERI
Resources
• Xperi – www.Xperi.com
• Perceive, Ergo – www.perceive.io
• SoapBox Labs – www.soapboxlabs.com
• C3I, National University of Ireland, Galway - www.nuigalway.ie/c3i
• Disruptive Technologies Innovation Fund – DTIF
• STMicroelectronics STM32 MCU
• Espressif Systems ESP32
Resources
© 2022 XPERI 19

More Related Content

Similar to “A New AI Platform Architecture for the Smart Toys of the Future,” a Presentation from Xperi

AiLIbrary White paper05
AiLIbrary White paper05AiLIbrary White paper05
AiLIbrary White paper05
Gordon Kraft
 
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration CenterIntel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
IntelAPAC
 
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration CenterIntel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
IntelAPAC
 
The AI Takeover in Hollywood by Yves Bergquist
The AI Takeover in Hollywood by Yves BergquistThe AI Takeover in Hollywood by Yves Bergquist
The AI Takeover in Hollywood by Yves Bergquist
Data Con LA
 
google glass
google glassgoogle glass
google glass
Vipin Sudhakar
 
NUX Presentation from TechMixer Birmingham 2011
NUX Presentation from TechMixer Birmingham 2011NUX Presentation from TechMixer Birmingham 2011
NUX Presentation from TechMixer Birmingham 2011
Michael Heydt
 
AiLibrary Whitepaper 2
AiLibrary Whitepaper 2AiLibrary Whitepaper 2
AiLibrary Whitepaper 2
Gordon Kraft
 
HPE Discover 2017 - Internet of Things Program Guide
HPE Discover 2017 - Internet of Things Program GuideHPE Discover 2017 - Internet of Things Program Guide
HPE Discover 2017 - Internet of Things Program Guide
Isaac Rodriguez
 
Hololens
HololensHololens
Taller IoT en la Actualidad
Taller IoT en la ActualidadTaller IoT en la Actualidad
Taller IoT en la Actualidad
Laurence HR
 
Unity: What does it take to port a browser title to mobiles
Unity: What does it take to port a browser title to mobilesUnity: What does it take to port a browser title to mobiles
Unity: What does it take to port a browser title to mobiles
DevGAMM Conference
 
IT@Intel: Creating Smart Spaces with All-in-Ones
IT@Intel:  Creating Smart Spaces with All-in-OnesIT@Intel:  Creating Smart Spaces with All-in-Ones
IT@Intel: Creating Smart Spaces with All-in-Ones
IT@Intel
 
IT@Intel: Creating Smart Spaces with All-in-Ones
IT@Intel: Creating Smart Spaces with All-in-OnesIT@Intel: Creating Smart Spaces with All-in-Ones
IT@Intel: Creating Smart Spaces with All-in-Ones
Intel IT Center
 
Google glass and the wearable revolution - NYCCamp 2013
Google glass and the wearable revolution - NYCCamp 2013Google glass and the wearable revolution - NYCCamp 2013
Google glass and the wearable revolution - NYCCamp 2013
Frank Carey
 
Dell NVIDIA AI Roadshow - South Western Ontario
Dell NVIDIA AI Roadshow - South Western OntarioDell NVIDIA AI Roadshow - South Western Ontario
Dell NVIDIA AI Roadshow - South Western Ontario
Bill Wong
 
Realsense only STAGE 01 - Firstman Marpaung
Realsense only STAGE 01 - Firstman Marpaung Realsense only STAGE 01 - Firstman Marpaung
Realsense only STAGE 01 - Firstman Marpaung
binusgamelab
 
The Internet of Things and You - A Developers Guide to IoT
The Internet of Things and You - A Developers Guide to IoTThe Internet of Things and You - A Developers Guide to IoT
The Internet of Things and You - A Developers Guide to IoT
Jim McKeeth
 
Telepresence Cisco
Telepresence CiscoTelepresence Cisco
Telepresence Cisco
Sunmedia Corporation
 
Robotic design: Frontiers in visual and tactile sensing
Robotic design: Frontiers in visual and tactile sensingRobotic design: Frontiers in visual and tactile sensing
Robotic design: Frontiers in visual and tactile sensing
Design World
 
Ai Development Company
Ai Development CompanyAi Development Company
Ai Development Company
Ruchir Kakkad
 

Similar to “A New AI Platform Architecture for the Smart Toys of the Future,” a Presentation from Xperi (20)

AiLIbrary White paper05
AiLIbrary White paper05AiLIbrary White paper05
AiLIbrary White paper05
 
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration CenterIntel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
 
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration CenterIntel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
 
The AI Takeover in Hollywood by Yves Bergquist
The AI Takeover in Hollywood by Yves BergquistThe AI Takeover in Hollywood by Yves Bergquist
The AI Takeover in Hollywood by Yves Bergquist
 
google glass
google glassgoogle glass
google glass
 
NUX Presentation from TechMixer Birmingham 2011
NUX Presentation from TechMixer Birmingham 2011NUX Presentation from TechMixer Birmingham 2011
NUX Presentation from TechMixer Birmingham 2011
 
AiLibrary Whitepaper 2
AiLibrary Whitepaper 2AiLibrary Whitepaper 2
AiLibrary Whitepaper 2
 
HPE Discover 2017 - Internet of Things Program Guide
HPE Discover 2017 - Internet of Things Program GuideHPE Discover 2017 - Internet of Things Program Guide
HPE Discover 2017 - Internet of Things Program Guide
 
Hololens
HololensHololens
Hololens
 
Taller IoT en la Actualidad
Taller IoT en la ActualidadTaller IoT en la Actualidad
Taller IoT en la Actualidad
 
Unity: What does it take to port a browser title to mobiles
Unity: What does it take to port a browser title to mobilesUnity: What does it take to port a browser title to mobiles
Unity: What does it take to port a browser title to mobiles
 
IT@Intel: Creating Smart Spaces with All-in-Ones
IT@Intel:  Creating Smart Spaces with All-in-OnesIT@Intel:  Creating Smart Spaces with All-in-Ones
IT@Intel: Creating Smart Spaces with All-in-Ones
 
IT@Intel: Creating Smart Spaces with All-in-Ones
IT@Intel: Creating Smart Spaces with All-in-OnesIT@Intel: Creating Smart Spaces with All-in-Ones
IT@Intel: Creating Smart Spaces with All-in-Ones
 
Google glass and the wearable revolution - NYCCamp 2013
Google glass and the wearable revolution - NYCCamp 2013Google glass and the wearable revolution - NYCCamp 2013
Google glass and the wearable revolution - NYCCamp 2013
 
Dell NVIDIA AI Roadshow - South Western Ontario
Dell NVIDIA AI Roadshow - South Western OntarioDell NVIDIA AI Roadshow - South Western Ontario
Dell NVIDIA AI Roadshow - South Western Ontario
 
Realsense only STAGE 01 - Firstman Marpaung
Realsense only STAGE 01 - Firstman Marpaung Realsense only STAGE 01 - Firstman Marpaung
Realsense only STAGE 01 - Firstman Marpaung
 
The Internet of Things and You - A Developers Guide to IoT
The Internet of Things and You - A Developers Guide to IoTThe Internet of Things and You - A Developers Guide to IoT
The Internet of Things and You - A Developers Guide to IoT
 
Telepresence Cisco
Telepresence CiscoTelepresence Cisco
Telepresence Cisco
 
Robotic design: Frontiers in visual and tactile sensing
Robotic design: Frontiers in visual and tactile sensingRobotic design: Frontiers in visual and tactile sensing
Robotic design: Frontiers in visual and tactile sensing
 
Ai Development Company
Ai Development CompanyAi Development Company
Ai Development Company
 

More from Edge AI and Vision Alliance

“Squeezing the Last Milliwatt and Cubic Millimeter from Smart Cameras Using t...
“Squeezing the Last Milliwatt and Cubic Millimeter from Smart Cameras Using t...“Squeezing the Last Milliwatt and Cubic Millimeter from Smart Cameras Using t...
“Squeezing the Last Milliwatt and Cubic Millimeter from Smart Cameras Using t...
Edge AI and Vision Alliance
 
"Maximize Your AI Compatibility with Flexible Pre- and Post-processing," a Pr...
"Maximize Your AI Compatibility with Flexible Pre- and Post-processing," a Pr..."Maximize Your AI Compatibility with Flexible Pre- and Post-processing," a Pr...
"Maximize Your AI Compatibility with Flexible Pre- and Post-processing," a Pr...
Edge AI and Vision Alliance
 
“Addressing Tomorrow’s Sensor Fusion and Processing Needs with Cadence’s Newe...
“Addressing Tomorrow’s Sensor Fusion and Processing Needs with Cadence’s Newe...“Addressing Tomorrow’s Sensor Fusion and Processing Needs with Cadence’s Newe...
“Addressing Tomorrow’s Sensor Fusion and Processing Needs with Cadence’s Newe...
Edge AI and Vision Alliance
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
Edge AI and Vision Alliance
 
“Silicon Slip-ups: The Ten Most Common Errors Processor Suppliers Make (Numbe...
“Silicon Slip-ups: The Ten Most Common Errors Processor Suppliers Make (Numbe...“Silicon Slip-ups: The Ten Most Common Errors Processor Suppliers Make (Numbe...
“Silicon Slip-ups: The Ten Most Common Errors Processor Suppliers Make (Numbe...
Edge AI and Vision Alliance
 
“How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-eff...
“How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-eff...“How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-eff...
“How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-eff...
Edge AI and Vision Alliance
 
“How Arm’s Machine Learning Solution Enables Vision Transformers at the Edge,...
“How Arm’s Machine Learning Solution Enables Vision Transformers at the Edge,...“How Arm’s Machine Learning Solution Enables Vision Transformers at the Edge,...
“How Arm’s Machine Learning Solution Enables Vision Transformers at the Edge,...
Edge AI and Vision Alliance
 
“Nx EVOS: A New Enterprise Operating System for Video and Visual AI,” a Prese...
“Nx EVOS: A New Enterprise Operating System for Video and Visual AI,” a Prese...“Nx EVOS: A New Enterprise Operating System for Video and Visual AI,” a Prese...
“Nx EVOS: A New Enterprise Operating System for Video and Visual AI,” a Prese...
Edge AI and Vision Alliance
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
Edge AI and Vision Alliance
 
"OpenCV for High-performance, Low-power Vision Applications on Snapdragon," a...
"OpenCV for High-performance, Low-power Vision Applications on Snapdragon," a..."OpenCV for High-performance, Low-power Vision Applications on Snapdragon," a...
"OpenCV for High-performance, Low-power Vision Applications on Snapdragon," a...
Edge AI and Vision Alliance
 
“Deploying Large Models on the Edge: Success Stories and Challenges,” a Prese...
“Deploying Large Models on the Edge: Success Stories and Challenges,” a Prese...“Deploying Large Models on the Edge: Success Stories and Challenges,” a Prese...
“Deploying Large Models on the Edge: Success Stories and Challenges,” a Prese...
Edge AI and Vision Alliance
 
“Scaling Vision-based Edge AI Solutions: From Prototype to Global Deployment,...
“Scaling Vision-based Edge AI Solutions: From Prototype to Global Deployment,...“Scaling Vision-based Edge AI Solutions: From Prototype to Global Deployment,...
“Scaling Vision-based Edge AI Solutions: From Prototype to Global Deployment,...
Edge AI and Vision Alliance
 
“What’s Next in On-device Generative AI,” a Presentation from Qualcomm
“What’s Next in On-device Generative AI,” a Presentation from Qualcomm“What’s Next in On-device Generative AI,” a Presentation from Qualcomm
“What’s Next in On-device Generative AI,” a Presentation from Qualcomm
Edge AI and Vision Alliance
 
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
Edge AI and Vision Alliance
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
Edge AI and Vision Alliance
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
Edge AI and Vision Alliance
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
Edge AI and Vision Alliance
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
Edge AI and Vision Alliance
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
Edge AI and Vision Alliance
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...
Edge AI and Vision Alliance
 

More from Edge AI and Vision Alliance (20)

“Squeezing the Last Milliwatt and Cubic Millimeter from Smart Cameras Using t...
“Squeezing the Last Milliwatt and Cubic Millimeter from Smart Cameras Using t...“Squeezing the Last Milliwatt and Cubic Millimeter from Smart Cameras Using t...
“Squeezing the Last Milliwatt and Cubic Millimeter from Smart Cameras Using t...
 
"Maximize Your AI Compatibility with Flexible Pre- and Post-processing," a Pr...
"Maximize Your AI Compatibility with Flexible Pre- and Post-processing," a Pr..."Maximize Your AI Compatibility with Flexible Pre- and Post-processing," a Pr...
"Maximize Your AI Compatibility with Flexible Pre- and Post-processing," a Pr...
 
“Addressing Tomorrow’s Sensor Fusion and Processing Needs with Cadence’s Newe...
“Addressing Tomorrow’s Sensor Fusion and Processing Needs with Cadence’s Newe...“Addressing Tomorrow’s Sensor Fusion and Processing Needs with Cadence’s Newe...
“Addressing Tomorrow’s Sensor Fusion and Processing Needs with Cadence’s Newe...
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
 
“Silicon Slip-ups: The Ten Most Common Errors Processor Suppliers Make (Numbe...
“Silicon Slip-ups: The Ten Most Common Errors Processor Suppliers Make (Numbe...“Silicon Slip-ups: The Ten Most Common Errors Processor Suppliers Make (Numbe...
“Silicon Slip-ups: The Ten Most Common Errors Processor Suppliers Make (Numbe...
 
“How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-eff...
“How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-eff...“How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-eff...
“How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-eff...
 
“How Arm’s Machine Learning Solution Enables Vision Transformers at the Edge,...
“How Arm’s Machine Learning Solution Enables Vision Transformers at the Edge,...“How Arm’s Machine Learning Solution Enables Vision Transformers at the Edge,...
“How Arm’s Machine Learning Solution Enables Vision Transformers at the Edge,...
 
“Nx EVOS: A New Enterprise Operating System for Video and Visual AI,” a Prese...
“Nx EVOS: A New Enterprise Operating System for Video and Visual AI,” a Prese...“Nx EVOS: A New Enterprise Operating System for Video and Visual AI,” a Prese...
“Nx EVOS: A New Enterprise Operating System for Video and Visual AI,” a Prese...
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
 
"OpenCV for High-performance, Low-power Vision Applications on Snapdragon," a...
"OpenCV for High-performance, Low-power Vision Applications on Snapdragon," a..."OpenCV for High-performance, Low-power Vision Applications on Snapdragon," a...
"OpenCV for High-performance, Low-power Vision Applications on Snapdragon," a...
 
“Deploying Large Models on the Edge: Success Stories and Challenges,” a Prese...
“Deploying Large Models on the Edge: Success Stories and Challenges,” a Prese...“Deploying Large Models on the Edge: Success Stories and Challenges,” a Prese...
“Deploying Large Models on the Edge: Success Stories and Challenges,” a Prese...
 
“Scaling Vision-based Edge AI Solutions: From Prototype to Global Deployment,...
“Scaling Vision-based Edge AI Solutions: From Prototype to Global Deployment,...“Scaling Vision-based Edge AI Solutions: From Prototype to Global Deployment,...
“Scaling Vision-based Edge AI Solutions: From Prototype to Global Deployment,...
 
“What’s Next in On-device Generative AI,” a Presentation from Qualcomm
“What’s Next in On-device Generative AI,” a Presentation from Qualcomm“What’s Next in On-device Generative AI,” a Presentation from Qualcomm
“What’s Next in On-device Generative AI,” a Presentation from Qualcomm
 
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...
 

Recently uploaded

Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
saastr
 
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Pitangent Analytics & Technology Solutions Pvt. Ltd
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
Hiroshi SHIBATA
 
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
saastr
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Alpen-Adria-Universität
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
Miro Wengner
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
ScyllaDB
 
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their MainframeDigital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Precisely
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
Safe Software
 
AppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSFAppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSF
Ajin Abraham
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
Alex Pruden
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
Javier Junquera
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
MichaelKnudsen27
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
DanBrown980551
 

Recently uploaded (20)

Artificial Intelligence and Electronic Warfare
Artificial Intelligence and Electronic WarfareArtificial Intelligence and Electronic Warfare
Artificial Intelligence and Electronic Warfare
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
 
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
 
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
 
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their MainframeDigital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
 
AppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSFAppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSF
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
 

“A New AI Platform Architecture for the Smart Toys of the Future,” a Presentation from Xperi

  • 1. A New AI Platform Architecture for the Smart Toys of the Future Gabriel Costache Senior R&D Director XPERI
  • 2. 40+ offices worldwide headquarters in San Jose, CA $1.5B + market cap public company, trading under XPER 1,600 + employees worldwide 1,500 + engineers 11,000 + patent assets 100B+ devices worldwide empowered by technologies delivered via Xperi brands
  • 3. • Safe • Secure • Private • Enhances child development • Uses natural interaction • Monitors child cognitive load • Develops with the child • Long battery life • Re-usable Ideal Smart Toy 3 © 2022 XPERI
  • 6. • Data privacy • Safety • Battery life • Fast response • AI technologies for children • Data bias in AI • Natural interaction with children • Multimodal: audio, imaging, sensing Smart Toy Challenges 6 © 2022 XPERI DTIF (Disruptive Technology Innovation Fund) D.A.V.I.D
  • 7. DAVID will develop a “privacy by design” AI platform, capable of multi-modal, ultra-low power consumption, “data center” level processing of audio and vision data on-device, without the need to transmit any personal data to the cloud. What DAVID will deliver to the smart toy market: • A platform for a wide range of learning and interactive applications in the toy market • A smart, trusted proof-of-concept toy using this platform that helps children learn and develop, using XPERI imaging technology, Perceive® Ergo® chip and SoapBox Labs speech technology capabilities in collaboration with the National University of Ireland, Galway. • Cloud-free capabilities to ensure privacy and wonderfully immersive user experiences for children of all abilities. DAVID – Data-center Audio/Video Intelligence on Device 7 © 2022 XPERI All-in-one Chip/Platform Designed for Privacy Multi-modal Platform Communication Speech, Expressions, Emotions, Gesture, Context and others..
  • 8. • Perception • Imaging/Vision • Face Analytics • Body Analytics • Hand Analytics • Video Compression • Thermal Imaging • Audio • Wake Words / VAD • Speech2Text / ASR • Voice Analytics / Biometrics • Sensing AI Technologies to be Considered 8 © 2022 XPERI • Interaction • Visual • Audio • Text2Speech • Sound Generation • Others • Language Models / Conversational Models • Multi Modal Intent • Cognitive and Behaviour Analysis • Personalization • Interactive Games
  • 9. Perceive® Ergo® AI Processor 9 Source: A Reuther et al. MIT Lincoln Laboratory Supercomputing Center-arXiv:2009.00993 Ergo* *Note: Ergo uses a proprietary representation. Ergo is not INT8. © 2022 XPERI
  • 11. • Interfaces: - I2S (Tx, Rx), I2C (Tx, Rx) – (HUB and Ergo) - MIPI and Parallel (Ergo) - SPI & QSPI (HUB & Ergo) - GPIO (HUB and Ergo) - FTDI (JTAG, UART) (HUB) - WiFi/BT (HUB) - USB OTG (HUB) • Computation Units: - 3 x Ergo (55 TOPS/Watt + Arc CPUDSP) - HUB STM32 MCU (Arm M7) - ESP32 (2x Xtensa LX6) • Memory: - 16MB QSPI Flash (Ergo) - 128MB QSPI Flash + 32MB SRAM (HUB) - 448 KB ROM + 520 KB SRAM (ESP32) - SDCard (HUB) DAVID Platform Specifications 11 © 2022 XPERI
  • 12. DAVID Toy PoC 12 © 2022 XPERI microphones camera Thermal LCDs PIR Speaker Contacts Wireless charging Boards, battery & sensors
  • 13. Current Ergo Vision Application 13 © 2022 XPERI Face, Body & Hand Detection Facial Analytics FR CNN Face Alignment ERGO x, y, w, h, confidence, trackID Facial Landmarks Face Orientation Face Expression Face Embedding FR x1,y1, x2,y2 …. Tx, Ty, Rot, Scale x, y, w, h Body Analytics Body Landmarks/Skeleton Hand Analytics Hand Gestures Video Encoder Encoded stream 1 2 3 4 5 6 Example Ergo Application • Frame rate 30 fps • Resolution 320x320 • Power ~100 mW
  • 14. Fully neural video encoder (Ergo) and decoder (generic) • Trained end-to-end • Custom stream – data privacy • Extra security can be added • Y only currently but can be easily extended to color • Enabler of other image enhancement technologies: colorization, super resolution • Can enable smart monitoring Video Encoding 14 © 2022 XPERI ERGO Video Encoder Camera MIPI/Parallel Stream Packing Hub Streaming App Video Decoder ONNX, TFLite, NNAPI Mobile App Decoded Frame Hub
  • 15. • Current Ergo board 3 application Text2Speech -> spectrogram generation + vocoder • Focus on comprehension, less on naturalness • Next focus on: voice adaptation, voice cloning • Extend to sound/music generation Speech/Audio Neural Synthesis 15 © 2022 XPERI
  • 16. powers magical and joyful experiences for kids using speech technology that is engaging, fun, and frictionless. PLAY DAVID Partners NUIG C3I - Center for Computational, Cognitive & Connected Imaging © 2022 XPERI 16
  • 17. • Smart Toy requirements: • Privacy • Battery life • Multimodal interaction • Platform requirements: • Dedicated NN unit with very high OPs/W • Communication unit • Multiple sensor support • Generic processing unit • DAVID platform and toy PoC • Available Q3/Q4 2022 for selected partners Conclusions 17 © 2022 XPERI
  • 19. • Xperi – www.Xperi.com • Perceive, Ergo – www.perceive.io • SoapBox Labs – www.soapboxlabs.com • C3I, National University of Ireland, Galway - www.nuigalway.ie/c3i • Disruptive Technologies Innovation Fund – DTIF • STMicroelectronics STM32 MCU • Espressif Systems ESP32 Resources © 2022 XPERI 19