SlideShare a Scribd company logo
1 of 25
Download to read offline
© 2020 Skydio, Inc
Tackling Extreme Visual Conditions
for Autonomous UAVs In the Wild
Hayk Martiros
Head of Autonomy, Skydio
September 2020
© 2020 Skydio, Inc 2
Skydio 2
Q4 2019 Launch
Consumer video drone
Built for autonomous flight
Six navigation cameras with 200 deg FOV
User camera w/ 3-axis gimbal
NVIDIA Tegra X2 chip-down design
$999
Skydio R1
Q1 2018
First autonomous robot warrantee of its kind:
If it crashes within our operating guidelines, we’ll replace it.
© 2020 Skydio, Inc
Skydio 2
© 2020 Skydio, Inc
Skydio X2
© 2020 Skydio, Inc
Skydio Autonomy Engine
5
© 2020 Skydio, Inc
Skydio Autonomy Engine
© 2020 Skydio, Inc
Extreme Visual Conditions
The challenge of an autonomous drone: Customer pays for the robot to fly itself in a place
we’ve never seen and quickly gains unconditional trust. Every limit is pushed.
• Few semantic priors - fly inside a tree, in a factory, at Burning Man, on a mountain
• No room for missing obstacles or failing state estimation, robot will crash or fly away
• No room for false positives, drone will be erratic and unpredictable
• Time is ticking! Limited battery life, limited connectivity, high speed motion
7
© 2020 Skydio, Inc
High-speed Follow + Film Scenario
8
© 2020 Skydio, Inc
Extreme Visual Conditions
Daunting set of challenges to visual navigation:
• Thin objects, extreme glare, shadows, camera artifacts, motion blur, dirt, smudges
• Waves, snow, rain, fog, desert dunes, salt flats, reflective surfaces, textureless walls/ceilings
• Calibration errors, rolling shutter errors, time-sync errors, vibrations, electrical interference
9
© 2020 Skydio, Inc 10
a) thin branches, b) sun glare with dirty lens, c) severe motion blur,
d) reflections, e) textureless sky or ceiling, f) water droplets
© 2020 Skydio, Inc
Rolling Shutter
Problem: Skydio 2 uses rolling shutter navigation
cameras (higher quality, lighter, cheaper). Need to
do geometric processing during high speed, high
rotation motions close to objects!
Solution
- Careful modeling of rolling shutter effects,
accounting for translation and rotation
- Accurate time synchronization between camera
triggers, exposure periods, IMUs
11
Global Shutter Rolling Shutter
pixel shift through exposure period
© 2020 Skydio, Inc
Lens Intrinsics
Problem: Environmental conditions significantly
change intrinsic lens properties. Need to estimate
online to maintain high quality mapping.
Solution: Visual-inertial odometry system jointly
estimates intrinsics, extrinsics, IMU biases, and
world features during flight, accounting for rolling
shutter.
12
stationary lens in temperature chamber
© 2020 Skydio, Inc
Moving Objects
13
Problem: All of the close visual features in this
scene are moving and malicious for state
estimation purposes.
Solutions:
+ Careful feature gating relative to IMU
+ Joint consideration of visual and GPS uncertainty
+ Semantic information
© 2020 Skydio, Inc
Real World Problems: Dirty Camera Detection
Problem: Dirt, dust, fingerprints, water
on the lenses can ruin photometric
consistency and cause false matches.
Solution: Estimate dirty regions of
cameras in flight.
14
© 2020 Skydio, Inc 15
Idea: Regions with poor photometric
consistency over a variety of background
content are likely from dirty cameras.
Aggregate from our matching algorithms
across time and vehicle rotations, use as
an invalid mask.
Continue flying, applying the mask, but
warn the user to land and clean lenses!
If the mask is a large portion of a camera,
go into a high-level error state and tell
the pilot (if possible) to land quickly.
Real World Problems: Dirty Camera Detection
© 2020 Skydio, Inc 16
Real World Problems: Dirty Camera Detection
© 2020 Skydio, Inc
Challenges for Learning Depth
No ground truth - infeasible to get human labels
Billions of images captured by S2 to learn from —> unsupervised learning!
• SOTA has not made this work yet. Photometric consistency fails.
17
Need to find ways to use leverage large amounts of
in-domain unlabeled data for extreme visual
conditions.
More acausal reasoning? Better features? Cycle
consistency?
© 2020 Skydio, Inc
Synthetic Generation
Synthetic augmentations of real data
allows forms of supervision without
knowing ground-truth.
Example: Generate suns on real data,
penalize prediction errors between
original and augmented images (or
expect uncertain prediction).
18
a) Examples of synthetic sun glare generated by simulating diffraction of light through
lens imperfections. b) Real sun (left) and augmented sun (right) on a real image.
© 2020 Skydio, Inc
Deep-Learned Depth Estimation
19
End-to-End Learning of Geometry and Context for Deep Stereo Regression
Kendall et al. ICCV 2017
KITTI 2012: #1 KITTI 2015: #2
~1 fps on a Titan X GPU
© 2020 Skydio, Inc
Deep-Learned Depth Estimation
20
© 2020 Skydio, Inc
Deep-Learned Depth Estimation Results
21
© 2020 Skydio, Inc
Intelligent Error Handing + Graceful Degradation
22
Comprehensive error handling is extremely difficult and requires complex integration of
the whole stack. This what separates lab demos from real products – there is no substitute.
© 2020 Skydio, Inc
Conclusions
23
Autonomous robots, especially UAVs, encounter extreme visual conditions - handling them
is the key to mainstream adoption. We are not close to the ceiling for this.
Requirements:
Product / user mindset in research.
Accurate modeling AND learning from data.
Dedicated time and iteration.
Hardware and rapid fearless testing ability.
Great software engineering.
© 2020 Skydio, Inc
3D Reconstruction
24
© 2020 Skydio, Inc
Resources
Skydio product info
https://www.skydio.com/
Skydio Blog
https://medium.com/skydio
Skydio 3D Scan
https://youtu.be/VxLXTeycyeE
Skydio House Scan
https://youtu.be/MSy_06aOBzg
Recorded MIT lecture on autonomy
https://youtu.be/R5K-IV3J8XM
“End-to-End Learning of Geometry and
Context for Deep Stereo Regression”
https://arxiv.org/abs/1703.04309
Hiring contact:
https://skydio.com/jobs
25

More Related Content

What's hot

Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
Lior Rokach
 
Iris recognition system
Iris recognition systemIris recognition system
Iris recognition system
Nilu Desai
 

What's hot (20)

The 5 key V's of Big Data
The 5 key V's of Big DataThe 5 key V's of Big Data
The 5 key V's of Big Data
 
AI Everywhere: How Microsoft is Democratizing AI
AI Everywhere: How Microsoft is Democratizing AIAI Everywhere: How Microsoft is Democratizing AI
AI Everywhere: How Microsoft is Democratizing AI
 
Advantages of Artificial Intelligence - Avantika University
Advantages of Artificial Intelligence - Avantika UniversityAdvantages of Artificial Intelligence - Avantika University
Advantages of Artificial Intelligence - Avantika University
 
Introduction to Computer Vision.pdf
Introduction to Computer Vision.pdfIntroduction to Computer Vision.pdf
Introduction to Computer Vision.pdf
 
Data Science - Part XVII - Deep Learning & Image Processing
Data Science - Part XVII - Deep Learning & Image ProcessingData Science - Part XVII - Deep Learning & Image Processing
Data Science - Part XVII - Deep Learning & Image Processing
 
FinTech, AI, Machine Learning in Finance
FinTech, AI, Machine Learning in FinanceFinTech, AI, Machine Learning in Finance
FinTech, AI, Machine Learning in Finance
 
IA 101 - Introduction to Artificial Intelligence
IA 101 - Introduction to Artificial Intelligence IA 101 - Introduction to Artificial Intelligence
IA 101 - Introduction to Artificial Intelligence
 
introduction to Artificial intelligence
introduction  to Artificial intelligenceintroduction  to Artificial intelligence
introduction to Artificial intelligence
 
Machine Learning for Fraud Detection
Machine Learning for Fraud DetectionMachine Learning for Fraud Detection
Machine Learning for Fraud Detection
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Optical character recognition of handwritten Arabic using hidden Markov models
Optical character recognition of handwritten Arabic using hidden Markov modelsOptical character recognition of handwritten Arabic using hidden Markov models
Optical character recognition of handwritten Arabic using hidden Markov models
 
Quantum cryptography
Quantum cryptographyQuantum cryptography
Quantum cryptography
 
Decision Intelligence: a new discipline emerges
Decision Intelligence: a new discipline emergesDecision Intelligence: a new discipline emerges
Decision Intelligence: a new discipline emerges
 
Iris recognition system
Iris recognition systemIris recognition system
Iris recognition system
 
fusion of Camera and lidar for autonomous driving I
fusion of Camera and lidar for autonomous driving Ifusion of Camera and lidar for autonomous driving I
fusion of Camera and lidar for autonomous driving I
 
Credit card fraud detection methods using Data-mining.pptx (2)
Credit card fraud detection methods using Data-mining.pptx (2)Credit card fraud detection methods using Data-mining.pptx (2)
Credit card fraud detection methods using Data-mining.pptx (2)
 
DC02. Interpretation of predictions
DC02. Interpretation of predictionsDC02. Interpretation of predictions
DC02. Interpretation of predictions
 
Philosophy of Artificial Intelligence
Philosophy of Artificial IntelligencePhilosophy of Artificial Intelligence
Philosophy of Artificial Intelligence
 
Iris scanning
Iris scanningIris scanning
Iris scanning
 
Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?
 

Similar to “Tackling Extreme Visual Conditions for Autonomous UAVs In the Wild,” a Presentation from Skydio

Daniel Bochicchio, Skybernetics - “Valuable Insights from On High: Drone use ...
Daniel Bochicchio, Skybernetics - “Valuable Insights from On High: Drone use ...Daniel Bochicchio, Skybernetics - “Valuable Insights from On High: Drone use ...
Daniel Bochicchio, Skybernetics - “Valuable Insights from On High: Drone use ...
Michael Hewitt, GISP
 
Improving Quality & Profits with Vision
Improving Quality & Profits with VisionImproving Quality & Profits with Vision
Improving Quality & Profits with Vision
Tim Seymour
 

Similar to “Tackling Extreme Visual Conditions for Autonomous UAVs In the Wild,” a Presentation from Skydio (20)

“Video Activity Recognition with Limited Data for Smart Home Applications,” a...
“Video Activity Recognition with Limited Data for Smart Home Applications,” a...“Video Activity Recognition with Limited Data for Smart Home Applications,” a...
“Video Activity Recognition with Limited Data for Smart Home Applications,” a...
 
Jonathan Waldern (DigiLens): DigiLens Switchable Bragg Grating Waveguide Opti...
Jonathan Waldern (DigiLens): DigiLens Switchable Bragg Grating Waveguide Opti...Jonathan Waldern (DigiLens): DigiLens Switchable Bragg Grating Waveguide Opti...
Jonathan Waldern (DigiLens): DigiLens Switchable Bragg Grating Waveguide Opti...
 
“Introduction to Simultaneous Localization and Mapping (SLAM),” a Presentatio...
“Introduction to Simultaneous Localization and Mapping (SLAM),” a Presentatio...“Introduction to Simultaneous Localization and Mapping (SLAM),” a Presentatio...
“Introduction to Simultaneous Localization and Mapping (SLAM),” a Presentatio...
 
"2D and 3D Sensing: Markets, Applications, and Technologies," a Presentation ...
"2D and 3D Sensing: Markets, Applications, and Technologies," a Presentation ..."2D and 3D Sensing: Markets, Applications, and Technologies," a Presentation ...
"2D and 3D Sensing: Markets, Applications, and Technologies," a Presentation ...
 
MEMS-Driven Laser Beam Scanning LiDAR: The Future of Variable Spatial Resolu...
MEMS-Driven Laser Beam Scanning LiDAR:  The Future of Variable Spatial Resolu...MEMS-Driven Laser Beam Scanning LiDAR:  The Future of Variable Spatial Resolu...
MEMS-Driven Laser Beam Scanning LiDAR: The Future of Variable Spatial Resolu...
 
Daniel Bochicchio, Skybernetics - “Valuable Insights from On High: Drone use ...
Daniel Bochicchio, Skybernetics - “Valuable Insights from On High: Drone use ...Daniel Bochicchio, Skybernetics - “Valuable Insights from On High: Drone use ...
Daniel Bochicchio, Skybernetics - “Valuable Insights from On High: Drone use ...
 
The Case for Smart Solar Trackers in Southeast Asia
The Case for Smart Solar Trackers in Southeast AsiaThe Case for Smart Solar Trackers in Southeast Asia
The Case for Smart Solar Trackers in Southeast Asia
 
May 2018 Embedded Vision Summit Vision Tank Start-up Competition Finalist Pre...
May 2018 Embedded Vision Summit Vision Tank Start-up Competition Finalist Pre...May 2018 Embedded Vision Summit Vision Tank Start-up Competition Finalist Pre...
May 2018 Embedded Vision Summit Vision Tank Start-up Competition Finalist Pre...
 
AIFrienz_Webinar_Tomomi_Research_Inc).pdf
AIFrienz_Webinar_Tomomi_Research_Inc).pdfAIFrienz_Webinar_Tomomi_Research_Inc).pdf
AIFrienz_Webinar_Tomomi_Research_Inc).pdf
 
Improving Quality & Profits with Vision
Improving Quality & Profits with VisionImproving Quality & Profits with Vision
Improving Quality & Profits with Vision
 
IRJET- Adroit Speculum for Institutional Updates (Smart Mirror)
IRJET- Adroit Speculum for Institutional Updates (Smart Mirror)IRJET- Adroit Speculum for Institutional Updates (Smart Mirror)
IRJET- Adroit Speculum for Institutional Updates (Smart Mirror)
 
IRJET- Adroit Speculum for Institutional Updates (Smart Mirror)
IRJET- Adroit Speculum for Institutional Updates (Smart Mirror)IRJET- Adroit Speculum for Institutional Updates (Smart Mirror)
IRJET- Adroit Speculum for Institutional Updates (Smart Mirror)
 
Making Aerial Micro-Drones a Reality (AIS306) - AWS re:Invent 2018
Making Aerial Micro-Drones a Reality (AIS306) - AWS re:Invent 2018Making Aerial Micro-Drones a Reality (AIS306) - AWS re:Invent 2018
Making Aerial Micro-Drones a Reality (AIS306) - AWS re:Invent 2018
 
How X-ray Technology is Improving the Electronics Assembly Process
How X-ray Technology is Improving the Electronics Assembly ProcessHow X-ray Technology is Improving the Electronics Assembly Process
How X-ray Technology is Improving the Electronics Assembly Process
 
"Is Vision the New Wireless?," a Presentation from Qualcomm
"Is Vision the New Wireless?," a Presentation from Qualcomm"Is Vision the New Wireless?," a Presentation from Qualcomm
"Is Vision the New Wireless?," a Presentation from Qualcomm
 
IRJET-Implementation of Image Processing using Augmented Reality
IRJET-Implementation of Image Processing using Augmented RealityIRJET-Implementation of Image Processing using Augmented Reality
IRJET-Implementation of Image Processing using Augmented Reality
 
IRJET-Smart Helmet with Live Map Navigation System.
IRJET-Smart Helmet with Live Map Navigation System.IRJET-Smart Helmet with Live Map Navigation System.
IRJET-Smart Helmet with Live Map Navigation System.
 
Cwin16 tls cnes-realite_augmentee_eng_v1 2
Cwin16 tls cnes-realite_augmentee_eng_v1 2Cwin16 tls cnes-realite_augmentee_eng_v1 2
Cwin16 tls cnes-realite_augmentee_eng_v1 2
 
Mitchell Reifel (pmdtechnologies ag): pmd Time-of-Flight – the Swiss Army Kni...
Mitchell Reifel (pmdtechnologies ag): pmd Time-of-Flight – the Swiss Army Kni...Mitchell Reifel (pmdtechnologies ag): pmd Time-of-Flight – the Swiss Army Kni...
Mitchell Reifel (pmdtechnologies ag): pmd Time-of-Flight – the Swiss Army Kni...
 
“Machine Learning for the Real World: What is Acceptable Accuracy, and How Ca...
“Machine Learning for the Real World: What is Acceptable Accuracy, and How Ca...“Machine Learning for the Real World: What is Acceptable Accuracy, and How Ca...
“Machine Learning for the Real World: What is Acceptable Accuracy, and How Ca...
 

More from Edge AI and Vision Alliance

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
Edge AI and Vision Alliance
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
Edge AI and Vision Alliance
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...
Edge AI and Vision Alliance
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
Edge AI and Vision Alliance
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
Edge AI and Vision Alliance
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
Edge AI and Vision Alliance
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara
Edge AI and Vision Alliance
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
Edge AI and Vision Alliance
 

More from Edge AI and Vision Alliance (20)

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Recently uploaded (20)

Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 

“Tackling Extreme Visual Conditions for Autonomous UAVs In the Wild,” a Presentation from Skydio

  • 1. © 2020 Skydio, Inc Tackling Extreme Visual Conditions for Autonomous UAVs In the Wild Hayk Martiros Head of Autonomy, Skydio September 2020
  • 2. © 2020 Skydio, Inc 2 Skydio 2 Q4 2019 Launch Consumer video drone Built for autonomous flight Six navigation cameras with 200 deg FOV User camera w/ 3-axis gimbal NVIDIA Tegra X2 chip-down design $999 Skydio R1 Q1 2018 First autonomous robot warrantee of its kind: If it crashes within our operating guidelines, we’ll replace it.
  • 3. © 2020 Skydio, Inc Skydio 2
  • 4. © 2020 Skydio, Inc Skydio X2
  • 5. © 2020 Skydio, Inc Skydio Autonomy Engine 5
  • 6. © 2020 Skydio, Inc Skydio Autonomy Engine
  • 7. © 2020 Skydio, Inc Extreme Visual Conditions The challenge of an autonomous drone: Customer pays for the robot to fly itself in a place we’ve never seen and quickly gains unconditional trust. Every limit is pushed. • Few semantic priors - fly inside a tree, in a factory, at Burning Man, on a mountain • No room for missing obstacles or failing state estimation, robot will crash or fly away • No room for false positives, drone will be erratic and unpredictable • Time is ticking! Limited battery life, limited connectivity, high speed motion 7
  • 8. © 2020 Skydio, Inc High-speed Follow + Film Scenario 8
  • 9. © 2020 Skydio, Inc Extreme Visual Conditions Daunting set of challenges to visual navigation: • Thin objects, extreme glare, shadows, camera artifacts, motion blur, dirt, smudges • Waves, snow, rain, fog, desert dunes, salt flats, reflective surfaces, textureless walls/ceilings • Calibration errors, rolling shutter errors, time-sync errors, vibrations, electrical interference 9
  • 10. © 2020 Skydio, Inc 10 a) thin branches, b) sun glare with dirty lens, c) severe motion blur, d) reflections, e) textureless sky or ceiling, f) water droplets
  • 11. © 2020 Skydio, Inc Rolling Shutter Problem: Skydio 2 uses rolling shutter navigation cameras (higher quality, lighter, cheaper). Need to do geometric processing during high speed, high rotation motions close to objects! Solution - Careful modeling of rolling shutter effects, accounting for translation and rotation - Accurate time synchronization between camera triggers, exposure periods, IMUs 11 Global Shutter Rolling Shutter pixel shift through exposure period
  • 12. © 2020 Skydio, Inc Lens Intrinsics Problem: Environmental conditions significantly change intrinsic lens properties. Need to estimate online to maintain high quality mapping. Solution: Visual-inertial odometry system jointly estimates intrinsics, extrinsics, IMU biases, and world features during flight, accounting for rolling shutter. 12 stationary lens in temperature chamber
  • 13. © 2020 Skydio, Inc Moving Objects 13 Problem: All of the close visual features in this scene are moving and malicious for state estimation purposes. Solutions: + Careful feature gating relative to IMU + Joint consideration of visual and GPS uncertainty + Semantic information
  • 14. © 2020 Skydio, Inc Real World Problems: Dirty Camera Detection Problem: Dirt, dust, fingerprints, water on the lenses can ruin photometric consistency and cause false matches. Solution: Estimate dirty regions of cameras in flight. 14
  • 15. © 2020 Skydio, Inc 15 Idea: Regions with poor photometric consistency over a variety of background content are likely from dirty cameras. Aggregate from our matching algorithms across time and vehicle rotations, use as an invalid mask. Continue flying, applying the mask, but warn the user to land and clean lenses! If the mask is a large portion of a camera, go into a high-level error state and tell the pilot (if possible) to land quickly. Real World Problems: Dirty Camera Detection
  • 16. © 2020 Skydio, Inc 16 Real World Problems: Dirty Camera Detection
  • 17. © 2020 Skydio, Inc Challenges for Learning Depth No ground truth - infeasible to get human labels Billions of images captured by S2 to learn from —> unsupervised learning! • SOTA has not made this work yet. Photometric consistency fails. 17 Need to find ways to use leverage large amounts of in-domain unlabeled data for extreme visual conditions. More acausal reasoning? Better features? Cycle consistency?
  • 18. © 2020 Skydio, Inc Synthetic Generation Synthetic augmentations of real data allows forms of supervision without knowing ground-truth. Example: Generate suns on real data, penalize prediction errors between original and augmented images (or expect uncertain prediction). 18 a) Examples of synthetic sun glare generated by simulating diffraction of light through lens imperfections. b) Real sun (left) and augmented sun (right) on a real image.
  • 19. © 2020 Skydio, Inc Deep-Learned Depth Estimation 19 End-to-End Learning of Geometry and Context for Deep Stereo Regression Kendall et al. ICCV 2017 KITTI 2012: #1 KITTI 2015: #2 ~1 fps on a Titan X GPU
  • 20. © 2020 Skydio, Inc Deep-Learned Depth Estimation 20
  • 21. © 2020 Skydio, Inc Deep-Learned Depth Estimation Results 21
  • 22. © 2020 Skydio, Inc Intelligent Error Handing + Graceful Degradation 22 Comprehensive error handling is extremely difficult and requires complex integration of the whole stack. This what separates lab demos from real products – there is no substitute.
  • 23. © 2020 Skydio, Inc Conclusions 23 Autonomous robots, especially UAVs, encounter extreme visual conditions - handling them is the key to mainstream adoption. We are not close to the ceiling for this. Requirements: Product / user mindset in research. Accurate modeling AND learning from data. Dedicated time and iteration. Hardware and rapid fearless testing ability. Great software engineering.
  • 24. © 2020 Skydio, Inc 3D Reconstruction 24
  • 25. © 2020 Skydio, Inc Resources Skydio product info https://www.skydio.com/ Skydio Blog https://medium.com/skydio Skydio 3D Scan https://youtu.be/VxLXTeycyeE Skydio House Scan https://youtu.be/MSy_06aOBzg Recorded MIT lecture on autonomy https://youtu.be/R5K-IV3J8XM “End-to-End Learning of Geometry and Context for Deep Stereo Regression” https://arxiv.org/abs/1703.04309 Hiring contact: https://skydio.com/jobs 25