SlideShare a Scribd company logo
1 of 30
AI NEXTCon Silicon Valley 18
SANTA CLARA | APRIL 10-13
#ainextcon
Adrian Kaehler, CEO Giant.AI
} Author: Learning OpenCV and Learning OpenCV 3
} Winner 2005 DARPA Grand Challenge – wrote
vision system
} Vice President and head of Robotics and Machine
Learning at Applied Minds for 8 years, built
autonomous vehicles for JIEDDO for the Iraq and
Afghanistan wars.
} Magic Leap
} Founder: Silicon Valley Deep Learning Group
} Founder and CEO: Giant.AI
4/13/2018 2
} Introduction
} Section I: Turning point, DARPA challenges, and old approaches
} Section II: From then until now, important events since
} Wrap Up: What’s now and what’s coming.
4/13/2018 3Adrian Kaehler
4/13/2018 4
Images and text courtesy of the Computer History Museum
Illustration for Popular Mechanics article
on automated highways, 1961. Credit:
Popular Mechanics
Sketch of a pre-programmed clockwork cart by
Leonardo Da Vinci, circa 1478
Had it been built, this cart would have been powered
by large coiled clockwork springs, propelling it over
130 feet. The clever control mechanism could have
taken the vehicle through a predetermined course.
Credit: Biblioteca Ambrosiana, Milan, Italy / De
Agostini Picture Library / Metis e Mida Informatica /
Veneranda Biblioteca Ambrosiana / The Bridgeman
Art Library
Stanford Artificial Intelligence Laboratory Cart,
1964-71. Along with Shakey the robot at
neighboring SRI, the Stanford Cart pioneered
techniques for navigating through an unfamiliar
environment with artificial intelligence and
machine vision. The cart famously wandered into
a nearby road, but survived unscathed. Credit: ©
Mark Richards
} 1991: USDOT mandate to "demonstrate an automated vehicle and highway system by 1997”
} 1994: Daimler-Benz and Ernst Dickmanns of UniBwM drove more than 1,000 km in Paris demonstrating
autonomous driving in free lanes, convoy driving, and lane changes with autonomous passing of other
cars
} 1995: CMU Navlab project completed a 5,000 km cross-country journey, of which 98.2% was
autonomously controlled. NavLab projects heavily used computer vision. In one case, steering was a
neural network controlled using video images.
} 1995: Dichmanns demonstrates the use of saccadic computer vision for control.
} 1996: ARGO project uses stereo vision to find lane markings and drives almost 2000km.
} 1997: “Demo 97” in San Diego major carmakers showed WHAT DID THEY SHOW.
} 1999: First driverless transportation of human passengers by the Netherlands’ ParkShuttle project using
embedded magnets (not computer vision).
4/13/2018 5
During this period, LIDAR systems were extremely
expensive and did not play a large role in
autonomous driving projects. The general belief
among researchers was that cameras and
computer vision techniques would solve this
problem.
Adrian Kaehler
4/13/2018 6
• 1985 – ALV: Road Following Demonstration: Vehicle traverses a 2 km at 10 km/hr.
Forward motion only, with no obstacle avoidance required.
• 1986 - ALV: Obstacle Avoidance Demonstration: Vehicle traverses 5 km road
course at speeds up to 20 km/hr; must recognize and maneuver to avoid fixed
objects that are small with respect to road width.
• 1987 – ALV: Cross-country Route Planning Demonstration: Vehicle plans and
executes a 5 km traverse of open desert terrain. Demonstrates soil and ground
cover typing.
• 1988 - ALV: Road Network Route Planning and Obstacle Avoidance Demonstration:
Vehicle plans and executes a 20 km point-to-point traverse through a road using
landmarks as navigation aids. Demonstration includes map updating and off-road
maneuvering to avoid obstacles.
• 1989 - ALVINN: a 3-layer back-propagation network is designed for the task of road
following. ALVINN takes images from a camera and a laser range finder and
determines the direction the vehicle should travel in order to follow the road. The
network can effectively follow real roads under certain field conditions.
• 1992 - Demo I (Army): Though the emphasis of this program was teleoperation, it
used a stereo camera system for obstacle detection and safe vehicle stopping as
part of an "enhanced teleoperation" focus.
• 1998 - Demo II (DARPA): Introduced high resolution stereo vision as a means of
obstacle detection.
• 2001 - Demo III (Army) Demonstrated the ability of unmanned ground vehicles to
navigate miles of difficult off-road terrain, avoiding obstacles such as rocks and
trees.
4/13/2018 7Adrian Kaehler
4/13/2018 8Adrian Kaehler
By using computer vision, Stanley was able to extrapolate, from the very small patch
of road visible to its LIDAR sensors, far into the distance as a means of
comprehending the road ahead.
However, this was the only function vision was used for in the winning vehicle.
4/13/2018 9Adrian Kaehler
The TerraMax vehicle from Oshkosh Defense used LIDAR and Stereo Vision to detect
positive and negative obstacles.
4/13/2018 10Adrian Kaehler
Vehicles needed to demonstrate parking, lane following, rule adherence,
and interaction with other vehicles. Vision becomes much more
important for these tasks and appears widely in many vehicles.
4/13/2018 11Adrian Kaehler
Note that this is just a sampling of the many participants in the DARPA Urban Challenge.
4/13/2018 12
Vehicle KNIGHT RIDER CAROLINE TERRAMAX JUNIOR BOSS
Team UCF Braunschweig Oshkosh Stanford CMU
Lane Detection
Road Shape
Estimation
Drivability
Obstacles
Adrian Kaehler
Founded in 1999, MobilEye has focused on vision
based Advanced Driver Assistance Systems (ADAS)
Their technology was used by the CMU team in the
DARPA Urban Challenge
Early systems provided:
◦ Lane detection
◦ Lane departure warning
◦ Obstacle/Pedestrian detection
◦ Vehicle distance measurements
By 2007, MobilEye systems were for sale in
commercial vehicles including the Cadillac STS and
DTS.
There is much more to say about MobilEye in the current time. They are developing much
broader scope solutions that use vision to provide breaking, sign, and traffic light
detection. Their planned EyeQ-5 chip may do much more (e.g. path planning).
4/13/2018 13Adrian Kaehler
} In 2009 Google launched what is now
Waymo, their self driving car project,
bringing together many key players from the
DARPA challenges.
} Stanford’s distrust of computer vision was
clearly inherited by the Google vehicles, but
still vision was unavoidable for many tasks.
} In this era LIDAR continued to dominate
mainstream thought about vehicle
autonomy.
} We will see, however, that other players
would not be so hesitant to use computer
vision.
} Many saw LIDAR as an insurmountable cost
barrier for wide adoption of vehicle
autonomy.
4/13/2018 14Adrian Kaehler
In the years following the DARPA challenges, many researchers
worked to integrate the learnings from the many challenge
teams and to put many of the techniques developed for those
races onto a firmer theoretical footing.
4/13/2018 15
In computer vision, important algorithms such as the Dalal and Triggs “HOG”
pedestrian detector (2005) were refined and evolved and new techniques were
developed, such as the use of deformable parts models to reliably find objects
such as bicyclists (2011).
Adrian Kaehler
} In 2012, the KITTI dataset was
released.
} GPS RTK INS, Stereo Cameras (mono
and color), Velodyne LIDAR.
} In this system, the INS and LIDAR are
used to create a ground truth dataset
that can be used to evaluate the
performance of vision algorithms.
Professional annotator assigned
bounding boxes and classes to
important objects (cars, pedestrians,
bicycles, etc.)
} For development of: Stereo, Optical
Flow, 3D reconstruction, 3D object
detection, and 3d Object Tracking.
4/13/2018 16Adrian Kaehler
} The appearance of such datasets as KITTI allow new algorithms to be developed
faster, to be compared to one another more efficiently, and for more players to
work on the problem.
4/13/2018 17Adrian Kaehler
4/13/2018 18Adrian Kaehler
} In 2012 “AlexNet” put Neural Networks
back on the map by decisively defeating
all other approaches on an important
computer vision benchmark task.
} More importantly, this approach, now
call Deep Neural Networks, opened the
door to solving a wide variety of
problems wit computer vision that were
previously considered unreachable.
} Suddenly the economics of vision
become extremely compelling. A
camera can cost as little as $1, while a
Velodyne HD-64 costs nearly $70,000.
4/13/2018 19Adrian Kaehler
The computer vision and machine learning
communities immediately begin exploring
the limits of this technology, many new
things are found to be possible.
Suddenly, results thought to be
generations away begin to occur on a
regular basis.
◦ Computers beat Go masters
◦ Voice recognition starts to really work
◦ Chatbots passing the Turing Test
◦ Soon robots will be folding you laundry!
We will look at just a few interesting
results...
4/13/2018 20
The original SegNet was used for
Segmentation
A Deep Neural Network (DNN) learns a latent
representation for a scene that it then
interprets into semantic pixel labeling.
Later, similar structures were used to learn
such things as color for black-and-white
movies and depth images from monocular
video.
4/13/2018 21Adrian Kaehler
4/13/2018 22Adrian Kaehler
YOLO finds objects in an image and predicts bounding
boxes for them while simultaneously classifying them.
The system uses a combination of DNNs and other
more traditional probabilistic techniques.
The result is a high performance extremely fast system
for detecting a wide variety of different classes of
interest. The latest version, YOLO-9k, can find nine
thousand different classes of objects.
This is another example of a result that only a few
years ago would have seemed decades away.
4/13/2018 23Adrian Kaehler
4/13/2018 24Adrian Kaehler
And there is no end in sight. Deep Neural
Network approaches to problems keep
giving better and better results, and
nobody can say where it will stop.
Mask R-CNN is in improvement on the
prior algorithms as it combines object
finding with pixel level labeling for each
object type.
Mark R-CNN can also be used for human
pose estimation. We’ll see in a moment
what that has to do with self driving cars.
4/13/2018 25Adrian Kaehler
4/13/2018 26Adrian Kaehler
4/13/2018 27
Supervised Autonomy Full Autonomy
Traditional
Approaches with Deep
Learning Components
Fully Learned Systems
that Map Vision
Directly to Control
Adrian Kaehler
The overall problem of a self driving car is not
solved, but many important components are “solved
enough” to deploy into the market.
Some are now working to further improve those
same components, while others are looking at
additional ways the available technology can benefit
the market.
Here are some new directions to watch:
Using vision to
◦ make sure the driver is aware (glancing behaviors)
◦ make sure the driver is physically engaged (grasping
the steering wheel)
◦ make sure the driver is satisfied with their experience
(emotion)
◦ Using vision to build maps and perform localization
(SLAM)
4/13/2018 28
Images from Lex Friedman, MIT
https://selfdrivingcars.mit.edu
Adrian Kaehler
Though there are many approaches to autonomy and how it will be applied in the
products of individual companies, some things seem certain:
} The future for computer vision is bright
} The future for autonomous cars is bright
} The future for computer vision in autonomous cars is very bright
My Predictions:
} LIDAR gets cheaper, but vision is getting better even faster.
} This does not mean that vision solves all problems, but it is fast on it’s way to
being the central sensor in autonomous vehicles.
4/13/2018 29
4/13/2018 30

More Related Content

What's hot

Self-Driving Cars With Convolutional Neural Networks (CNN.pptx
Self-Driving Cars With Convolutional Neural Networks (CNN.pptxSelf-Driving Cars With Convolutional Neural Networks (CNN.pptx
Self-Driving Cars With Convolutional Neural Networks (CNN.pptx
ssuserf79e761
 
Machine Learning and Robotics
Machine Learning and RoboticsMachine Learning and Robotics
Machine Learning and Robotics
butest
 
Face recognition technology - BEST PPT
Face recognition technology - BEST PPTFace recognition technology - BEST PPT
Face recognition technology - BEST PPT
Siddharth Modi
 

What's hot (20)

Google driverless car technical seminar report (.docx)
Google driverless car technical seminar report (.docx)Google driverless car technical seminar report (.docx)
Google driverless car technical seminar report (.docx)
 
Self driving cars.pptx
Self driving cars.pptxSelf driving cars.pptx
Self driving cars.pptx
 
Autonomous cars
Autonomous carsAutonomous cars
Autonomous cars
 
Self Driving Autopilot Car
Self Driving Autopilot CarSelf Driving Autopilot Car
Self Driving Autopilot Car
 
Computer Vision
Computer VisionComputer Vision
Computer Vision
 
Autonomous car
Autonomous carAutonomous car
Autonomous car
 
Deep Learning for Autonomous Driving
Deep Learning for Autonomous DrivingDeep Learning for Autonomous Driving
Deep Learning for Autonomous Driving
 
Self Driving Car Seminar presentation
Self Driving Car Seminar presentationSelf Driving Car Seminar presentation
Self Driving Car Seminar presentation
 
Autonomous Vehicles
Autonomous VehiclesAutonomous Vehicles
Autonomous Vehicles
 
Google car ppt
Google car pptGoogle car ppt
Google car ppt
 
Autonomous car ; google car
Autonomous car ; google carAutonomous car ; google car
Autonomous car ; google car
 
Autonomous cars
Autonomous carsAutonomous cars
Autonomous cars
 
Self-Driving Cars With Convolutional Neural Networks (CNN.pptx
Self-Driving Cars With Convolutional Neural Networks (CNN.pptxSelf-Driving Cars With Convolutional Neural Networks (CNN.pptx
Self-Driving Cars With Convolutional Neural Networks (CNN.pptx
 
AI in Autonomous Vehicles
AI in Autonomous VehiclesAI in Autonomous Vehicles
AI in Autonomous Vehicles
 
Introduction to Computer Vision.pdf
Introduction to Computer Vision.pdfIntroduction to Computer Vision.pdf
Introduction to Computer Vision.pdf
 
Edge detection
Edge detectionEdge detection
Edge detection
 
IMAGE SEGMENTATION.
IMAGE SEGMENTATION.IMAGE SEGMENTATION.
IMAGE SEGMENTATION.
 
Machine Learning and Robotics
Machine Learning and RoboticsMachine Learning and Robotics
Machine Learning and Robotics
 
Face recognition technology - BEST PPT
Face recognition technology - BEST PPTFace recognition technology - BEST PPT
Face recognition technology - BEST PPT
 
Ai lecture 03 computer vision
Ai lecture 03 computer visionAi lecture 03 computer vision
Ai lecture 03 computer vision
 

Similar to Computer Vision for autonomous driving

Autonomous Vehicle Navigation
Autonomous Vehicle NavigationAutonomous Vehicle Navigation
Autonomous Vehicle Navigation
Torben Haagh
 
Ford roadmap to autonomous driving
Ford   roadmap to autonomous drivingFord   roadmap to autonomous driving
Ford roadmap to autonomous driving
pkggman
 
Lane and Object Detection for Autonomous Vehicle using Advanced Computer Vision
Lane and Object Detection for Autonomous Vehicle using Advanced Computer VisionLane and Object Detection for Autonomous Vehicle using Advanced Computer Vision
Lane and Object Detection for Autonomous Vehicle using Advanced Computer Vision
YogeshIJTSRD
 

Similar to Computer Vision for autonomous driving (20)

Driverless Vehicles: Future Outlook on Intelligent Transportation
Driverless Vehicles: Future Outlook on Intelligent TransportationDriverless Vehicles: Future Outlook on Intelligent Transportation
Driverless Vehicles: Future Outlook on Intelligent Transportation
 
ObjectDetection.pptx
ObjectDetection.pptxObjectDetection.pptx
ObjectDetection.pptx
 
Autonomous Vehicle Navigation
Autonomous Vehicle NavigationAutonomous Vehicle Navigation
Autonomous Vehicle Navigation
 
Autonomous Driving using Deep Reinforcement Learning in Urban Environment
Autonomous Driving using Deep Reinforcement Learning in Urban EnvironmentAutonomous Driving using Deep Reinforcement Learning in Urban Environment
Autonomous Driving using Deep Reinforcement Learning in Urban Environment
 
“How QuantCube Technology uses alternative data to create macroeconomic, fina...
“How QuantCube Technology uses alternative data to create macroeconomic, fina...“How QuantCube Technology uses alternative data to create macroeconomic, fina...
“How QuantCube Technology uses alternative data to create macroeconomic, fina...
 
Autonomous driving
Autonomous drivingAutonomous driving
Autonomous driving
 
Autonomous Vehicles: the Intersection of Robotics and Artificial Intelligence
Autonomous Vehicles: the Intersection of Robotics and Artificial IntelligenceAutonomous Vehicles: the Intersection of Robotics and Artificial Intelligence
Autonomous Vehicles: the Intersection of Robotics and Artificial Intelligence
 
RAISE AND CRASH OF INNOVATOR- ANTHONY LEVANDOWSKI
RAISE AND CRASH OF INNOVATOR- ANTHONY LEVANDOWSKIRAISE AND CRASH OF INNOVATOR- ANTHONY LEVANDOWSKI
RAISE AND CRASH OF INNOVATOR- ANTHONY LEVANDOWSKI
 
"Is Vision the New Wireless?," a Presentation from Qualcomm
"Is Vision the New Wireless?," a Presentation from Qualcomm"Is Vision the New Wireless?," a Presentation from Qualcomm
"Is Vision the New Wireless?," a Presentation from Qualcomm
 
Car and Road Test.pdf
Car and Road Test.pdfCar and Road Test.pdf
Car and Road Test.pdf
 
Ford roadmap to autonomous driving
Ford   roadmap to autonomous drivingFord   roadmap to autonomous driving
Ford roadmap to autonomous driving
 
ROAD The Road Event Awareness Dataset for Autonomous Driving.docx
ROAD The Road Event Awareness Dataset for Autonomous Driving.docxROAD The Road Event Awareness Dataset for Autonomous Driving.docx
ROAD The Road Event Awareness Dataset for Autonomous Driving.docx
 
AUTONOMOUS SELF DRIVING CARS
AUTONOMOUS SELF DRIVING CARSAUTONOMOUS SELF DRIVING CARS
AUTONOMOUS SELF DRIVING CARS
 
Autonomous driving publications
Autonomous driving publicationsAutonomous driving publications
Autonomous driving publications
 
Google's Driverless Car
Google's Driverless CarGoogle's Driverless Car
Google's Driverless Car
 
Research Paper
Research PaperResearch Paper
Research Paper
 
Lane and Object Detection for Autonomous Vehicle using Advanced Computer Vision
Lane and Object Detection for Autonomous Vehicle using Advanced Computer VisionLane and Object Detection for Autonomous Vehicle using Advanced Computer Vision
Lane and Object Detection for Autonomous Vehicle using Advanced Computer Vision
 
Identification and classification of moving vehicles on road
Identification and classification of moving vehicles on roadIdentification and classification of moving vehicles on road
Identification and classification of moving vehicles on road
 
Vigilance: Vehicle Detector and Tracker
Vigilance: Vehicle Detector and TrackerVigilance: Vehicle Detector and Tracker
Vigilance: Vehicle Detector and Tracker
 
elon musk.pptx
elon musk.pptxelon musk.pptx
elon musk.pptx
 

More from Bill Liu

More from Bill Liu (20)

Walk Through a Real World ML Production Project
Walk Through a Real World ML Production ProjectWalk Through a Real World ML Production Project
Walk Through a Real World ML Production Project
 
Redefining MLOps with Model Deployment, Management and Observability in Produ...
Redefining MLOps with Model Deployment, Management and Observability in Produ...Redefining MLOps with Model Deployment, Management and Observability in Produ...
Redefining MLOps with Model Deployment, Management and Observability in Produ...
 
Productizing Machine Learning at the Edge
Productizing Machine Learning at the EdgeProductizing Machine Learning at the Edge
Productizing Machine Learning at the Edge
 
Transformers in Vision: From Zero to Hero
Transformers in Vision: From Zero to HeroTransformers in Vision: From Zero to Hero
Transformers in Vision: From Zero to Hero
 
Deep AutoViML For Tensorflow Models and MLOps Workflows
Deep AutoViML For Tensorflow Models and MLOps WorkflowsDeep AutoViML For Tensorflow Models and MLOps Workflows
Deep AutoViML For Tensorflow Models and MLOps Workflows
 
Metaflow: The ML Infrastructure at Netflix
Metaflow: The ML Infrastructure at NetflixMetaflow: The ML Infrastructure at Netflix
Metaflow: The ML Infrastructure at Netflix
 
Practical Crowdsourcing for ML at Scale
Practical Crowdsourcing for ML at ScalePractical Crowdsourcing for ML at Scale
Practical Crowdsourcing for ML at Scale
 
Building large scale transactional data lake using apache hudi
Building large scale transactional data lake using apache hudiBuilding large scale transactional data lake using apache hudi
Building large scale transactional data lake using apache hudi
 
Deep Reinforcement Learning and Its Applications
Deep Reinforcement Learning and Its ApplicationsDeep Reinforcement Learning and Its Applications
Deep Reinforcement Learning and Its Applications
 
Big Data and AI in Fighting Against COVID-19
Big Data and AI in Fighting Against COVID-19Big Data and AI in Fighting Against COVID-19
Big Data and AI in Fighting Against COVID-19
 
Highly-scalable Reinforcement Learning RLlib for Real-world Applications
Highly-scalable Reinforcement Learning RLlib for Real-world ApplicationsHighly-scalable Reinforcement Learning RLlib for Real-world Applications
Highly-scalable Reinforcement Learning RLlib for Real-world Applications
 
Build computer vision models to perform object detection and classification w...
Build computer vision models to perform object detection and classification w...Build computer vision models to perform object detection and classification w...
Build computer vision models to perform object detection and classification w...
 
Causal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine LearningCausal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine Learning
 
Weekly #106: Deep Learning on Mobile
Weekly #106: Deep Learning on MobileWeekly #106: Deep Learning on Mobile
Weekly #106: Deep Learning on Mobile
 
Weekly #105: AutoViz and Auto_ViML Visualization and Machine Learning
Weekly #105: AutoViz and Auto_ViML Visualization and Machine LearningWeekly #105: AutoViz and Auto_ViML Visualization and Machine Learning
Weekly #105: AutoViz and Auto_ViML Visualization and Machine Learning
 
AISF19 - On Blending Machine Learning with Microeconomics
AISF19 - On Blending Machine Learning with MicroeconomicsAISF19 - On Blending Machine Learning with Microeconomics
AISF19 - On Blending Machine Learning with Microeconomics
 
AISF19 - Travel in the AI-First World
AISF19 - Travel in the AI-First WorldAISF19 - Travel in the AI-First World
AISF19 - Travel in the AI-First World
 
AISF19 - Unleash Computer Vision at the Edge
AISF19 - Unleash Computer Vision at the EdgeAISF19 - Unleash Computer Vision at the Edge
AISF19 - Unleash Computer Vision at the Edge
 
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
 
Toronto meetup 20190917
Toronto meetup 20190917Toronto meetup 20190917
Toronto meetup 20190917
 

Recently uploaded

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 

Computer Vision for autonomous driving

  • 1. AI NEXTCon Silicon Valley 18 SANTA CLARA | APRIL 10-13 #ainextcon Adrian Kaehler, CEO Giant.AI
  • 2. } Author: Learning OpenCV and Learning OpenCV 3 } Winner 2005 DARPA Grand Challenge – wrote vision system } Vice President and head of Robotics and Machine Learning at Applied Minds for 8 years, built autonomous vehicles for JIEDDO for the Iraq and Afghanistan wars. } Magic Leap } Founder: Silicon Valley Deep Learning Group } Founder and CEO: Giant.AI 4/13/2018 2
  • 3. } Introduction } Section I: Turning point, DARPA challenges, and old approaches } Section II: From then until now, important events since } Wrap Up: What’s now and what’s coming. 4/13/2018 3Adrian Kaehler
  • 4. 4/13/2018 4 Images and text courtesy of the Computer History Museum Illustration for Popular Mechanics article on automated highways, 1961. Credit: Popular Mechanics Sketch of a pre-programmed clockwork cart by Leonardo Da Vinci, circa 1478 Had it been built, this cart would have been powered by large coiled clockwork springs, propelling it over 130 feet. The clever control mechanism could have taken the vehicle through a predetermined course. Credit: Biblioteca Ambrosiana, Milan, Italy / De Agostini Picture Library / Metis e Mida Informatica / Veneranda Biblioteca Ambrosiana / The Bridgeman Art Library Stanford Artificial Intelligence Laboratory Cart, 1964-71. Along with Shakey the robot at neighboring SRI, the Stanford Cart pioneered techniques for navigating through an unfamiliar environment with artificial intelligence and machine vision. The cart famously wandered into a nearby road, but survived unscathed. Credit: © Mark Richards
  • 5. } 1991: USDOT mandate to "demonstrate an automated vehicle and highway system by 1997” } 1994: Daimler-Benz and Ernst Dickmanns of UniBwM drove more than 1,000 km in Paris demonstrating autonomous driving in free lanes, convoy driving, and lane changes with autonomous passing of other cars } 1995: CMU Navlab project completed a 5,000 km cross-country journey, of which 98.2% was autonomously controlled. NavLab projects heavily used computer vision. In one case, steering was a neural network controlled using video images. } 1995: Dichmanns demonstrates the use of saccadic computer vision for control. } 1996: ARGO project uses stereo vision to find lane markings and drives almost 2000km. } 1997: “Demo 97” in San Diego major carmakers showed WHAT DID THEY SHOW. } 1999: First driverless transportation of human passengers by the Netherlands’ ParkShuttle project using embedded magnets (not computer vision). 4/13/2018 5 During this period, LIDAR systems were extremely expensive and did not play a large role in autonomous driving projects. The general belief among researchers was that cameras and computer vision techniques would solve this problem. Adrian Kaehler
  • 6. 4/13/2018 6 • 1985 – ALV: Road Following Demonstration: Vehicle traverses a 2 km at 10 km/hr. Forward motion only, with no obstacle avoidance required. • 1986 - ALV: Obstacle Avoidance Demonstration: Vehicle traverses 5 km road course at speeds up to 20 km/hr; must recognize and maneuver to avoid fixed objects that are small with respect to road width. • 1987 – ALV: Cross-country Route Planning Demonstration: Vehicle plans and executes a 5 km traverse of open desert terrain. Demonstrates soil and ground cover typing. • 1988 - ALV: Road Network Route Planning and Obstacle Avoidance Demonstration: Vehicle plans and executes a 20 km point-to-point traverse through a road using landmarks as navigation aids. Demonstration includes map updating and off-road maneuvering to avoid obstacles. • 1989 - ALVINN: a 3-layer back-propagation network is designed for the task of road following. ALVINN takes images from a camera and a laser range finder and determines the direction the vehicle should travel in order to follow the road. The network can effectively follow real roads under certain field conditions. • 1992 - Demo I (Army): Though the emphasis of this program was teleoperation, it used a stereo camera system for obstacle detection and safe vehicle stopping as part of an "enhanced teleoperation" focus. • 1998 - Demo II (DARPA): Introduced high resolution stereo vision as a means of obstacle detection. • 2001 - Demo III (Army) Demonstrated the ability of unmanned ground vehicles to navigate miles of difficult off-road terrain, avoiding obstacles such as rocks and trees.
  • 9. By using computer vision, Stanley was able to extrapolate, from the very small patch of road visible to its LIDAR sensors, far into the distance as a means of comprehending the road ahead. However, this was the only function vision was used for in the winning vehicle. 4/13/2018 9Adrian Kaehler
  • 10. The TerraMax vehicle from Oshkosh Defense used LIDAR and Stereo Vision to detect positive and negative obstacles. 4/13/2018 10Adrian Kaehler
  • 11. Vehicles needed to demonstrate parking, lane following, rule adherence, and interaction with other vehicles. Vision becomes much more important for these tasks and appears widely in many vehicles. 4/13/2018 11Adrian Kaehler
  • 12. Note that this is just a sampling of the many participants in the DARPA Urban Challenge. 4/13/2018 12 Vehicle KNIGHT RIDER CAROLINE TERRAMAX JUNIOR BOSS Team UCF Braunschweig Oshkosh Stanford CMU Lane Detection Road Shape Estimation Drivability Obstacles Adrian Kaehler
  • 13. Founded in 1999, MobilEye has focused on vision based Advanced Driver Assistance Systems (ADAS) Their technology was used by the CMU team in the DARPA Urban Challenge Early systems provided: ◦ Lane detection ◦ Lane departure warning ◦ Obstacle/Pedestrian detection ◦ Vehicle distance measurements By 2007, MobilEye systems were for sale in commercial vehicles including the Cadillac STS and DTS. There is much more to say about MobilEye in the current time. They are developing much broader scope solutions that use vision to provide breaking, sign, and traffic light detection. Their planned EyeQ-5 chip may do much more (e.g. path planning). 4/13/2018 13Adrian Kaehler
  • 14. } In 2009 Google launched what is now Waymo, their self driving car project, bringing together many key players from the DARPA challenges. } Stanford’s distrust of computer vision was clearly inherited by the Google vehicles, but still vision was unavoidable for many tasks. } In this era LIDAR continued to dominate mainstream thought about vehicle autonomy. } We will see, however, that other players would not be so hesitant to use computer vision. } Many saw LIDAR as an insurmountable cost barrier for wide adoption of vehicle autonomy. 4/13/2018 14Adrian Kaehler
  • 15. In the years following the DARPA challenges, many researchers worked to integrate the learnings from the many challenge teams and to put many of the techniques developed for those races onto a firmer theoretical footing. 4/13/2018 15 In computer vision, important algorithms such as the Dalal and Triggs “HOG” pedestrian detector (2005) were refined and evolved and new techniques were developed, such as the use of deformable parts models to reliably find objects such as bicyclists (2011). Adrian Kaehler
  • 16. } In 2012, the KITTI dataset was released. } GPS RTK INS, Stereo Cameras (mono and color), Velodyne LIDAR. } In this system, the INS and LIDAR are used to create a ground truth dataset that can be used to evaluate the performance of vision algorithms. Professional annotator assigned bounding boxes and classes to important objects (cars, pedestrians, bicycles, etc.) } For development of: Stereo, Optical Flow, 3D reconstruction, 3D object detection, and 3d Object Tracking. 4/13/2018 16Adrian Kaehler
  • 17. } The appearance of such datasets as KITTI allow new algorithms to be developed faster, to be compared to one another more efficiently, and for more players to work on the problem. 4/13/2018 17Adrian Kaehler
  • 19. } In 2012 “AlexNet” put Neural Networks back on the map by decisively defeating all other approaches on an important computer vision benchmark task. } More importantly, this approach, now call Deep Neural Networks, opened the door to solving a wide variety of problems wit computer vision that were previously considered unreachable. } Suddenly the economics of vision become extremely compelling. A camera can cost as little as $1, while a Velodyne HD-64 costs nearly $70,000. 4/13/2018 19Adrian Kaehler
  • 20. The computer vision and machine learning communities immediately begin exploring the limits of this technology, many new things are found to be possible. Suddenly, results thought to be generations away begin to occur on a regular basis. ◦ Computers beat Go masters ◦ Voice recognition starts to really work ◦ Chatbots passing the Turing Test ◦ Soon robots will be folding you laundry! We will look at just a few interesting results... 4/13/2018 20
  • 21. The original SegNet was used for Segmentation A Deep Neural Network (DNN) learns a latent representation for a scene that it then interprets into semantic pixel labeling. Later, similar structures were used to learn such things as color for black-and-white movies and depth images from monocular video. 4/13/2018 21Adrian Kaehler
  • 23. YOLO finds objects in an image and predicts bounding boxes for them while simultaneously classifying them. The system uses a combination of DNNs and other more traditional probabilistic techniques. The result is a high performance extremely fast system for detecting a wide variety of different classes of interest. The latest version, YOLO-9k, can find nine thousand different classes of objects. This is another example of a result that only a few years ago would have seemed decades away. 4/13/2018 23Adrian Kaehler
  • 25. And there is no end in sight. Deep Neural Network approaches to problems keep giving better and better results, and nobody can say where it will stop. Mask R-CNN is in improvement on the prior algorithms as it combines object finding with pixel level labeling for each object type. Mark R-CNN can also be used for human pose estimation. We’ll see in a moment what that has to do with self driving cars. 4/13/2018 25Adrian Kaehler
  • 27. 4/13/2018 27 Supervised Autonomy Full Autonomy Traditional Approaches with Deep Learning Components Fully Learned Systems that Map Vision Directly to Control Adrian Kaehler
  • 28. The overall problem of a self driving car is not solved, but many important components are “solved enough” to deploy into the market. Some are now working to further improve those same components, while others are looking at additional ways the available technology can benefit the market. Here are some new directions to watch: Using vision to ◦ make sure the driver is aware (glancing behaviors) ◦ make sure the driver is physically engaged (grasping the steering wheel) ◦ make sure the driver is satisfied with their experience (emotion) ◦ Using vision to build maps and perform localization (SLAM) 4/13/2018 28 Images from Lex Friedman, MIT https://selfdrivingcars.mit.edu Adrian Kaehler
  • 29. Though there are many approaches to autonomy and how it will be applied in the products of individual companies, some things seem certain: } The future for computer vision is bright } The future for autonomous cars is bright } The future for computer vision in autonomous cars is very bright My Predictions: } LIDAR gets cheaper, but vision is getting better even faster. } This does not mean that vision solves all problems, but it is fast on it’s way to being the central sensor in autonomous vehicles. 4/13/2018 29