SlideShare a Scribd company logo
Thesis Seminar:
Imitation Learning and
Direct Perception for
Autonomous Driving
Rocky Liang
University of Waterloo CogDrive Lab
Supervisor: Dr. Dongpu Cao
1
Table of Contents
▪ Problem Definition
▪ Conditional Imitation Learning
▪ Direct Perception
▪ Curvature Based Dynamic Controller
2
Goal: To design a policy that can operate an autonomous system in a more
streamlined way than the traditional robotics approach, which is complex and difficult
to scale.
Problem Definition
Environment
Understandin
g
Decision
Making
Sensor Input
Control
Action
Actuation
Imitation
Learning
Policy
3
Conditional Imitation Learning
1. Record expert demonstrations of (state, action) pairs
2. Initialize model weights
3. Update weights by minimizing the model’s predicted action and real action
Model
State/
Observation
Context
Expert Action/
Ground Truth
4
Conditional Imitation Learning - Context
▪ Driving is a multimodal task: there
can be multiple correct actions in
any given location
▪ Without a context, predicting the
appropriate action is an under
constrained problem
5
Conditional Imitation Learning - Dataset
Dataset used: CARLA Imitation Learning
Dataset
Available contexts:
▪ Follow lane
▪ Left turn
▪ Right turn
▪ Straight
Anatomy of a datapoint
Context
Input (Observation & context) Label (action)
Steering, throttle, brake
Vx
6
Scene Description
▪ Suburban scenery
▪ Single lane roads
▪ Intersections
▪ Other vehicles & pedestrians
Conditional Imitation Learning - Model 1
Context as Input Model
7
Conditional Imitation Learning - Model 2
Branched Model
8
Conditional Imitation Learning - Limitations
▪ Low explainability
▪ Why did it take that specific action?
▪ Hard to debug
▪ Immutable behavior
▪ The behavior shown in the dataset is the behavior the trained model will
have
9
Direct Perception
Environment
Understandin
g
Decision
Making
Sensor Input
Control
Action
Actuation
Imitation
learning
Policy
Affordance
Prediction
Actuation
10
Direct Perception - Affordances
● Affordances are states that are key to the
robot’s operation
● Unlike mediated perception which seeks to
build a full reconstruction of the environment,
direct perception only extracts affordances
from sensors
● Affordances:
○ Lane deviation
■ Heading error & crosstrack error
○ Road curvature
○ Distance to car in front
● Scales across different vehicles with minimal
vehicle specific development
● Affordances are fed to a controller to drive the
car
11
Nonzero curvature
Zero curvature
Direct Perception - Model
Added lane change
contexts
12
Data Recorder
Python Clients
Weather Client
NPC Client
Recorder Client
CARLA Server
User Input
Saved
Data
Display
Window
Left Lane Change
Right Lane
Change
Left Turn Right Turn
13
Curvature Based Dynamic Controller
Lateral Dynamic Model
Source: Vehicle System Dynamics,
Khajepour
Loss Calculation Update Delta
14
1. Propagate lateral model and get curvature prediction
2. Calculate loss between predicted curvature and
affordance curvature
3. Minimize loss by updating control input using its
gradient
Curvature Based Dynamic Controller
Lateral
Dynamic Model
Vehicle states from
sensors
Control input
Predicted states
Error
Calculation
Iteratively solve
Solver Loop
15
Results
16
Summary & Contributions
▪ Developed context aware imitation learning models for vehicle control
▪ Developed direct perception model for vehicle control
▪ Context aware affordance predictor
▪ Curvature based dynamic controller
▪ Learning based but still vehicle agnostic
▪ Data recorder
▪ Able to record contextual driving data including lane change affordances
17
Future Work
More robust context modeling
▪ Currently, contexts are discrete categories
▪ Cannot navigate more complicated intersections
▪ Would like to represent context in a continuous way
▪ How to represent this information?
▪ How to pass it to the network?
Domain adaptation
▪ Fine tuning with real world dataset (simple, low generalizability)
▪ Learn common scene representation across multiple domains (tough, high
generalizability)
18
Key Sources
▪ End-to-end Driving via Conditional Imitation Learning, Codevilla et al.
▪ Agile Autonomous Driving using End-to-End Deep Imitation Learning, Pan et al.
▪ Imitation Learning for Vision-based Lane Keeping Assistance, Innocenti et al.
19

More Related Content

Similar to Imitation Learning and Direct Perception for Autonomous Driving

Machine Learning and Robotics
Machine Learning and RoboticsMachine Learning and Robotics
Machine Learning and Roboticsbutest
 
Neural Approximate Dynamic Programming for On-Demand Ride-Pooling
Neural Approximate Dynamic Programming for On-Demand Ride-PoolingNeural Approximate Dynamic Programming for On-Demand Ride-Pooling
Neural Approximate Dynamic Programming for On-Demand Ride-Pooling
ivaderivader
 
Driving Behavior for ADAS and Autonomous Driving II
Driving Behavior for ADAS and Autonomous Driving IIDriving Behavior for ADAS and Autonomous Driving II
Driving Behavior for ADAS and Autonomous Driving II
Yu Huang
 
MSCV Capstone Spring 2020 Presentation - RL for AD
MSCV Capstone Spring 2020 Presentation - RL for ADMSCV Capstone Spring 2020 Presentation - RL for AD
MSCV Capstone Spring 2020 Presentation - RL for AD
Mayank Gupta
 
IRJET- A Review on Deep Reinforcement Learning Induced Autonomous Driving Fra...
IRJET- A Review on Deep Reinforcement Learning Induced Autonomous Driving Fra...IRJET- A Review on Deep Reinforcement Learning Induced Autonomous Driving Fra...
IRJET- A Review on Deep Reinforcement Learning Induced Autonomous Driving Fra...
IRJET Journal
 
SPLT Transformer.pptx
SPLT Transformer.pptxSPLT Transformer.pptx
SPLT Transformer.pptx
Seungeon Baek
 
Left Turn Display Mechanism for Facilitating Left Hand Turns
Left Turn Display Mechanism for Facilitating Left Hand TurnsLeft Turn Display Mechanism for Facilitating Left Hand Turns
Left Turn Display Mechanism for Facilitating Left Hand Turns
Ali Vira
 
PRM-RL: Long-range Robotics Navigation Tasks by Combining Reinforcement Learn...
PRM-RL: Long-range Robotics Navigation Tasks by Combining Reinforcement Learn...PRM-RL: Long-range Robotics Navigation Tasks by Combining Reinforcement Learn...
PRM-RL: Long-range Robotics Navigation Tasks by Combining Reinforcement Learn...
Dongmin Lee
 
Project Proposal Presentation
Project Proposal PresentationProject Proposal Presentation
Project Proposal PresentationNadeem Qandeel
 
Embedded ppt
Embedded pptEmbedded ppt
Embedded ppt
Akash Satamkar
 
Prediction,Planninng & Control at Baidu
Prediction,Planninng & Control at BaiduPrediction,Planninng & Control at Baidu
Prediction,Planninng & Control at Baidu
Yu Huang
 
Driving Behavior for ADAS and Autonomous Driving III
Driving Behavior for ADAS and Autonomous Driving IIIDriving Behavior for ADAS and Autonomous Driving III
Driving Behavior for ADAS and Autonomous Driving III
Yu Huang
 
Intelligent Mobility: From Last Mile to Long Distance Route Optimization
Intelligent Mobility: From Last Mile to Long Distance Route OptimizationIntelligent Mobility: From Last Mile to Long Distance Route Optimization
Intelligent Mobility: From Last Mile to Long Distance Route Optimization
BigML, Inc
 
Deep reinforcement learning framework for autonomous driving
Deep reinforcement learning framework for autonomous drivingDeep reinforcement learning framework for autonomous driving
Deep reinforcement learning framework for autonomous driving
GopikaGopinath5
 
Optimization : Back to the Core
Optimization : Back to the CoreOptimization : Back to the Core
Optimization : Back to the Core
Dimitris Bountolos
 
Application Oriented Computer Vision Pipeline for Automotive Industry
Application Oriented Computer Vision Pipeline for Automotive IndustryApplication Oriented Computer Vision Pipeline for Automotive Industry
Application Oriented Computer Vision Pipeline for Automotive Industry
Oytun Eren Sengul
 
The Future of Mixed-Autonomy Traffic (AIS302) - AWS re:Invent 2018
The Future of Mixed-Autonomy Traffic (AIS302) - AWS re:Invent 2018The Future of Mixed-Autonomy Traffic (AIS302) - AWS re:Invent 2018
The Future of Mixed-Autonomy Traffic (AIS302) - AWS re:Invent 2018
Amazon Web Services
 
Driving Behavior for ADAS and Autonomous Driving VI
Driving Behavior for ADAS and Autonomous Driving VIDriving Behavior for ADAS and Autonomous Driving VI
Driving Behavior for ADAS and Autonomous Driving VI
Yu Huang
 
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
Simplilearn
 

Similar to Imitation Learning and Direct Perception for Autonomous Driving (20)

Machine Learning and Robotics
Machine Learning and RoboticsMachine Learning and Robotics
Machine Learning and Robotics
 
Neural Approximate Dynamic Programming for On-Demand Ride-Pooling
Neural Approximate Dynamic Programming for On-Demand Ride-PoolingNeural Approximate Dynamic Programming for On-Demand Ride-Pooling
Neural Approximate Dynamic Programming for On-Demand Ride-Pooling
 
Driving Behavior for ADAS and Autonomous Driving II
Driving Behavior for ADAS and Autonomous Driving IIDriving Behavior for ADAS and Autonomous Driving II
Driving Behavior for ADAS and Autonomous Driving II
 
MSCV Capstone Spring 2020 Presentation - RL for AD
MSCV Capstone Spring 2020 Presentation - RL for ADMSCV Capstone Spring 2020 Presentation - RL for AD
MSCV Capstone Spring 2020 Presentation - RL for AD
 
IRJET- A Review on Deep Reinforcement Learning Induced Autonomous Driving Fra...
IRJET- A Review on Deep Reinforcement Learning Induced Autonomous Driving Fra...IRJET- A Review on Deep Reinforcement Learning Induced Autonomous Driving Fra...
IRJET- A Review on Deep Reinforcement Learning Induced Autonomous Driving Fra...
 
SPLT Transformer.pptx
SPLT Transformer.pptxSPLT Transformer.pptx
SPLT Transformer.pptx
 
Left Turn Display Mechanism for Facilitating Left Hand Turns
Left Turn Display Mechanism for Facilitating Left Hand TurnsLeft Turn Display Mechanism for Facilitating Left Hand Turns
Left Turn Display Mechanism for Facilitating Left Hand Turns
 
PRM-RL: Long-range Robotics Navigation Tasks by Combining Reinforcement Learn...
PRM-RL: Long-range Robotics Navigation Tasks by Combining Reinforcement Learn...PRM-RL: Long-range Robotics Navigation Tasks by Combining Reinforcement Learn...
PRM-RL: Long-range Robotics Navigation Tasks by Combining Reinforcement Learn...
 
FurkatKochkarov-propoal
FurkatKochkarov-propoalFurkatKochkarov-propoal
FurkatKochkarov-propoal
 
Project Proposal Presentation
Project Proposal PresentationProject Proposal Presentation
Project Proposal Presentation
 
Embedded ppt
Embedded pptEmbedded ppt
Embedded ppt
 
Prediction,Planninng & Control at Baidu
Prediction,Planninng & Control at BaiduPrediction,Planninng & Control at Baidu
Prediction,Planninng & Control at Baidu
 
Driving Behavior for ADAS and Autonomous Driving III
Driving Behavior for ADAS and Autonomous Driving IIIDriving Behavior for ADAS and Autonomous Driving III
Driving Behavior for ADAS and Autonomous Driving III
 
Intelligent Mobility: From Last Mile to Long Distance Route Optimization
Intelligent Mobility: From Last Mile to Long Distance Route OptimizationIntelligent Mobility: From Last Mile to Long Distance Route Optimization
Intelligent Mobility: From Last Mile to Long Distance Route Optimization
 
Deep reinforcement learning framework for autonomous driving
Deep reinforcement learning framework for autonomous drivingDeep reinforcement learning framework for autonomous driving
Deep reinforcement learning framework for autonomous driving
 
Optimization : Back to the Core
Optimization : Back to the CoreOptimization : Back to the Core
Optimization : Back to the Core
 
Application Oriented Computer Vision Pipeline for Automotive Industry
Application Oriented Computer Vision Pipeline for Automotive IndustryApplication Oriented Computer Vision Pipeline for Automotive Industry
Application Oriented Computer Vision Pipeline for Automotive Industry
 
The Future of Mixed-Autonomy Traffic (AIS302) - AWS re:Invent 2018
The Future of Mixed-Autonomy Traffic (AIS302) - AWS re:Invent 2018The Future of Mixed-Autonomy Traffic (AIS302) - AWS re:Invent 2018
The Future of Mixed-Autonomy Traffic (AIS302) - AWS re:Invent 2018
 
Driving Behavior for ADAS and Autonomous Driving VI
Driving Behavior for ADAS and Autonomous Driving VIDriving Behavior for ADAS and Autonomous Driving VI
Driving Behavior for ADAS and Autonomous Driving VI
 
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
 

Recently uploaded

Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Product School
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
Product School
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
Elena Simperl
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
Alison B. Lowndes
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Product School
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Thierry Lestable
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
Elena Simperl
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 

Recently uploaded (20)

Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 

Imitation Learning and Direct Perception for Autonomous Driving

  • 1. Thesis Seminar: Imitation Learning and Direct Perception for Autonomous Driving Rocky Liang University of Waterloo CogDrive Lab Supervisor: Dr. Dongpu Cao 1
  • 2. Table of Contents ▪ Problem Definition ▪ Conditional Imitation Learning ▪ Direct Perception ▪ Curvature Based Dynamic Controller 2
  • 3. Goal: To design a policy that can operate an autonomous system in a more streamlined way than the traditional robotics approach, which is complex and difficult to scale. Problem Definition Environment Understandin g Decision Making Sensor Input Control Action Actuation Imitation Learning Policy 3
  • 4. Conditional Imitation Learning 1. Record expert demonstrations of (state, action) pairs 2. Initialize model weights 3. Update weights by minimizing the model’s predicted action and real action Model State/ Observation Context Expert Action/ Ground Truth 4
  • 5. Conditional Imitation Learning - Context ▪ Driving is a multimodal task: there can be multiple correct actions in any given location ▪ Without a context, predicting the appropriate action is an under constrained problem 5
  • 6. Conditional Imitation Learning - Dataset Dataset used: CARLA Imitation Learning Dataset Available contexts: ▪ Follow lane ▪ Left turn ▪ Right turn ▪ Straight Anatomy of a datapoint Context Input (Observation & context) Label (action) Steering, throttle, brake Vx 6 Scene Description ▪ Suburban scenery ▪ Single lane roads ▪ Intersections ▪ Other vehicles & pedestrians
  • 7. Conditional Imitation Learning - Model 1 Context as Input Model 7
  • 8. Conditional Imitation Learning - Model 2 Branched Model 8
  • 9. Conditional Imitation Learning - Limitations ▪ Low explainability ▪ Why did it take that specific action? ▪ Hard to debug ▪ Immutable behavior ▪ The behavior shown in the dataset is the behavior the trained model will have 9
  • 11. Direct Perception - Affordances ● Affordances are states that are key to the robot’s operation ● Unlike mediated perception which seeks to build a full reconstruction of the environment, direct perception only extracts affordances from sensors ● Affordances: ○ Lane deviation ■ Heading error & crosstrack error ○ Road curvature ○ Distance to car in front ● Scales across different vehicles with minimal vehicle specific development ● Affordances are fed to a controller to drive the car 11 Nonzero curvature Zero curvature
  • 12. Direct Perception - Model Added lane change contexts 12
  • 13. Data Recorder Python Clients Weather Client NPC Client Recorder Client CARLA Server User Input Saved Data Display Window Left Lane Change Right Lane Change Left Turn Right Turn 13
  • 14. Curvature Based Dynamic Controller Lateral Dynamic Model Source: Vehicle System Dynamics, Khajepour Loss Calculation Update Delta 14 1. Propagate lateral model and get curvature prediction 2. Calculate loss between predicted curvature and affordance curvature 3. Minimize loss by updating control input using its gradient
  • 15. Curvature Based Dynamic Controller Lateral Dynamic Model Vehicle states from sensors Control input Predicted states Error Calculation Iteratively solve Solver Loop 15
  • 17. Summary & Contributions ▪ Developed context aware imitation learning models for vehicle control ▪ Developed direct perception model for vehicle control ▪ Context aware affordance predictor ▪ Curvature based dynamic controller ▪ Learning based but still vehicle agnostic ▪ Data recorder ▪ Able to record contextual driving data including lane change affordances 17
  • 18. Future Work More robust context modeling ▪ Currently, contexts are discrete categories ▪ Cannot navigate more complicated intersections ▪ Would like to represent context in a continuous way ▪ How to represent this information? ▪ How to pass it to the network? Domain adaptation ▪ Fine tuning with real world dataset (simple, low generalizability) ▪ Learn common scene representation across multiple domains (tough, high generalizability) 18
  • 19. Key Sources ▪ End-to-end Driving via Conditional Imitation Learning, Codevilla et al. ▪ Agile Autonomous Driving using End-to-End Deep Imitation Learning, Pan et al. ▪ Imitation Learning for Vision-based Lane Keeping Assistance, Innocenti et al. 19