SlideShare a Scribd company logo
1 of 14
Deep Reinforcement Learning for Self
driving Car using Unreal engine
environment
- Priyanshu Jain(SC17B113)
- Ashish Bansal (SC17B078)
Introduction
● Self driving car has become one of the most prominent topic in today's world.
● We are trying to implement self driving car using Deep-Q network which has one front camera
with field of view of 110 degree to get glance of real life driver ,we are then using Deep neural
network architecture to classify image into 5 classes which is then used by deep Q network to get
the Q table.
● The objective of this project is to allow the car to identify safe way of driving using a Deep Neural
Network.
Introduction to Carla
● CARLA is an unreal engine based environment aimed at assisting Deep learning studies
● The CARLA simulator consists of a scalable client-server architecture.
● The server is responsible for everything related with the simulation itself - sensor rendering,
computation of physics, updates on the world-state and its actors and much more. As it aims for
realistic result.
● Control over the simulation is granted through an API handled in Python and C++
Deep-Q learning
● In deep Q-learning, we use a neural network to approximate the Q-value function. The state is
given as the input and the Q-value of all possible actions is generated as the output.
● The DeepMind system used a deep convolutional neural network, with layers of tiled
convolutional filters to mimic the effects of receptive fields.
● We are comparing two architecture of deep-q learning first with simple convnet and second with
Xception model for CNN
DNN implemented
Need of CNN:
-To label each state
- original size 640X480
Policy
Actions
1)Car stops (break).
2)Car accelerates forward(full throttle)
3)Car accelerates forward (partial forward with left steer)
4)Car accelerates forward (partial forward with right steer)
5)Car remains in its previous state (no brake , no throttle, no steer)
Awards
Based on speed ( to avoid any loophole that network may find)
e.g:
elif
KPH == 40:
done = False (continue episode if false)
reward = 1
Simulations
● We started training the deep neural network from scratch using random parameter from uniform
distribution but as no. of episodes increased model learned to train itself to drive a car.
● Simulation environment consist of Carla unreal simulator where the actor(Car) is placed with front
camera (RGB) only .
● We made the environment to have collision sensor attached to car and using it as parameter to
detect collision and update necessary weight.
Results
● Final results after training are very interesting as car taught itself to drive starting from random
action.
● We can also see with only 500 episodes we were able to get some exciting turns to protect itself
from collision.
● We started from random samples in start but we can see learning to drive and take turn is really
interesting.
Comparative study between different
architecture
● We can observe that for Xception the car started to rotate continuously and not moving straight ,
this can be because of complexity of model and the parameter is not changing much due to very
complex model.
● We decided to work with 3 layer conv network described above and results were better..After
training with conv network,car was not only moving straight perfectly ,sometime it was able to
avoid obstacles and go ahead.
● Deep-Q is training as well as GPU intensive task ,we trained for 500 episodes and as we can see in
next slide the trend of rewards ,the car is learning to drive itself better with every episode.
Testing of Convolution model with 3 layers
Testing of Xception model
References
[1] https://towardsdatascience.com/deep-q-learning-tutorial-mindqn-
2a4c855abffc
[2] https://www.analyticsvidhya.com/blog/2019/04/introduction-deep-q-
learning-python/
[3] implementation concept learned from : https://pythonprogramming.net/
[4] https://www.geeksforgeeks.org/deep-q-learning/
[5] https://en.wikipedia.org/wiki/Q-learning
[6] https://en.wikipedia.org/wiki/Deep-reinforcement-learning

More Related Content

Similar to Deep presentation.pptx

Similar to Deep presentation.pptx (20)

IRJET-Survey on Simulation of Self-Driving Cars using Supervised and Reinforc...
IRJET-Survey on Simulation of Self-Driving Cars using Supervised and Reinforc...IRJET-Survey on Simulation of Self-Driving Cars using Supervised and Reinforc...
IRJET-Survey on Simulation of Self-Driving Cars using Supervised and Reinforc...
 
Deep reinforcement learning framework for autonomous driving
Deep reinforcement learning framework for autonomous drivingDeep reinforcement learning framework for autonomous driving
Deep reinforcement learning framework for autonomous driving
 
A machine learning model for average fuel consumption in heavy vehicles
A machine learning model for average fuel consumption in heavy vehiclesA machine learning model for average fuel consumption in heavy vehicles
A machine learning model for average fuel consumption in heavy vehicles
 
IRJET- Self-Driving Cars: Automation Testing using Udacity Simulator
IRJET- Self-Driving Cars: Automation Testing using Udacity SimulatorIRJET- Self-Driving Cars: Automation Testing using Udacity Simulator
IRJET- Self-Driving Cars: Automation Testing using Udacity Simulator
 
Self-Driving Car to Drive Autonomously using Image Processing and Deep Learning
Self-Driving Car to Drive Autonomously using Image Processing and Deep LearningSelf-Driving Car to Drive Autonomously using Image Processing and Deep Learning
Self-Driving Car to Drive Autonomously using Image Processing and Deep Learning
 
Distracted Drivers- Inspirit AI
Distracted Drivers- Inspirit AIDistracted Drivers- Inspirit AI
Distracted Drivers- Inspirit AI
 
Kaggle Lyft Motion Prediction for Autonomous Vehicles 4th Place Solution
Kaggle Lyft Motion Prediction for Autonomous Vehicles 4th Place SolutionKaggle Lyft Motion Prediction for Autonomous Vehicles 4th Place Solution
Kaggle Lyft Motion Prediction for Autonomous Vehicles 4th Place Solution
 
MSCV Capstone Spring 2020 Presentation - RL for AD
MSCV Capstone Spring 2020 Presentation - RL for ADMSCV Capstone Spring 2020 Presentation - RL for AD
MSCV Capstone Spring 2020 Presentation - RL for AD
 
GNR638_Course Project for spring semester
GNR638_Course Project for spring semesterGNR638_Course Project for spring semester
GNR638_Course Project for spring semester
 
Predicting Steering Angle for Self Driving Vehicles
Predicting Steering Angle for Self Driving VehiclesPredicting Steering Angle for Self Driving Vehicles
Predicting Steering Angle for Self Driving Vehicles
 
Car Steering Angle Prediction Using Deep Learning
Car Steering Angle Prediction Using Deep LearningCar Steering Angle Prediction Using Deep Learning
Car Steering Angle Prediction Using Deep Learning
 
self driving car nvidia whitepaper
self driving car nvidia whitepaperself driving car nvidia whitepaper
self driving car nvidia whitepaper
 
Automatism System Using Faster R-CNN and SVM
Automatism System Using Faster R-CNN and SVMAutomatism System Using Faster R-CNN and SVM
Automatism System Using Faster R-CNN and SVM
 
AWS Finland Meetup June 2019 - DeepRacer story
AWS Finland Meetup June 2019 - DeepRacer storyAWS Finland Meetup June 2019 - DeepRacer story
AWS Finland Meetup June 2019 - DeepRacer story
 
Convolution Neural Network (CNN)
Convolution Neural Network (CNN)Convolution Neural Network (CNN)
Convolution Neural Network (CNN)
 
Final PPT.pdf
Final PPT.pdfFinal PPT.pdf
Final PPT.pdf
 
Comparative Analysis of Tuning Hyperparameters in Policy-Based DRL Algorithm ...
Comparative Analysis of Tuning Hyperparameters in Policy-Based DRL Algorithm ...Comparative Analysis of Tuning Hyperparameters in Policy-Based DRL Algorithm ...
Comparative Analysis of Tuning Hyperparameters in Policy-Based DRL Algorithm ...
 
Presentation5
Presentation5Presentation5
Presentation5
 
C3 w1
C3 w1C3 w1
C3 w1
 
GNR638_project ppt.pdf
GNR638_project ppt.pdfGNR638_project ppt.pdf
GNR638_project ppt.pdf
 

Recently uploaded

Artificial intelligence presentation2-171219131633.pdf
Artificial intelligence presentation2-171219131633.pdfArtificial intelligence presentation2-171219131633.pdf
Artificial intelligence presentation2-171219131633.pdf
Kira Dess
 
electrical installation and maintenance.
electrical installation and maintenance.electrical installation and maintenance.
electrical installation and maintenance.
benjamincojr
 

Recently uploaded (20)

8th International Conference on Soft Computing, Mathematics and Control (SMC ...
8th International Conference on Soft Computing, Mathematics and Control (SMC ...8th International Conference on Soft Computing, Mathematics and Control (SMC ...
8th International Conference on Soft Computing, Mathematics and Control (SMC ...
 
SLIDESHARE PPT-DECISION MAKING METHODS.pptx
SLIDESHARE PPT-DECISION MAKING METHODS.pptxSLIDESHARE PPT-DECISION MAKING METHODS.pptx
SLIDESHARE PPT-DECISION MAKING METHODS.pptx
 
Basics of Relay for Engineering Students
Basics of Relay for Engineering StudentsBasics of Relay for Engineering Students
Basics of Relay for Engineering Students
 
5G and 6G refer to generations of mobile network technology, each representin...
5G and 6G refer to generations of mobile network technology, each representin...5G and 6G refer to generations of mobile network technology, each representin...
5G and 6G refer to generations of mobile network technology, each representin...
 
21scheme vtu syllabus of visveraya technological university
21scheme vtu syllabus of visveraya technological university21scheme vtu syllabus of visveraya technological university
21scheme vtu syllabus of visveraya technological university
 
analog-vs-digital-communication (concept of analog and digital).pptx
analog-vs-digital-communication (concept of analog and digital).pptxanalog-vs-digital-communication (concept of analog and digital).pptx
analog-vs-digital-communication (concept of analog and digital).pptx
 
Artificial Intelligence in due diligence
Artificial Intelligence in due diligenceArtificial Intelligence in due diligence
Artificial Intelligence in due diligence
 
NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024
NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024
NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024
 
Artificial intelligence presentation2-171219131633.pdf
Artificial intelligence presentation2-171219131633.pdfArtificial intelligence presentation2-171219131633.pdf
Artificial intelligence presentation2-171219131633.pdf
 
electrical installation and maintenance.
electrical installation and maintenance.electrical installation and maintenance.
electrical installation and maintenance.
 
Seismic Hazard Assessment Software in Python by Prof. Dr. Costas Sachpazis
Seismic Hazard Assessment Software in Python by Prof. Dr. Costas SachpazisSeismic Hazard Assessment Software in Python by Prof. Dr. Costas Sachpazis
Seismic Hazard Assessment Software in Python by Prof. Dr. Costas Sachpazis
 
15-Minute City: A Completely New Horizon
15-Minute City: A Completely New Horizon15-Minute City: A Completely New Horizon
15-Minute City: A Completely New Horizon
 
Raashid final report on Embedded Systems
Raashid final report on Embedded SystemsRaashid final report on Embedded Systems
Raashid final report on Embedded Systems
 
Diploma Engineering Drawing Qp-2024 Ece .pdf
Diploma Engineering Drawing Qp-2024 Ece .pdfDiploma Engineering Drawing Qp-2024 Ece .pdf
Diploma Engineering Drawing Qp-2024 Ece .pdf
 
Independent Solar-Powered Electric Vehicle Charging Station
Independent Solar-Powered Electric Vehicle Charging StationIndependent Solar-Powered Electric Vehicle Charging Station
Independent Solar-Powered Electric Vehicle Charging Station
 
UNIT 4 PTRP final Convergence in probability.pptx
UNIT 4 PTRP final Convergence in probability.pptxUNIT 4 PTRP final Convergence in probability.pptx
UNIT 4 PTRP final Convergence in probability.pptx
 
Filters for Electromagnetic Compatibility Applications
Filters for Electromagnetic Compatibility ApplicationsFilters for Electromagnetic Compatibility Applications
Filters for Electromagnetic Compatibility Applications
 
Dynamo Scripts for Task IDs and Space Naming.pptx
Dynamo Scripts for Task IDs and Space Naming.pptxDynamo Scripts for Task IDs and Space Naming.pptx
Dynamo Scripts for Task IDs and Space Naming.pptx
 
Software Engineering Practical File Front Pages.pdf
Software Engineering Practical File Front Pages.pdfSoftware Engineering Practical File Front Pages.pdf
Software Engineering Practical File Front Pages.pdf
 
Augmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptxAugmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptx
 

Deep presentation.pptx

  • 1. Deep Reinforcement Learning for Self driving Car using Unreal engine environment - Priyanshu Jain(SC17B113) - Ashish Bansal (SC17B078)
  • 2. Introduction ● Self driving car has become one of the most prominent topic in today's world. ● We are trying to implement self driving car using Deep-Q network which has one front camera with field of view of 110 degree to get glance of real life driver ,we are then using Deep neural network architecture to classify image into 5 classes which is then used by deep Q network to get the Q table. ● The objective of this project is to allow the car to identify safe way of driving using a Deep Neural Network.
  • 3. Introduction to Carla ● CARLA is an unreal engine based environment aimed at assisting Deep learning studies ● The CARLA simulator consists of a scalable client-server architecture. ● The server is responsible for everything related with the simulation itself - sensor rendering, computation of physics, updates on the world-state and its actors and much more. As it aims for realistic result. ● Control over the simulation is granted through an API handled in Python and C++
  • 4. Deep-Q learning ● In deep Q-learning, we use a neural network to approximate the Q-value function. The state is given as the input and the Q-value of all possible actions is generated as the output. ● The DeepMind system used a deep convolutional neural network, with layers of tiled convolutional filters to mimic the effects of receptive fields. ● We are comparing two architecture of deep-q learning first with simple convnet and second with Xception model for CNN
  • 5.
  • 6. DNN implemented Need of CNN: -To label each state - original size 640X480
  • 7. Policy Actions 1)Car stops (break). 2)Car accelerates forward(full throttle) 3)Car accelerates forward (partial forward with left steer) 4)Car accelerates forward (partial forward with right steer) 5)Car remains in its previous state (no brake , no throttle, no steer) Awards Based on speed ( to avoid any loophole that network may find) e.g: elif KPH == 40: done = False (continue episode if false) reward = 1
  • 8. Simulations ● We started training the deep neural network from scratch using random parameter from uniform distribution but as no. of episodes increased model learned to train itself to drive a car. ● Simulation environment consist of Carla unreal simulator where the actor(Car) is placed with front camera (RGB) only . ● We made the environment to have collision sensor attached to car and using it as parameter to detect collision and update necessary weight.
  • 9. Results ● Final results after training are very interesting as car taught itself to drive starting from random action. ● We can also see with only 500 episodes we were able to get some exciting turns to protect itself from collision. ● We started from random samples in start but we can see learning to drive and take turn is really interesting.
  • 10. Comparative study between different architecture ● We can observe that for Xception the car started to rotate continuously and not moving straight , this can be because of complexity of model and the parameter is not changing much due to very complex model. ● We decided to work with 3 layer conv network described above and results were better..After training with conv network,car was not only moving straight perfectly ,sometime it was able to avoid obstacles and go ahead. ● Deep-Q is training as well as GPU intensive task ,we trained for 500 episodes and as we can see in next slide the trend of rewards ,the car is learning to drive itself better with every episode.
  • 11.
  • 12. Testing of Convolution model with 3 layers
  • 14. References [1] https://towardsdatascience.com/deep-q-learning-tutorial-mindqn- 2a4c855abffc [2] https://www.analyticsvidhya.com/blog/2019/04/introduction-deep-q- learning-python/ [3] implementation concept learned from : https://pythonprogramming.net/ [4] https://www.geeksforgeeks.org/deep-q-learning/ [5] https://en.wikipedia.org/wiki/Q-learning [6] https://en.wikipedia.org/wiki/Deep-reinforcement-learning