Deep Reinforcement Learning
By Sachin Bijalwan
End-to-end deep learning
● Predicts steering angles directly from the camera image
● Supervised learning
● Trained on logged driving data (see the sketch below)
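A minimal sketch of this supervised setup, assuming PyTorch; the stand-in network, the 66x200 input size, and the learning rate are illustrative, not from the slides. The network maps a camera frame to one angle, regressed onto the angle logged from a human driver:

```python
import torch
import torch.nn as nn

# Stand-in image-to-angle network; any CNN that outputs one value works here.
# The 3x66x200 input size is an assumption.
model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 66 * 200, 1))
opt = torch.optim.Adam(model.parameters(), lr=1e-4)  # learning rate assumed

def supervised_step(frames, human_angles):
    """One training step: regress the predicted steering angle onto the
    angle logged from a human driver (mean squared error)."""
    opt.zero_grad()
    loss = nn.functional.mse_loss(model(frames).squeeze(1), human_angles)
    loss.backward()
    opt.step()
    return loss.item()
```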
Motivation
● Scenario: what if we want our model to crash into a tree rather than a human?
● What if our model swerves to avoid a harmless leaf lying under the car?
● What if we want our model to take minimal turns?
Deep Reinforcement Learning
● Collects its own data through experimentation
● Not supervised: learns from a reward signal rather than labeled examples
DeepMind’s paper
Deep Reinforcement Learning in a Self-Driving Car
● The environment is complex
● Outputs (steering angles) are continuous
● The model is much deeper
● Many hyperparameters to tune
Architecture
● Previous architecture: 4 convolutional layers + 5 fully connected layers (sketched below)
● Removed the dropout layer
● Removed data augmentation
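A sketch of such a network in PyTorch. Only the layer counts (4 convolutional + 5 fully connected, no dropout) come from the slide; the channel counts, kernel sizes, and the 66x200 input are assumptions loosely following NVIDIA's PilotNet:

```python
import torch
import torch.nn as nn

# Layer counts per the slide: 4 convolutional + 5 fully connected, no dropout.
# Channel/kernel sizes below are assumed, loosely following NVIDIA's PilotNet.
driving_net = nn.Sequential(
    nn.Conv2d(3, 24, kernel_size=5, stride=2), nn.ReLU(),
    nn.Conv2d(24, 36, kernel_size=5, stride=2), nn.ReLU(),
    nn.Conv2d(36, 48, kernel_size=5, stride=2), nn.ReLU(),
    nn.Conv2d(48, 64, kernel_size=3), nn.ReLU(),
    nn.Flatten(),
    nn.LazyLinear(1164), nn.ReLU(),   # input size inferred on first forward pass
    nn.Linear(1164, 100), nn.ReLU(),
    nn.Linear(100, 50), nn.ReLU(),
    nn.Linear(50, 10), nn.ReLU(),
    nn.Linear(10, 1),                 # single steering-angle output
)

angle = driving_net(torch.randn(1, 3, 66, 200))  # e.g. one 66x200 RGB frame
```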
Hill Climbing with Nvidia’s model
● The agent initially drives using the model's randomly initialized weights
● Each image and its predicted angle are stored
● When the agent crashes into a wall, the run ends ("dies")
● Take the last 30 images and their predictions
● Generate random values as new target predictions
● Retrain the model on these targets
● Validate the update by running in the simulator again (see the sketch below)
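A rough sketch of this loop. The simulator interface (run_until_crash), the train_on method, and the [-1, 1] angle range are hypothetical placeholders, since the slides don't name an API:

```python
import random

def hill_climb(model, simulator, rounds=100):
    """Hill climbing as described above. `simulator.run_until_crash` and
    `model.train_on` are hypothetical placeholders for the real simulator
    and training code."""
    for _ in range(rounds):
        # Drive until the agent hits a wall, keeping every frame and the
        # angle the model predicted for it.
        frames, predicted_angles = simulator.run_until_crash(model)
        # Keep the last 30 frames before the crash and assign them random
        # candidate angles as new training targets.
        last_frames = frames[-30:]
        random_targets = [random.uniform(-1.0, 1.0) for _ in last_frames]
        model.train_on(last_frames, random_targets)
        # Validation happens on the next run_until_crash call, which
        # shows whether the agent now survives longer.
```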
Deep Q Learning
● Q-value(s, a) = expected cumulative reward after taking action a in state s
● Learn the Q-values of state-action pairs
● Choose the action with the maximum Q-value
● Update equation for the Q-value (sketched in code below):
○ Q(s, a) = R + gamma * max_{a'} Q(s', a')
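A minimal tabular sketch of this update; in deep Q-learning the dictionary below is replaced by a neural network, and the discount value is an assumption:

```python
GAMMA = 0.9  # discount factor; the exact value is an assumption

def q_update(Q, s, a, reward, s_next, actions):
    """Apply the update above: Q(s, a) = R + gamma * max over a' of Q(s', a').
    Q is a dict from (state, action) pairs to value estimates; a deep
    Q-network replaces this table with a function approximator."""
    Q[(s, a)] = reward + GAMMA * max(Q.get((s_next, a2), 0.0) for a2 in actions)
```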
Episodes
● Each one is the tuple (s, a, s', R)
● Episodes are stored as they occur
● The model is trained on them later
● The memory that holds them is called replay memory (sketched below)
● Resembles how biological systems learn by replaying experiences
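A minimal replay-memory sketch, assuming a fixed capacity and uniform random sampling (both standard choices, not stated on the slide):

```python
import random
from collections import deque

class ReplayMemory:
    """Fixed-size buffer of (s, a, s_next, R) tuples. Sampling at random
    decorrelates training batches from the order the agent experienced them."""
    def __init__(self, capacity=10_000):          # capacity is an assumed value
        self.buffer = deque(maxlen=capacity)      # oldest episodes drop off

    def store(self, s, a, s_next, reward):
        self.buffer.append((s, a, s_next, reward))

    def sample(self, batch_size):
        return random.sample(self.buffer, min(batch_size, len(self.buffer)))
```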
How to train the model?
● The agent selects a random action with probability epsilon and the model's action with probability 1 - epsilon (epsilon-greedy)
● Decrease epsilon over time
● Store the episodes
● Use them later to train the model (see the sketch below)
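An epsilon-greedy sketch of this scheme. The schedule constants and the model's q_value method are assumptions for illustration:

```python
import random

EPSILON_MIN, EPSILON_DECAY = 0.05, 0.995  # assumed schedule constants
epsilon = 1.0

def select_action(model, state, actions):
    """Random action with probability epsilon, otherwise the model's
    best action (probability 1 - epsilon). `model.q_value` is a
    hypothetical accessor for the network's Q-value estimate."""
    if random.random() < epsilon:
        return random.choice(actions)
    return max(actions, key=lambda a: model.q_value(state, a))

def decay_epsilon():
    """Shrink epsilon over time so the agent explores less as it learns."""
    global epsilon
    epsilon = max(EPSILON_MIN, epsilon * EPSILON_DECAY)
```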
Challenges
● Many hyperparameters to tune
● The model takes a long time to train