The document describes research on using reinforcement learning for self-driving cars. It discusses using the Soft Actor Critic algorithm to train an agent in a simulated environment. Experiments are conducted in navigation tasks with and without dynamic actors. The agent is able to complete the simple navigation task but struggles in the more complex task with actors. Future work focuses on improving algorithm stability and using image inputs instead of a manual state space.