This document describes an ATSC (Adaptive Traffic System Control) project that uses MARL (Multi-Agent Reinforcement Learning) to control traffic signals at intersections. It proposes using an A2C (Advantage Actor-Critic) method where each intersection has an independent agent. The agents aim to minimize wait times and queue lengths at their intersection and neighboring intersections. The network architecture includes layers to handle wave, wait, and neighboring policies before combining them into an LSTM layer for training the actor and critic agents separately.