The document presents a deep reinforcement learning (RL) approach called DEAR, aimed at optimizing online advertising impressions within recommendation systems by maximizing revenue while minimizing negative impacts on user experience. It discusses challenges in existing advertising techniques, proposes a Markov decision process-based model, and details the architecture and training of the DEAR framework, including a novel dataset and various experimental metrics. The findings indicate that DEAR significantly improves advertising performance despite some remaining questions about model transparency and dataset design.