This document discusses evaluating deep Q-learning models for sequential targeted marketing. It presents using deep Q-learning to model the marketing environment and learn optimal policies for targeting customers. The models are trained on a direct marketing dataset and evaluated against a K-means cluster simulator in a 10-fold cross validation process, with the deep Q-learning models outperforming baseline models in most tests. Future work focuses on automating the evaluation at scale, optimizing the neural network architecture, and improving the evaluation simulator.