This document discusses object detection in images using deep convolutional neural networks. It begins by framing object detection as classification at multiple positions and scales. The document then reviews early approaches like HOG and deformable part models before introducing R-CNN and its improvements, Fast R-CNN and Faster R-CNN, which share computation between proposals. Faster R-CNN introduces a region proposal network to generate proposals. Finally, it briefly discusses one-stage detectors like YOLO and SSD that directly predict boxes and classes.