This document discusses using a multi-column convolutional neural network (MCNN) for object detection in videos. The MCNN approach is compared to other methods like CNN and HOG-BOW-Gray pooling and is shown to achieve over 95% accuracy for pedestrian detection. The document outlines extracting frames from videos, dividing images into regions, classifying regions using CNNs, and combining results to detect objects. The MCNN approach is concluded to be useful for applications like medical imaging due to its high detection accuracy.