This document summarizes an exemplar model for learning object classes from weakly supervised training data. The model learns a region of interest (ROI) around each object instance by initializing the ROI based on the locations of the most discriminative visual words, then refining the ROI through iterative optimization. To detect objects, the model scores hypotheses based on the number of exemplar images consistent with each hypothesis and uses the highest scoring hypotheses to detect objects.