This document summarizes a tutorial on visual object recognition. It discusses several key topics:
1. Detection via classification using sliding windows and global appearance features like histograms or gradients.
2. Local invariant features for detection and description, as well as using them for specific object recognition.
3. Visual words and "bags of words" representations for image categorization by clustering local features.
4. Current challenges in visual object recognition like handling scale, clutter, context and learning with minimal supervision.