This document presents class-specific Hough forests, a method for object detection that combines spatial information from object parts with class information during learning. The method trains random forests to learn the relationship between image patches and their spatial position relative to the object center. At detection time, the forests cast class-specific votes in 3D or 4D space (x,y,scale (or x,y,scale,ratio)) that are accumulated to detect objects. The method achieves state-of-the-art results on several datasets and offers advantages over related Hough-based methods such as combining spatial and class information during learning.