This document provides an overview of data clustering techniques from a statistical pattern recognition perspective. Clustering is the unsupervised classification of patterns into groups based on similarity. The key components of clustering include pattern representation, similarity measures, clustering techniques, and cluster validation. A wide variety of clustering techniques have been developed, each with different assumptions and applications. Selecting an appropriate clustering technique requires an understanding of the data and domain expertise.