This document discusses text classification and provides an overview of the key concepts. It defines text classification as predicting which predefined category a text belongs to. Popular applications include filtering emails and news articles. The document outlines supervised learning as the main approach, where a classifier is trained on manually classified examples to learn how to categorize new texts. It also covers representing texts as vectors for classification, including feature extraction, selection, and weighting. Common supervised learning algorithms mentioned are support vector machines, boosted decision stumps, random forests and naive Bayesian methods.