Data aggregation / transformation / refinement
Training corpora
Tokenization process
Attributes and instances
Vector space modeling / Bag of words
Feature selection
Machine learning algorithms
Classifiers – binary / multi-class, multi-label / problem transformation methods
Learning evaluation
Quality Management workflow / “Human in the loop” supervised learning
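
To make the tokenization, attributes and instances, and bag-of-words topics listed above concrete, here is a minimal sketch in plain Python. The function names (`tokenize`, `build_vocabulary`, `vectorize`) and the two-sentence corpus are hypothetical, chosen only for illustration. The regex tokenizer is one simple assumption among many possible tokenization schemes.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_vocabulary(documents):
    """Map each distinct token (an attribute) to a column index, in alphabetical order."""
    tokens = sorted({tok for doc in documents for tok in tokenize(doc)})
    return {tok: i for i, tok in enumerate(tokens)}


def vectorize(text, vocabulary):
    """Turn one document (an instance) into a vector of token counts."""
    counts = Counter(tokenize(text))
    # Dicts preserve insertion order, so columns follow the vocabulary indices.
    return [counts.get(tok, 0) for tok in vocabulary]


# Toy training corpus, invented for this example.
corpus = ["the cat sat on the mat", "the dog sat"]
vocab = build_vocabulary(corpus)
vectors = [vectorize(doc, vocab) for doc in corpus]

print(vocab)    # {'cat': 0, 'dog': 1, 'mat': 2, 'on': 3, 'sat': 4, 'the': 5}
print(vectors)  # [[1, 0, 1, 1, 1, 2], [0, 1, 0, 0, 1, 1]]
```

Each row of `vectors` is one instance and each column is one attribute. Word order is discarded, which is the defining simplification of the bag-of-words representation. Feature selection would then drop uninformative columns before a classifier is trained on the rows.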