This document discusses a research talk on efficiently categorizing text documents using a massively parallel machine learning model. The model was used to classify over two million Japanese newspaper articles into 75 categories using advanced computing resources. Experiments on an IBM server showed that the proposed parallel model effectively categorized large text corpora, comparing different feature extraction methods and classifiers.