This document discusses using less training data for defect prediction models. It finds that simple learners like Naive Bayes can achieve good performance using only small samples of data, and that oversampling and undersampling techniques do not significantly harm classifier performance. The document advocates increasing the information content in data rather than using more complex learners or larger datasets to further improve predictions.