This document discusses evaluation metrics for machine learning models. It covers classification metrics like accuracy, precision, recall and F1 score. Regression metrics like MAE, MSE and R2 are also discussed. The document stresses the importance of proper evaluation methodology, including using representative test sets, comparing to baselines, and testing for statistical significance. Good practices outlined include task-driven not algorithm-driven evaluation and using established datasets and metrics.