This document summarizes key concepts in machine learning evaluation including: 1. Common evaluation metrics like accuracy, precision, recall, and ROC curves. 2. Offline evaluation techniques like cross-validation to estimate model performance. 3. Hyperparameter tuning to optimize model configuration. 4. A/B testing to evaluate model impact by comparing control and experiment groups. 5. Causal effect measurement techniques like propensity score matching to estimate treatment effects.