1. Bayesian learning provides a probabilistic approach to inference based on probability distributions of quantities of interest together with observed data.
2. The maximum a posteriori (MAP) hypothesis is the most probable hypothesis given observed training data. Consistent learning algorithms that make no errors on training data will always output a MAP hypothesis under certain assumptions.
3. Bayesian learning can be used to characterize the behavior of learning algorithms like decision tree induction even when the algorithms do not explicitly manipulate probabilities.