This chapter introduces machine learning and pattern recognition concepts. It discusses polynomial curve fitting to model data, including overfitting. It also covers probability theory concepts like joint, marginal, and conditional probability. Bayesian methods for curve fitting and decision theory are introduced. Key concepts covered include maximum likelihood, regularization, cross-validation, entropy, and the Kullback-Leibler divergence.