This document provides an overview of machine learning and big data concepts. It discusses machine learning applications and the data science pipeline of data collection, feature selection, model building, evaluation, and inference. Feature selection techniques for big data are covered, including subset selection and optimality criteria. Different machine learning modelling approaches are described, such as linear vs nonlinear, deterministic vs stochastic, parametric vs nonparametric, and frequentist vs Bayesian techniques. Supervised learning methods like regression and classification are explained. The document concludes with a discussion of model evaluation and performance.