This document provides an introduction to support vector machines (SVM). It discusses the history and key concepts of SVM, including how SVM finds the optimal separating hyperplane with maximum margin between classes to perform linear classification. It also describes how SVM can learn nonlinear decision boundaries using kernel tricks to implicitly map inputs to high-dimensional feature spaces. The document gives examples of commonly used kernel functions and outlines the steps to perform classification with SVM.