SVMs are classifiers derived from statistical learning theory that maximize the margin between decision boundaries. They work by mapping data to a higher dimensional space using kernels to allow for nonlinear decision boundaries. SVMs find the linear decision boundary that maximizes the margin between the closest data points of each class by solving a quadratic programming optimization problem.