The document provides an introduction to big data and data mining. It defines big data as massive volumes of structured and unstructured data that are difficult to process using traditional techniques. Data mining is described as finding new and useful information within large amounts of data. The document then discusses characteristics of big data like volume, variety and velocity. It also outlines challenges of big data like privacy and hardware resources. Finally, it presents tools for big data mining and analysis like Hadoop, Apache S4 and Mahout.