This document provides an introduction to big data, Hadoop, and NoSQL databases. It defines big data as large, diverse, and growing datasets that are difficult to process using traditional databases. Hadoop is an open-source software framework for distributed storage and processing of big data across commodity hardware. It includes HDFS for storage and MapReduce as a programming model. NoSQL databases are non-tabular databases designed for high performance on large datasets. They are more flexible and scalable than SQL databases but provide fewer consistency guarantees.