This document provides an introduction to Hadoop and big data concepts. It explains that Hadoop is a framework for processing large amounts of data across commodity hardware in a massively parallel fashion. Key terms related to Hadoop like MapReduce, HDFS, Hive, and Pig are defined. The document also notes what Hadoop is not good for, such as low latency queries or small datasets. Finally, it provides a high-level overview of HDFS, Cloudera, and Hive to demonstrate some of Hadoop's technologies.