This document introduces big data and Hadoop. It discusses the volume, velocity, and variety properties that characterize big data. It then explains the core components of HDFS, Hadoop's distributed file system, including the NameNode and DataNodes. It also summarizes MapReduce and key Hadoop ecosystem tools such as Hive, Pig, Oozie, and HBase. It concludes that Hadoop is an open-source tool that can handle large, diverse data sets and provide fault tolerance by replicating data across nodes.
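
As a rough illustration of the MapReduce model mentioned above, the following is a minimal single-process sketch of a word count in plain Python. It does not use Hadoop or any Hadoop API; the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) and the sample input are hypothetical and chosen only for this example. In a real cluster the map and reduce steps would run in parallel on many nodes, and the framework would perform the grouping step between them.

```python
from collections import defaultdict


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group all emitted values by key (done by the framework in a real job)."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce step: combine all values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence on a list of text lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


# Hypothetical sample input, standing in for lines read from a large file.
counts = word_count(["big data big clusters", "data nodes store data"])
print(counts)
```

The sketch keeps the three phases as separate functions so that the division of work is visible: only the map and reduce functions carry job-specific logic, while the grouping step is generic.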