This document provides an overview of big data and Hadoop. It defines big data as large volumes of diverse data that cannot be processed by traditional systems. Key characteristics are volume, velocity, variety, and veracity. Popular sources of big data include social media, emails, videos, and sensor data. Hadoop is presented as an open-source framework for distributed storage and processing of large datasets across clusters of computers. It uses HDFS for storage and MapReduce as a programming model. Major tech companies like Google, Facebook, and Amazon are discussed as big players in big data.