This document discusses big data and Hadoop. It notes that in the last 5 years, more data has been generated than all of humanity previously. It then provides examples of the scale of data generated on Google, YouTube, and worldwide daily. The document goes on to discuss how big data is being used in various domains like politics, healthcare, banking, and more. It defines big data using IBM's 3V+1 framework and introduces Hadoop as an open source software framework for distributed storage and processing of large datasets across clusters of computers.