This document discusses big data analysis and data science. It introduces common data analysis techniques such as predictive modeling, machine learning, and recommendation systems. It also surveys tools for working with big data, including Hadoop, HDFS, Pig, HBase, and Mahout, along with languages such as R and Python. As a worked example, it builds a recommendation system from streaming data that is ingested through Flume, stored in HDFS, and analyzed with Pig and HBase.
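To make the recommendation example more concrete, here is a minimal in-memory sketch of one common approach, item co-occurrence counting. It is not the document's Pig/HBase pipeline: the function names, the `(user, item)` event format, and the sample data are all assumptions made for illustration, and a real deployment would read events from HDFS rather than a Python list.

```python
from collections import defaultdict
from itertools import combinations


def cooccurrence_counts(events):
    """Build per-user item sets and item-item co-occurrence counts.

    `events` is an iterable of (user, item) pairs (a hypothetical format;
    the original document does not specify one).
    """
    items_by_user = defaultdict(set)
    for user, item in events:
        items_by_user[user].add(item)

    counts = defaultdict(lambda: defaultdict(int))
    for items in items_by_user.values():
        # Count each unordered pair of items seen by the same user once.
        for a, b in combinations(sorted(items), 2):
            counts[a][b] += 1
            counts[b][a] += 1
    return items_by_user, counts


def recommend(user, items_by_user, counts, top_n=3):
    """Rank unseen items by summed co-occurrence with the user's items."""
    seen = items_by_user.get(user, set())
    scores = defaultdict(int)
    for item in seen:
        for other, n in counts[item].items():
            if other not in seen:
                scores[other] += n
    # Highest score first; ties broken alphabetically for stable output.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [item for item, _ in ranked[:top_n]]


# Illustrative sample events (made up for this sketch).
events = [
    ("u1", "A"), ("u1", "B"),
    ("u2", "A"), ("u2", "B"), ("u2", "C"),
    ("u3", "B"), ("u3", "C"), ("u3", "D"),
]
items_by_user, counts = cooccurrence_counts(events)
print(recommend("u1", items_by_user, counts))
```

In a Hadoop setting, the same two steps (grouping items by user, then counting item pairs) map naturally onto group-and-join operations, which is why this style of recommender is often used with batch tools.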