This document introduces Apache Pig, a platform for analyzing large datasets on Hadoop MapReduce. It covers what Pig is, why to use it, how to install and set it up, and its components, including the Pig Latin language and the Pig engine. It also gives example Pig Latin scripts for common data analysis tasks such as filtering, grouping, joining, and ordering data.