Pig is a platform for analyzing large datasets that uses a simple declarative language to express data flow tasks. It has a nested data model of fields, tuples, bags, and maps and supports common operators like FILTER, FOREACH, JOIN, GROUP, and ORDER. User-defined functions can extend its built-in functionality. Pig compiles queries into multiple MapReduce jobs as needed to perform the work in parallel across a cluster.