Pig is a platform for analyzing large datasets that runs on Hadoop. Its language, Pig Latin, describes data flows and transformations: a script expresses an analysis workflow as a sequence of operations such as joins and groups. Pig Latin resembles SQL, but it spells out how data flows from one operation to the next instead of nesting the logic in an inside-out query. By pairing a simple language with execution on parallel hardware, Pig makes data analysis on Hadoop easier.
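
The difference between a step-by-step data flow and a nested query can be illustrated with a small sketch. The example is plain Python rather than Pig Latin, and it runs on a single machine rather than on Hadoop; it only mimics the shape of a data-flow script, where each named intermediate result feeds the next operation. The `users` and `visits` records and all variable names are hypothetical, chosen for illustration.

```python
from collections import defaultdict

# Hypothetical sample data (illustration only, not from any real dataset).
users = [(1, "ana"), (2, "bo"), (3, "cy")]                          # (id, name)
visits = [(1, "/home"), (1, "/docs"), (2, "/home"), (4, "/home")]   # (user_id, url)

# Step 1: inner join of visits with users on the user id.
# This plays the role a join operation would play in a data-flow script.
names = dict(users)
joined = [(uid, names[uid], url) for uid, url in visits if uid in names]

# Step 2: group the joined records by user name.
grouped = defaultdict(list)
for uid, name, url in joined:
    grouped[name].append(url)

# Step 3: reduce each group to a count of visits.
counts = {name: len(urls) for name, urls in grouped.items()}

print(counts)  # {'ana': 2, 'bo': 1}
```

Each step reads top to bottom in the order the data moves: join, then group, then count. An equivalent SQL query would typically be written as one statement with the join nested inside the aggregation, which is the "inside-out" shape the paragraph contrasts with.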