Pig is a platform for analyzing large datasets that sits on top of Hadoop. It allows users to write scripts in Pig Latin, a language similar to SQL, to transform and analyze their data without needing to write Java code. Pig scripts are compiled into sequences of MapReduce jobs that process data in parallel across a Hadoop cluster. Key features of Pig include data filtering, joining, grouping, and the ability to extend it with custom user-defined functions.