Serengeti is an open-source project, initiated by VMware, to enable the rapid deployment of Hadoop clusters in virtual environments. While Hadoop clusters are typically run on physical machines, Serengeti aims to bridge Hadoop and virtualization, and bring the classic benefits of virtualization to the Hadoop user. Leveraging virtual machines, Serengeti-deployed clusters can be simply operated, configured for HA protection, and made elastic through the decoupling of Hadoop compute and data layers. In this talk, we explore each of these aspects of running Hadoop on a virtual platform. Presenter: Kevin Leong, Product Manager, VMware