Impala is an open-source SQL query engine for Hadoop that is designed for performance. It utilizes standard Hadoop components like HDFS, HBase, and YARN. Impala allows users to issue SQL queries against data stored in HDFS and HBase and returns results very quickly. It exposes industry-standard interfaces that allow business intelligence tools to connect. Impala has added many new features in recent versions like analytic functions, subqueries, and support for joining and aggregating data that can spill to disk.