This document provides an overview of Apache Tajo, an open-source distributed SQL query engine for large-scale data analytics on Hadoop. It notes that Tajo supports ANSI SQL, both batch and interactive queries, and a range of data formats and storage systems, including HDFS, HBase, and S3. It also summarizes Tajo's architecture, its use cases, and its SQL features, such as DDL, queries, and indexes, along with its storage integration with HBase.