This document gives an overview of Apache Spark and Spark SQL. It notes that Spark can run programs faster than Hadoop MapReduce, whether data is held in memory or on disk, and that it supports SQL, streaming, and complex analytics. It then covers Spark SQL, Spark's module for working with structured data. Spark SQL lets users mix SQL queries with Spark programs and connect to data sources such as Hive, Avro, and Parquet. It also offers a server mode that lets clients connect to Spark SQL over JDBC. The document presents Spark SQL as a distributed SQL engine that can integrate with BI tools.