This document provides an overview of Apache Apex, an open source unified streaming and fast batching platform. It discusses key aspects of Apex including its application programming model using operators and directed acyclic graphs, native Hadoop integration using YARN and HDFS, partitioning and scaling operators for high throughput, windowing support, fault tolerance, and data locality features. Examples of building a data processing pipeline and its logical and physical plans are also presented.