This document provides an overview of Spark, including:
- Spark was developed in 2009 at UC Berkeley and open sourced in 2010, with over 200 contributors.
- Spark Core is the general execution engine that other Spark functionality is built on, providing in-memory computing and supporting various programming languages.
- Spark Streaming allows data to be ingested from sources like Kafka and Flume and integrated with Spark for advanced analytics on streaming data.