This document discusses streaming SQL and how it can be used to query streaming data sources like IoT devices, web servers, and databases. Some key points discussed include:
- Streaming SQL extends standard SQL to work over both streaming and static data sources. It allows queries to be executed continuously over streaming data.
- The replay principle states that streaming queries should produce the same results as equivalent non-streaming queries over the same static data. Techniques like watermarks and monotonic columns help ensure this.
- Windowing functions allow aggregating over sliding windows of records in a stream. Various window types like tumbling and hopping windows are described.
- Apache Calcite is an open source framework that can optimize streaming SQL queries