This document discusses powering TensorFlow with big data tools like Apache Spark, Apache Flink, and Apache Beam. It provides an overview of TensorFlowOnSpark and how Apache Arrow can improve data interchange performance. It also covers the current state of non-JVM support in Beam and what this means for TensorFlow. The document discusses how data processing frameworks like TFX fit with TensorFlow and pre-processing tools like TF.Transform. It concludes by discussing options for using big data tools from outside the JVM like Kafka and Dask.