The document discusses running Gobblin, an open source data ingestion framework, on YARN. It provides an overview of the motivations and architecture when running Gobblin on YARN, including better resource utilization, support for Gobblin as a continuous long-running service, and better fit for streaming ingestion. Key implementation details covered include the use of Apache Helix for distributed task execution and coordination, log aggregation, and security/token management.