This document covers acceptance testing and state compatibility checking for Flink jobs at Yelp. It describes Flink Compose, a testing framework that Yelp built on top of Yelp-compose to make Flink jobs easier to test. The framework handles common tasks such as setting up dependencies, and it lets tests submit jobs to a Flink cluster. The document then covers lessons learned about the ordering of operations and about event time versus processing time, along with best practices such as publishing common images and running tests in parallel. It concludes with state compatibility checking, a further test that catches problems with restoring state from a savepoint during upgrades.
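To make the idea of a state compatibility check concrete, here is a minimal sketch in Python. It is not Yelp's implementation and does not use any Flink API. It assumes a hypothetical, simplified description of a job's state: a mapping from operator UID to the names and declared types of that operator's state. Flink matches savepoint state to operators by operator UID, so the sketch flags operators that hold state in the savepoint but are missing from the new job, and state whose declared type changed. Comparing type names is only a stand-in for Flink's real serializer compatibility checks.

```python
from typing import Dict, List

# Hypothetical model (an assumption for illustration):
# operator UID -> {state name -> declared type name}.
StateLayout = Dict[str, Dict[str, str]]


def check_state_compatibility(savepoint: StateLayout, new_job: StateLayout) -> List[str]:
    """Return human-readable problems; an empty list means none were found."""
    problems: List[str] = []
    for uid, states in sorted(savepoint.items()):
        if uid not in new_job:
            # State in the savepoint has no operator to be restored into.
            problems.append(
                f"operator '{uid}' is in the savepoint but not in the new job"
            )
            continue
        for name, old_type in sorted(states.items()):
            new_type = new_job[uid].get(name)
            # Only flag state present on both sides with a different type.
            if new_type is not None and new_type != old_type:
                problems.append(
                    f"state '{uid}.{name}' changed type: {old_type} -> {new_type}"
                )
    return problems
```

A check like this could run in a test before an upgrade, failing the build when the returned list is non-empty. The operator UIDs and type names fed into it would have to come from the jobs themselves; how they are extracted is outside the scope of this sketch.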