The document introduces Relay, a new intermediate representation (IR) for machine learning frameworks. Relay aims to provide both the static graph optimizations of frameworks like TensorFlow as well as the dynamic graph expressiveness of frameworks like PyTorch. It serves as a common IR that can be lowered to hardware backends like CUDA, OpenCL, and deployed models.