This document describes a new spatial-temporal attention graph convolution network (STAGCN) for traffic forecasting. STAGCN contains three key components: 1) a graph learning layer that learns both a static graph capturing global spatial relationships and a dynamic graph capturing local spatial changes, 2) an adaptive graph convolution layer, and 3) a gated temporal attention module using causal-trend attention to model long-term temporal dependencies by focusing on important historical information. Experimental results on four traffic datasets show STAGCN achieves better prediction accuracy than existing methods.