Transformer Architectures in Vision [2018 ICML] Image Transformer [2019 CVPR] Video Action Transformer Network [2020 ECCV] End-to-End Object Detection with Transformers [2021 ICLR] An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale