Bichen Wu, Chenfeng Xu, Xiaoliang Dai, Alvin Wan, Peizhao Zhang, Zhicheng Yan, Masayoshi Tomizuka, Joseph Gonzalez 0001, Kurt Keutzer, Peter Vajda. Visual Transformers: Where Do Transformers Really Belong in Vision Models?. In 2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021, Montreal, QC, Canada, October 10-17, 2021. pages 579-589, IEEE, 2021. [doi]