Visual Transformers: Where Do Transformers Really Belong in Vision Models?

Bichen Wu, Chenfeng Xu, Xiaoliang Dai, Alvin Wan, Peizhao Zhang, Zhicheng Yan, Masayoshi Tomizuka, Joseph Gonzalez, Kurt Keutzer, Peter Vajda. Visual Transformers: Where Do Transformers Really Belong in Vision Models?. In 2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021, Montreal, QC, Canada, October 10-17, 2021. Pages 579-589, IEEE, 2021.

@inproceedings{WuXDWZYT0KV21,
  title = {Visual Transformers: Where Do Transformers Really Belong in Vision Models?},
  author = {Bichen Wu and Chenfeng Xu and Xiaoliang Dai and Alvin Wan and Peizhao Zhang and Zhicheng Yan and Masayoshi Tomizuka and Joseph Gonzalez and Kurt Keutzer and Peter Vajda},
  year = {2021},
  doi = {10.1109/ICCV48922.2021.00064},
  url = {https://doi.org/10.1109/ICCV48922.2021.00064},
  researchr = {https://researchr.org/publication/WuXDWZYT0KV21},
  cites = {0},
  citedby = {0},
  pages = {579--589},
  booktitle = {2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021, Montreal, QC, Canada, October 10-17, 2021},
  publisher = {IEEE},
  isbn = {978-1-6654-2812-5},
}