The following publications are possibly variants of this publication:
- Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning. Weicong Liang, Yuhui Yuan, Henghui Ding, Xiao Luo, Weihong Lin, Ding Jia, Zheng Zhang, Chao Zhang, Han Hu. nips 2022
- Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions. Wenhai Wang, Enze Xie, Xiang Li 0028, Deng-Ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo 0002, Ling Shao 0001. iccv 2021: 548-558
- EViT: Expediting Vision Transformers via Token Reorganizations. Youwei Liang, Chongjian Ge, Zhan Tong, Yibing Song, Jue Wang 0001, Pengtao Xie. iclr 2022