Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet

Li Yuan 0007, Yunpeng Chen, Tao Wang, Weihao Yu, Yujun Shi, Zihang Jiang, Francis E. H. Tay, Jiashi Feng, Shuicheng Yan. Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet. In 2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021, Montreal, QC, Canada, October 10-17, 2021. pages 538-547, IEEE, 2021. [doi]

Authors

Li Yuan 0007

This author has not been identified. Look up 'Li Yuan 0007' in Google

Yunpeng Chen

This author has not been identified. Look up 'Yunpeng Chen' in Google

Tao Wang

This author has not been identified. Look up 'Tao Wang' in Google

Weihao Yu

This author has not been identified. Look up 'Weihao Yu' in Google

Yujun Shi

This author has not been identified. Look up 'Yujun Shi' in Google

Zihang Jiang

This author has not been identified. Look up 'Zihang Jiang' in Google

Francis E. H. Tay

This author has not been identified. Look up 'Francis E. H. Tay' in Google

Jiashi Feng

This author has not been identified. Look up 'Jiashi Feng' in Google

Shuicheng Yan

This author has not been identified. Look up 'Shuicheng Yan' in Google