Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection

Yuxin Fang, Shusheng Yang, Shijie Wang, Yixiao Ge, Ying Shan, Xinggang Wang. Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection. In IEEE/CVF International Conference on Computer Vision, ICCV 2023, Paris, France, October 1-6, 2023, pages 6221-6230. IEEE, 2023.
