MViTv2: Improved Multiscale Vision Transformers for Classification and Detection

Yanghao Li, Chao-Yuan Wu, Haoqi Fan 0001, Karttikeya Mangalam, Bo Xiong, Jitendra Malik, Christoph Feichtenhofer. MViTv2: Improved Multiscale Vision Transformers for Classification and Detection. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18-24, 2022. pages 4794-4804, IEEE, 2022. [doi]

Abstract

Abstract is missing.