RegionViT: Regional-to-Local Attention for Vision Transformers

Chun-Fu Chen, Rameswar Panda, Quanfu Fan. RegionViT: Regional-to-Local Attention for Vision Transformers. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net, 2022.

Abstract

Abstract is missing.