RegionViT: Regional-to-Local Attention for Vision Transformers

Chun-Fu Chen 0001, Rameswar Panda, Quanfu Fan. RegionViT: Regional-to-Local Attention for Vision Transformers. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net, 2022. [doi]

@inproceedings{0001PF22,
  title = {RegionViT: Regional-to-Local Attention for Vision Transformers},
  author = {Chun-Fu Chen 0001 and Rameswar Panda and Quanfu Fan},
  year = {2022},
  url = {https://openreview.net/forum?id=T__V3uLix7V},
  researchr = {https://researchr.org/publication/0001PF22},
  cites = {0},
  citedby = {0},
  booktitle = {The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022},
  publisher = {OpenReview.net},
}