PaCa-ViT: Learning Patch-to-Cluster Attention in Vision Transformers

Ryan Grainger, Thomas Paniagua, Xi Song, Naresh Cuntoor, Mun Wai Lee, Tianfu Wu. PaCa-ViT: Learning Patch-to-Cluster Attention in Vision Transformers. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, BC, Canada, June 17-24, 2023. Pages 18568-18578, IEEE, 2023. doi:10.1109/CVPR52729.2023.01781

@inproceedings{GraingerPSCL023,
  title = {PaCa-ViT: Learning Patch-to-Cluster Attention in Vision Transformers},
  author = {Ryan Grainger and Thomas Paniagua and Xi Song and Naresh Cuntoor and Mun Wai Lee and Tianfu Wu},
  year = {2023},
  doi = {10.1109/CVPR52729.2023.01781},
  url = {https://doi.org/10.1109/CVPR52729.2023.01781},
  researchr = {https://researchr.org/publication/GraingerPSCL023},
  cites = {0},
  citedby = {0},
  pages = {18568--18578},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, BC, Canada, June 17-24, 2023},
  publisher = {IEEE},
  isbn = {979-8-3503-0129-8},
}