ViTALiTy: Unifying Low-rank and Sparse Approximation for Vision Transformer Acceleration with a Linear Taylor Attention

Jyotikrishna Dass, Shang Wu, Huihong Shi, Chaojian Li, Zhifan Ye, Zhongfeng Wang, Yingyan Lin. ViTALiTy: Unifying Low-rank and Sparse Approximation for Vision Transformer Acceleration with a Linear Taylor Attention. In IEEE International Symposium on High-Performance Computer Architecture, HPCA 2023, Montreal, QC, Canada, February 25 - March 1, 2023. Pages 415-428, IEEE, 2023.

@inproceedings{DassWSLYWL23,
  title = {{ViTALiTy}: Unifying Low-rank and Sparse Approximation for Vision Transformer Acceleration with a Linear {Taylor} Attention},
  author = {Jyotikrishna Dass and Shang Wu and Huihong Shi and Chaojian Li and Zhifan Ye and Zhongfeng Wang and Yingyan Lin},
  year = {2023},
  doi = {10.1109/HPCA56546.2023.10071081},
  url = {https://doi.org/10.1109/HPCA56546.2023.10071081},
  researchr = {https://researchr.org/publication/DassWSLYWL23},
  cites = {0},
  citedby = {0},
  pages = {415--428},
  booktitle = {IEEE International Symposium on High-Performance Computer Architecture, HPCA 2023, Montreal, QC, Canada, February 25 - March 1, 2023},
  publisher = {IEEE},
  isbn = {978-1-6654-7652-2},
}