Low-Rank Bottleneck in Multi-head Attention Models

Srinadh Bhojanapalli, Chulhee Yun, Ankit Singh Rawat, Sashank J. Reddi, Sanjiv Kumar. Low-Rank Bottleneck in Multi-head Attention Models. In Proceedings of the 37th International Conference on Machine Learning, ICML 2020, 13-18 July 2020, Virtual Event. Volume 119 of Proceedings of Machine Learning Research, pages 864-873, PMLR, 2020.

@inproceedings{BhojanapalliYRR20,
  title = {Low-Rank Bottleneck in Multi-head Attention Models},
  author = {Srinadh Bhojanapalli and Chulhee Yun and Ankit Singh Rawat and Sashank J. Reddi and Sanjiv Kumar},
  year = {2020},
  url = {http://proceedings.mlr.press/v119/bhojanapalli20a.html},
  researchr = {https://researchr.org/publication/BhojanapalliYRR20},
  cites = {0},
  citedby = {0},
  pages = {864--873},
  booktitle = {Proceedings of the 37th International Conference on Machine Learning, ICML 2020, 13-18 July 2020, Virtual Event},
  volume = {119},
  series = {Proceedings of Machine Learning Research},
  publisher = {PMLR},
}