Sumformer: Universal Approximation for Efficient Transformers

Silas Alberti, Niclas Dern, Laura Thesing, Gitta Kutyniok. Sumformer: Universal Approximation for Efficient Transformers. In Timothy Doster, Tegan Emerson, Henry Kvinge, Nina Miolane, Mathilde Papillon, Bastian Rieck, Sophia Sanborn, editors, Topological, Algebraic and Geometric Learning Workshops 2023, 28 July 2023, Honolulu, HI, USA. Volume 221 of Proceedings of Machine Learning Research, pages 72-86, PMLR, 2023. [doi]

@inproceedings{AlbertiDTK23,
  title = {Sumformer: Universal Approximation for Efficient Transformers},
  author = {Silas Alberti and Niclas Dern and Laura Thesing and Gitta Kutyniok},
  year = {2023},
  url = {https://proceedings.mlr.press/v221/alberti23a.html},
  researchr = {https://researchr.org/publication/AlbertiDTK23},
  cites = {0},
  citedby = {0},
  pages = {72-86},
  booktitle = {Topological, Algebraic and Geometric Learning Workshops 2023, 28 July 2023, Honolulu, HI, USA},
  editor = {Timothy Doster and Tegan Emerson and Henry Kvinge and Nina Miolane and Mathilde Papillon and Bastian Rieck and Sophia Sanborn},
  volume = {221},
  series = {Proceedings of Machine Learning Research},
  publisher = {PMLR},
}