Sumformer: Universal Approximation for Efficient Transformers

Silas Alberti, Niclas Dern, Laura Thesing, Gitta Kutyniok. Sumformer: Universal Approximation for Efficient Transformers. In Timothy Doster, Tegan Emerson, Henry Kvinge, Nina Miolane, Mathilde Papillon, Bastian Rieck, Sophia Sanborn, editors, Topological, Algebraic and Geometric Learning Workshops 2023, 28 July 2023, Honolulu, HI, USA. Volume 221 of Proceedings of Machine Learning Research, pages 72-86, PMLR, 2023.
