Linear attention is (maybe) all you need (to understand Transformer optimization)

Kwangjun Ahn, Xiang Cheng, Minhak Song, Chulhee Yun, Ali Jadbabaie, Suvrit Sra. Linear attention is (maybe) all you need (to understand Transformer optimization). In The Twelfth International Conference on Learning Representations, ICLR 2024, Vienna, Austria, May 7-11, 2024. OpenReview.net, 2024.
