Long Range Language Modeling via Gated State Spaces

Harsh Mehta, Ankit Gupta 0001, Ashok Cutkosky, Behnam Neyshabur. Long Range Language Modeling via Gated State Spaces. In The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. OpenReview.net, 2023. [doi]

Abstract

Abstract is missing.