Rethinking Compression: Reduced order modelling of Latent Features in Large Language Models

Arnav Chavan, Nahush Lele, Deepak Gupta. Rethinking Compression: Reduced order modelling of Latent Features in Large Language Models. In The Second Tiny Papers Track at ICLR 2024, Tiny Papers @ ICLR 2024, Vienna, Austria, May 11, 2024. OpenReview.net, 2024. [doi]

Abstract

Abstract is missing.