The FFN as a Key-Value Memory: Functional Specialization in Transformer Computation

Zaryab Rahman, Fakhrud Din, Shah Khalid, Rishi Karthikeyan. The FFN as a Key-Value Memory: Functional Specialization in Transformer Computation. Machine Learning, 115(1):2, January 2026.
