Transformer Feed-Forward Layers Are Key-Value Memories

Mor Geva, Roei Schuster, Jonathan Berant, Omer Levy. Transformer Feed-Forward Layers Are Key-Value Memories. In Marie-Francine Moens, Xuanjing Huang, Lucia Specia, Scott Wen-tau Yih, editors, Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021), Virtual Event / Punta Cana, Dominican Republic, 7-11 November 2021, pages 5484-5495. Association for Computational Linguistics, 2021.
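The paper's titular claim can be sketched in a few lines: a transformer feed-forward layer computes f(x·K^T)·V, which has the same form as a key-value memory, with the rows of the first weight matrix acting as keys and the rows of the second as values. The sketch below is a minimal illustration of that algebraic correspondence, assuming a ReLU activation and omitting bias terms; all names and dimensions are illustrative, not taken from the paper's code.

```python
import numpy as np

def ffn_as_memory(x, K, V):
    """Feed-forward layer viewed as a key-value memory.

    K: (d_mem, d) -- each row is a "key" (first FFN weight matrix).
    V: (d_mem, d) -- each row is a "value" (second FFN weight matrix).
    """
    # Memory coefficients: how strongly the input activates each key.
    coeffs = np.maximum(0.0, x @ K.T)  # ReLU(x . k_i) for each key k_i
    # Output: coefficient-weighted sum of the value vectors.
    return coeffs @ V

rng = np.random.default_rng(0)
d, d_mem = 8, 32                      # model dim, number of memory cells
x = rng.normal(size=d)                # an input representation
K = rng.normal(size=(d_mem, d))       # keys
V = rng.normal(size=(d_mem, d))       # values
y = ffn_as_memory(x, K, V)            # same shape as the input, (d,)
```

Nothing here changes the computation of a standard two-layer FFN; the memory reading is purely a reinterpretation of the matrix products.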

Authors

Mor Geva


Roei Schuster


Jonathan Berant


Omer Levy
