A Multiscale Visualization of Attention in the Transformer Model

Jesse Vig. A Multiscale Visualization of Attention in the Transformer Model. In Marta R. Costa-Jussà, Enrique Alfonseca, editors, Proceedings of the 57th Conference of the Association for Computational Linguistics, ACL 2019, Florence, Italy, July 28 - August 2, 2019, Volume 3: System Demonstrations. pages 37-42, Association for Computational Linguistics, 2019. [doi]

Abstract

Abstract is missing.