Transformers without Tears: Improving the Normalization of Self-Attention

Toan Q. Nguyen, Julian Salazar. Transformers without Tears: Improving the Normalization of Self-Attention. In Jan Niehues, Roldano Cattoni, Sebastian Stüker, Matteo Negri, Marco Turchi, Thanh-Le Ha, Elizabeth Salesky, Ramon Sanabria, Loïc Barrault, Lucia Specia, Marcello Federico, editors, Proceedings of the 16th International Conference on Spoken Language Translation, IWSLT 2019, Hong Kong, November 2-3, 2019. Association for Computational Linguistics, 2019. [doi]

Authors

Toan Q. Nguyen

This author has not been identified. Look up 'Toan Q. Nguyen' in Google

Julian Salazar

This author has not been identified. Look up 'Julian Salazar' in Google