Accelerating Neural Transformer via an Average Attention Network

Deyi Xiong, Biao Zhang 0002, Jinsong Su. Accelerating Neural Transformer via an Average Attention Network. In Iryna Gurevych, Yusuke Miyao, editors, Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, ACL 2018, Melbourne, Australia, July 15-20, 2018, Volume 1: Long Papers. pages 1789-1798, Association for Computational Linguistics, 2018. [doi]

Abstract

Abstract is missing.