Training Deeper Neural Machine Translation Models with Transparent Attention

Ankur Bapna, Mia Xu Chen, Orhan Firat, Yuan Cao, Yonghui Wu. Training Deeper Neural Machine Translation Models with Transparent Attention. In Ellen Riloff, David Chiang 0001, Julia Hockenmaier, Jun'ichi Tsujii, editors, Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31 - November 4, 2018. pages 3028-3033, Association for Computational Linguistics, 2018. [doi]

@inproceedings{BapnaCFCW18,
  title = {Training Deeper Neural Machine Translation Models with Transparent Attention},
  author = {Ankur Bapna and Mia Xu Chen and Orhan Firat and Yuan Cao and Yonghui Wu},
  year = {2018},
  url = {https://aclanthology.info/papers/D18-1338/d18-1338},
  researchr = {https://researchr.org/publication/BapnaCFCW18},
  cites = {0},
  citedby = {0},
  pages = {3028-3033},
  booktitle = {Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31 - November 4, 2018},
  editor = {Ellen Riloff and David Chiang 0001 and Julia Hockenmaier and Jun'ichi Tsujii},
  publisher = {Association for Computational Linguistics},
  isbn = {978-1-948087-84-1},
}