Mask Attention Networks: Rethinking and Strengthen Transformer

Zhihao Fan, Yeyun Gong, Dayiheng Liu, Zhongyu Wei, Siyuan Wang, Jian Jiao 0007, Nan Duan, Ruofei Zhang, Xuanjing Huang. Mask Attention Networks: Rethinking and Strengthen Transformer. In Kristina Toutanova, Anna Rumshisky, Luke Zettlemoyer, Dilek Hakkani-Tür, Iz Beltagy, Steven Bethard, Ryan Cotterell, Tanmoy Chakraborty 0002, Yichao Zhou, editors, Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2021, Online, June 6-11, 2021. pages 1692-1701, Association for Computational Linguistics, 2021.

Authors

Zhihao Fan

Yeyun Gong

Dayiheng Liu

Zhongyu Wei

Siyuan Wang

Jian Jiao 0007

Nan Duan

Ruofei Zhang

Xuanjing Huang
