UniDrop: A Simple yet Effective Technique to Improve Transformer without Extra Cost

Zhen Wu, Lijun Wu, Qi Meng, Yingce Xia, Shufang Xie 0003, Tao Qin, Xinyu Dai, Tie-Yan Liu. UniDrop: A Simple yet Effective Technique to Improve Transformer without Extra Cost. In Kristina Toutanova, Anna Rumshisky, Luke Zettlemoyer, Dilek Hakkani-Tür, Iz Beltagy, Steven Bethard, Ryan Cotterell, Tanmoy Chakraborty 0002, Yichao Zhou, editors, Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2021, Online, June 6-11, 2021. pages 3865-3878, Association for Computational Linguistics, 2021. [doi]

Authors

Zhen Wu

This author has not been identified. Look up 'Zhen Wu' in Google

Lijun Wu

This author has not been identified. Look up 'Lijun Wu' in Google

Qi Meng

This author has not been identified. Look up 'Qi Meng' in Google

Yingce Xia

This author has not been identified. Look up 'Yingce Xia' in Google

Shufang Xie 0003

This author has not been identified. Look up 'Shufang Xie 0003' in Google

Tao Qin

This author has not been identified. Look up 'Tao Qin' in Google

Xinyu Dai

This author has not been identified. Look up 'Xinyu Dai' in Google

Tie-Yan Liu

This author has not been identified. Look up 'Tie-Yan Liu' in Google