Rare Tokens Degenerate All Tokens: Improving Neural Text Generation via Adaptive Gradient Gating for Rare Token Embeddings

Sangwon Yu, Jongyoon Song, Heeseung Kim, Seongmin Lee 0005, Woo-Jong Ryu, Sungroh Yoon. Rare Tokens Degenerate All Tokens: Improving Neural Text Generation via Adaptive Gradient Gating for Rare Token Embeddings. In Smaranda Muresan, Preslav Nakov, Aline Villavicencio, editors, Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2022, Dublin, Ireland, May 22-27, 2022. pages 29-45, Association for Computational Linguistics, 2022. [doi]

Authors

Sangwon Yu

This author has not been identified. Look up 'Sangwon Yu' in Google

Jongyoon Song

This author has not been identified. Look up 'Jongyoon Song' in Google

Heeseung Kim

This author has not been identified. Look up 'Heeseung Kim' in Google

Seongmin Lee 0005

This author has not been identified. Look up 'Seongmin Lee 0005' in Google

Woo-Jong Ryu

This author has not been identified. Look up 'Woo-Jong Ryu' in Google

Sungroh Yoon

This author has not been identified. Look up 'Sungroh Yoon' in Google