StableMoE: Stable Routing Strategy for Mixture of Experts

Damai Dai, Li Dong 0004, Shuming Ma, Bo Zheng, Zhifang Sui, Baobao Chang, Furu Wei. StableMoE: Stable Routing Strategy for Mixture of Experts. In Smaranda Muresan, Preslav Nakov, Aline Villavicencio, editors, Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2022, Dublin, Ireland, May 22-27, 2022. pages 7085-7095, Association for Computational Linguistics, 2022. [doi]

Authors

Damai Dai

This author has not been identified. Look up 'Damai Dai' in Google

Li Dong 0004

This author has not been identified. Look up 'Li Dong 0004' in Google

Shuming Ma

This author has not been identified. Look up 'Shuming Ma' in Google

Bo Zheng

This author has not been identified. Look up 'Bo Zheng' in Google

Zhifang Sui

This author has not been identified. Look up 'Zhifang Sui' in Google

Baobao Chang

This author has not been identified. Look up 'Baobao Chang' in Google

Furu Wei

This author has not been identified. Look up 'Furu Wei' in Google