Sparse Mixture-of-Experts are Domain Generalizable Learners

Bo Li, Yifei Shen, Jingkang Yang, Yezhen Wang, Jiawei Ren, Tong Che, Jun Zhang, Ziwei Liu. Sparse Mixture-of-Experts are Domain Generalizable Learners. In The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. OpenReview.net, 2023.

Authors

Bo Li

Yifei Shen

Jingkang Yang

Yezhen Wang

Jiawei Ren

Tong Che

Jun Zhang

Ziwei Liu
