UniMax: Fairer and More Effective Language Sampling for Large-Scale Multilingual Pretraining

Hyung Won Chung, Xavier Garcia, Adam Roberts, Yi Tay, Orhan Firat, Sharan Narang, Noah Constant. UniMax: Fairer and More Effective Language Sampling for Large-Scale Multilingual Pretraining. In The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. OpenReview.net, 2023. [doi]

Authors

Hyung Won Chung

This author has not been identified. Look up 'Hyung Won Chung' in Google

Xavier Garcia

This author has not been identified. Look up 'Xavier Garcia' in Google

Adam Roberts

This author has not been identified. Look up 'Adam Roberts' in Google

Yi Tay

This author has not been identified. Look up 'Yi Tay' in Google

Orhan Firat

This author has not been identified. Look up 'Orhan Firat' in Google

Sharan Narang

This author has not been identified. Look up 'Sharan Narang' in Google

Noah Constant

This author has not been identified. Look up 'Noah Constant' in Google