Wen Yang, Junhong Wu, Chen Wang, Chengqing Zong, Jiajun Zhang. Language Imbalance Driven Rewarding for Multilingual Self-improving. In The Thirteenth International Conference on Learning Representations, ICLR 2025, Singapore, April 24-28, 2025. OpenReview.net, 2025.