Rethinking Data Mixture for Large Language Models: A Comprehensive Survey and New Perspectives

Yajiao Liu, Congliang Chen, Junchi Yang, Ruoyu Sun 0001. Rethinking Data Mixture for Large Language Models: A Comprehensive Survey and New Perspectives. In Vera Demberg, Kentaro Inui, LluĂ­s Marquez, editors, Findings of the Association for Computational Linguistics: EACL 2026, Rabat, Morocco, March 24-29, 2026. pages 275-289, Association for Computational Linguistics, 2026. [doi]

Abstract

Abstract is missing.