Chenhao Dang, Jing Ma, Mingjie Liao. Holistic Data Scheduler for LLM Pre-training via Multi-Objective Reinforcement Learning. In Srinivasan Parthasarathy 0001, David F. Gleich, Xiangliang Zhang 0001, Wee Hyong Tok, Faisal Farooq, Qi He, Ambuj K. Singh, Haixun Wang, Yan Liu 0002, editors, Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1, KDD 2026, Jeju Island, Korea, August 9-13, 2026. pages 176-187, ACM, 2026. [doi]
Abstract is missing.