Nianli Peng, Muhang Tian, Brandon Fain. Multi-objective Reinforcement Learning with Nonlinear Preferences: Provable Approximation for Maximizing Expected Scalarized Return. In Sanmay Das, Ann Nowé, Yevgeniy Vorobeychik, editors, Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2025, Detroit, MI, USA, May 19-23, 2025. Pages 1632-1640, International Foundation for Autonomous Agents and Multiagent Systems / ACM, 2025.