Efficient Multi-objective Reinforcement Learning via Multiple-gradient Descent with Iteratively Discovered Weight-Vector Sets

Yongcan Cao, Huixin Zhan. Efficient Multi-objective Reinforcement Learning via Multiple-gradient Descent with Iteratively Discovered Weight-Vector Sets. J. Artif. Intell. Res. (JAIR), 70:319-349, 2021. [doi]

Abstract

Abstract is missing.