Joint Optimization of Concave Scalarized Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm

Qinbo Bai, Mridul Agarwal, Vaneet Aggarwal. Joint Optimization of Concave Scalarized Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm. J. Artif. Intell. Res. (JAIR), 74:1565-1597, 2022.