Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality

Shaocong Ma, Ziyi Chen 0002, Yi Zhou 0017, Heng Huang. Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality. Trans. Mach. Learn. Res., 2025, 2025. [doi]

Abstract

Abstract is missing.