Reward Constrained Policy Optimization

Chen Tessler, Daniel J. Mankowitz, Shie Mannor. Reward Constrained Policy Optimization. In 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9, 2019. OpenReview.net, 2019. [doi]

Abstract

Abstract is missing.