4): near-optimal safety-constrained reinforcement learning in polynomial time

David M. Bossens, Nicholas Bishop. 4): near-optimal safety-constrained reinforcement learning in polynomial time. Machine Learning, 112(3):817-858, March 2023. [doi]

Abstract

Abstract is missing.