Towards Fast Safe Online Reinforcement Learning via Policy Finetuning

Keru Chen, Honghao Wei, Zhigang Deng 0001, Sen Lin. Towards Fast Safe Online Reinforcement Learning via Policy Finetuning. Trans. Mach. Learn. Res., 2026, 2026. [doi]

Abstract

Abstract is missing.