Model gradient: unified model and policy learning in model-based reinforcement learning

Chengxing Jia, Fuxiang Zhang, Tian Xu, Jing-Cheng Pang, Zongzhang Zhang, Yang Yu 0001. Model gradient: unified model and policy learning in model-based reinforcement learning. Frontiers of Computer Science in China, 18(4):184339, August 2024. [doi]

Abstract

Abstract is missing.