Model gradient: unified model and policy learning in model-based reinforcement learning

Chengxing Jia, Fuxiang Zhang, Tian Xu, Jing-Cheng Pang, Zongzhang Zhang, Yang Yu 0001. Model gradient: unified model and policy learning in model-based reinforcement learning. Frontiers of Computer Science in China, 18(4):184339, August 2024. [doi]

Authors

Chengxing Jia

This author has not been identified. Look up 'Chengxing Jia' in Google

Fuxiang Zhang

This author has not been identified. Look up 'Fuxiang Zhang' in Google

Tian Xu

This author has not been identified. Look up 'Tian Xu' in Google

Jing-Cheng Pang

This author has not been identified. Look up 'Jing-Cheng Pang' in Google

Zongzhang Zhang

This author has not been identified. Look up 'Zongzhang Zhang' in Google

Yang Yu 0001

This author has not been identified. Look up 'Yang Yu 0001' in Google