An Improved Trust-Region Method for Off-Policy Deep Reinforcement Learning

Hepeng Li, Xiangnan Zhong, Haibo He. An Improved Trust-Region Method for Off-Policy Deep Reinforcement Learning. In International Joint Conference on Neural Networks, IJCNN 2023, Gold Coast, Australia, June 18-23, 2023. pages 1-8, IEEE, 2023. [doi]

Abstract

Abstract is missing.