Episode-Experience Replay Based Tree-Backup Method for Off-Policy Actor-Critic Algorithm

Haobo Jiang, Jianjun Qian, Jin Xie, Jian Yang. Episode-Experience Replay Based Tree-Backup Method for Off-Policy Actor-Critic Algorithm. In Jian-Huang Lai, Cheng-Lin Liu, Xilin Chen, Jie Zhou 0001, Tieniu Tan, Nanning Zheng, Hongbin Zha, editors, Pattern Recognition and Computer Vision - First Chinese Conference, PRCV 2018, Guangzhou, China, November 23-26, 2018, Proceedings, Part I. Volume 11256 of Lecture Notes in Computer Science, pages 562-573, Springer, 2018. [doi]

Abstract

Abstract is missing.