Episode-Experience Replay Based Tree-Backup Method for Off-Policy Actor-Critic Algorithm

Haobo Jiang, Jianjun Qian, Jin Xie, Jian Yang. Episode-Experience Replay Based Tree-Backup Method for Off-Policy Actor-Critic Algorithm. In Jian-Huang Lai, Cheng-Lin Liu, Xilin Chen, Jie Zhou, Tieniu Tan, Nanning Zheng, Hongbin Zha, editors, Pattern Recognition and Computer Vision - First Chinese Conference, PRCV 2018, Guangzhou, China, November 23-26, 2018, Proceedings, Part I. Volume 11256 of Lecture Notes in Computer Science, pages 562-573, Springer, 2018.

Authors

Haobo Jiang

Jianjun Qian

Jin Xie

Jian Yang
