Value Function Dynamic Estimation in Reinforcement Learning based on Data Adequacy

Huifan Gao, Yinghui Pan, Jing Tang 0001, Yifeng Zeng, Peihua Chai, Langcai Cao. Value Function Dynamic Estimation in Reinforcement Learning based on Data Adequacy. In HPCCT & BDAI 2020: 4th High Performance Computing and Cluster Technologies Conference & 3rd International Conference on Big Data and Artificial Intelligence, Qingdao, China, July, 2020. pages 204-208, ACM, 2020. [doi]

Abstract

Abstract is missing.