Huifan Gao, Yinghui Pan, Jing Tang, Yifeng Zeng, Peihua Chai, Langcai Cao. Value Function Dynamic Estimation in Reinforcement Learning based on Data Adequacy. In HPCCT & BDAI 2020: 4th High Performance Computing and Cluster Technologies Conference & 3rd International Conference on Big Data and Artificial Intelligence, Qingdao, China, July 2020. Pages 204-208, ACM, 2020.