Policy Return: A New Method for Reducing the Number of Experimental Trials in Deep Reinforcement Learning

Feng Liu, Shuling Dai, Yongjia Zhao. Policy Return: A New Method for Reducing the Number of Experimental Trials in Deep Reinforcement Learning. IEEE Access, 8:228099-228107, 2020. [doi]

Abstract

Abstract is missing.