Policy Return: A New Method for Reducing the Number of Experimental Trials in Deep Reinforcement Learning

Feng Liu, Shuling Dai, Yongjia Zhao. Policy Return: A New Method for Reducing the Number of Experimental Trials in Deep Reinforcement Learning. IEEE Access, 8:228099-228107, 2020. [doi]

Authors

Feng Liu

This author has not been identified. Look up 'Feng Liu' in Google

Shuling Dai

This author has not been identified. Look up 'Shuling Dai' in Google

Yongjia Zhao

This author has not been identified. Look up 'Yongjia Zhao' in Google