The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning

Lingheng Meng, Rob Gorbet, Dana Kulic. The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning. In 25th International Conference on Pattern Recognition, ICPR 2020, Virtual Event / Milan, Italy, January 10-15, 2021. pages 347-353, IEEE, 2020. [doi]

Abstract

Abstract is missing.