The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning

Lingheng Meng, Rob Gorbet, Dana Kulic. The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning. In 25th International Conference on Pattern Recognition, ICPR 2020, Virtual Event / Milan, Italy, January 10-15, 2021. pages 347-353, IEEE, 2020.