Brett Daley, Martha White, Marlos C. Machado. Averaging n-step Returns Reduces Variance in Reinforcement Learning. In Forty-first International Conference on Machine Learning, ICML 2024, Vienna, Austria, July 21-27, 2024. OpenReview.net, 2024. [doi]
Abstract is missing.