Reward Estimation for Variance Reduction in Deep Reinforcement Learning

Joshua Romoff, Alexandre Piché, Peter Henderson, Vincent François-Lavet, Joelle Pineau. Reward Estimation for Variance Reduction in Deep Reinforcement Learning. In 6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, April 30 - May 3, 2018, Workshop Track Proceedings. OpenReview.net, 2018.
