Reward Estimation for Variance Reduction in Deep Reinforcement Learning

Joshua Romoff, Alexandre Piché, Peter Henderson 0002, Vincent François-Lavet, Joelle Pineau. Reward Estimation for Variance Reduction in Deep Reinforcement Learning. In 6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, April 30 - May 3, 2018, Workshop Track Proceedings. OpenReview.net, 2018. [doi]

Authors

Joshua Romoff

This author has not been identified. Look up 'Joshua Romoff' in Google

Alexandre Piché

This author has not been identified. Look up 'Alexandre Piché' in Google

Peter Henderson 0002

This author has not been identified. Look up 'Peter Henderson 0002' in Google

Vincent François-Lavet

This author has not been identified. Look up 'Vincent François-Lavet' in Google

Joelle Pineau

This author has not been identified. Look up 'Joelle Pineau' in Google