Buy 4 REINFORCE Samples, Get a Baseline for Free!

Wouter Kool, Herke van Hoof, Max Welling. Buy 4 REINFORCE Samples, Get a Baseline for Free!. In Deep Reinforcement Learning Meets Structured Prediction, ICLR 2019 Workshop, New Orleans, Louisiana, United States, May 6, 2019. OpenReview.net, 2019. [doi]

Abstract

Abstract is missing.