Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

Evan Greensmith, Peter L. Bartlett, Jonathan Baxter. Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning. In Thomas G. Dietterich, Suzanna Becker, Zoubin Ghahramani, editors, Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, NIPS 2001, December 3-8, 2001, Vancouver, British Columbia, Canada], pages 1507-1514. MIT Press, 2001.
