Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

Peter L. Bartlett, Jonathan Baxter. Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning. J. Comput. Syst. Sci., 64(1):133-150, 2002. [doi]

Abstract

Abstract is missing.