Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

Peter L. Bartlett, Jonathan Baxter. Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning. J. Comput. Syst. Sci., 64(1):133-150, 2002.

Authors

Peter L. Bartlett

Jonathan Baxter