Peter L. Bartlett, Jonathan Baxter. Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning. J. Comput. Syst. Sci., 64(1):133-150, 2002.