Reinforcement Learning in POMDP s via Direct Gradient Ascent

Jonathan Baxter, Peter L. Bartlett. Reinforcement Learning in POMDP s via Direct Gradient Ascent. In Pat Langley, editor, Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Standord, CA, USA, June 29 - July 2, 2000. pages 41-48, Morgan Kaufmann, 2000.

Abstract

Abstract is missing.