Scalable Internal-State Policy-Gradient Methods for POMDPs

Douglas Aberdeen, Jonathan Baxter. Scalable Internal-State Policy-Gradient Methods for POMDPs. In Claude Sammut, Achim G. Hoffmann, editors, Machine Learning, Proceedings of the Nineteenth International Conference (ICML 2002), University of New South Wales, Sydney, Australia, July 8-12, 2002, pages 3-10. Morgan Kaufmann, 2002.

Authors

Douglas Aberdeen

Jonathan Baxter
