Scalable Internal-State Policy-Gradient Methods for POMDPs

Douglas Aberdeen, Jonathan Baxter. Scalable Internal-State Policy-Gradient Methods for POMDPs. In Claude Sammut, Achim G. Hoffmann, editors, Machine Learning, Proceedings of the Nineteenth International Conference (ICML 2002), University of New South Wales, Sydney, Australia, July 8-12, 2002. Pages 3-10, Morgan Kaufmann, 2002.
