On the Existence of Fixed Points for Q-Learning and Sarsa in Partially Observable Domains

Theodore J. Perkins, Mark D. Pendrith. On the Existence of Fixed Points for Q-Learning and Sarsa in Partially Observable Domains. In Claude Sammut, Achim G. Hoffmann, editors, Machine Learning, Proceedings of the Nineteenth International Conference (ICML 2002), University of New South Wales, Sydney, Australia, July 8-12, 2002. pages 490-497, Morgan Kaufmann, 2002.

Abstract

Abstract is missing.