Theodore J. Perkins, Mark D. Pendrith. On the Existence of Fixed Points for Q-Learning and Sarsa in Partially Observable Domains. In Claude Sammut, Achim G. Hoffmann, editors, Machine Learning, Proceedings of the Nineteenth International Conference (ICML 2002), University of New South Wales, Sydney, Australia, July 8-12, 2002. pages 490-497, Morgan Kaufmann, 2002.
Abstract is missing.