Fixed Points of Approximate Value Iteration and Temporal-Difference Learning

Daniela Pucci de Farias, Benjamin Van Roy. Fixed Points of Approximate Value Iteration and Temporal-Difference Learning. In Pat Langley, editor, Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Standord, CA, USA, June 29 - July 2, 2000. pages 207-214, Morgan Kaufmann, 2000.

Abstract

Abstract is missing.