Fixed Points of Approximate Value Iteration and Temporal-Difference Learning

Daniela Pucci de Farias, Benjamin Van Roy. Fixed Points of Approximate Value Iteration and Temporal-Difference Learning. In Pat Langley, editor, Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29 - July 2, 2000, pages 207-214. Morgan Kaufmann, 2000.

Authors

Daniela Pucci de Farias

Benjamin Van Roy
