Fast gradient-descent methods for temporal-difference learning with linear function approximation

Richard S. Sutton, Hamid Reza Maei, Doina Precup, Shalabh Bhatnagar, David Silver, Csaba Szepesvári, Eric Wiewiora. Fast gradient-descent methods for temporal-difference learning with linear function approximation. In Andrea Pohoreckyj Danyluk, Léon Bottou, Michael L. Littman, editors, Proceedings of the 26th Annual International Conference on Machine Learning, ICML 2009, Montreal, Quebec, Canada, June 14-18, 2009. Volume 382 of ACM International Conference Proceeding Series, pages 125, ACM, 2009. [doi]

Abstract

Abstract is missing.