Fast gradient-descent methods for temporal-difference learning with linear function approximation

Richard S. Sutton, Hamid Reza Maei, Doina Precup, Shalabh Bhatnagar, David Silver, Csaba Szepesvári, Eric Wiewiora. Fast gradient-descent methods for temporal-difference learning with linear function approximation. In Andrea Pohoreckyj Danyluk, Léon Bottou, Michael L. Littman, editors, Proceedings of the 26th Annual International Conference on Machine Learning, ICML 2009, Montreal, Quebec, Canada, June 14-18, 2009. Volume 382 of ACM International Conference Proceeding Series, pages 125, ACM, 2009. [doi]

Authors

Richard S. Sutton

This author has not been identified. Look up 'Richard S. Sutton' in Google

Hamid Reza Maei

This author has not been identified. Look up 'Hamid Reza Maei' in Google

Doina Precup

This author has not been identified. Look up 'Doina Precup' in Google

Shalabh Bhatnagar

This author has not been identified. Look up 'Shalabh Bhatnagar' in Google

David Silver

This author has not been identified. Look up 'David Silver' in Google

Csaba Szepesvári

This author has not been identified. Look up 'Csaba Szepesvári' in Google

Eric Wiewiora

This author has not been identified. Look up 'Eric Wiewiora' in Google