Fast LSTD Using Stochastic Approximation: Finite Time Analysis and Application to Traffic Control

Prashanth L. A., Nathaniel Korda, Rémi Munos. Fast LSTD Using Stochastic Approximation: Finite Time Analysis and Application to Traffic Control. In Toon Calders, Floriana Esposito, Eyke Hüllermeier, Rosa Meo, editors, Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2014, Nancy, France, September 15-19, 2014. Proceedings, Part II. Volume 8725 of Lecture Notes in Computer Science, pages 66-81, Springer, 2014. [doi]

Authors

Prashanth L. A.

This author has not been identified. Look up 'Prashanth L. A.' in Google

Nathaniel Korda

This author has not been identified. Look up 'Nathaniel Korda' in Google

Rémi Munos

This author has not been identified. Look up 'Rémi Munos' in Google