A worst-case comparison between temporal difference and residual gradient with linear function approximation

Lihong Li. A worst-case comparison between temporal difference and residual gradient with linear function approximation. In William W. Cohen, Andrew McCallum, Sam T. Roweis, editors, Machine Learning, Proceedings of the Twenty-Fifth International Conference (ICML 2008), Helsinki, Finland, June 5-9, 2008. Volume 307 of ACM International Conference Proceeding Series, pages 560-567, ACM, 2008. [doi]

Abstract

Abstract is missing.