On the mean-square rate of convergence of temporal-difference learning algorithms

Vladislav B. Tadic. On the mean-square rate of convergence of temporal-difference learning algorithms. In American Control Conference, ACC 2002, Anchorage, Alaska, USA, May 8-10 2002. pages 1454-1459, IEEE, 2002. [doi]

Abstract

Abstract is missing.