On the Asymptotic Behaviour of a Constant Stepsize Temporal-Difference Learning Algorithm

Vladislav Tadic. On the Asymptotic Behaviour of a Constant Stepsize Temporal-Difference Learning Algorithm. In Paul Fischer, Hans-Ulrich Simon, editors, Computational Learning Theory, 4th European Conference, EuroCOLT 99, Nordkirchen, Germany, March 29-31, 1999, Proceedings. Volume 1572 of Lecture Notes in Computer Science, pages 126-137, Springer, 1999.
