True Online Temporal-Difference Learning

Harm van Seijen, Ashique Rupam Mahmood, Patrick M. Pilarski, Marlos C. Machado, Richard S. Sutton. True Online Temporal-Difference Learning. Journal of Machine Learning Research, 17, 2016. [doi]

Abstract

Abstract is missing.