An Emphatic Approach to the Problem of Off-policy Temporal-Difference Learning

Richard S. Sutton, Ashique Rupam Mahmood, Martha White. An Emphatic Approach to the Problem of Off-policy Temporal-Difference Learning. Journal of Machine Learning Research, 17, 2016.
