An Emphatic Approach to the Problem of Off-policy Temporal-Difference Learning

Richard S. Sutton, Ashique Rupam Mahmood, Martha White. An Emphatic Approach to the Problem of Off-policy Temporal-Difference Learning. Journal of Machine Learning Research, 17, 2016. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.