Q(λ) with Off-Policy Corrections

Anna Harutyunyan, Marc G. Bellemare, Tom Stepleton, Rémi Munos. Q(λ) with Off-Policy Corrections. In Ronald Ortner, Hans-Ulrich Simon, Sandra Zilles, editors, Algorithmic Learning Theory - 27th International Conference, ALT 2016, Bari, Italy, October 19-21, 2016, Proceedings. Volume 9925 of Lecture Notes in Computer Science, pages 305-320, 2016.

Authors

Anna Harutyunyan

Marc G. Bellemare

Tom Stepleton

Rémi Munos
