Learning With Options That Terminate Off-Policy

Anna Harutyunyan, Peter Vrancx, Pierre-Luc Bacon, Doina Precup, Ann Nowé. Learning With Options That Terminate Off-Policy. In Sheila A. McIlraith, Kilian Q. Weinberger, editors, Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, New Orleans, Louisiana, USA, February 2-7, 2018. pages 3173-3182, AAAI Press, 2018.
