Learning With Options That Terminate Off-Policy

Anna Harutyunyan, Peter Vrancx, Pierre-Luc Bacon, Doina Precup, Ann Nowé. Learning With Options That Terminate Off-Policy. In Sheila A. McIlraith, Kilian Q. Weinberger, editors, Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, New Orleans, Louisiana, USA, February 2-7, 2018. pages 3173-3182, AAAI Press, 2018. [doi]

@inproceedings{HarutyunyanVBPN18,
  title = {Learning With Options That Terminate Off-Policy},
  author = {Anna Harutyunyan and Peter Vrancx and Pierre-Luc Bacon and Doina Precup and Ann Nowé},
  year = {2018},
  url = {https://www.aaai.org/ocs/index.php/AAAI/AAAI18/paper/view/16907},
  researchr = {https://researchr.org/publication/HarutyunyanVBPN18},
  cites = {0},
  citedby = {0},
  pages = {3173-3182},
  booktitle = {Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, New Orleans, Louisiana, USA, February 2-7, 2018},
  editor = {Sheila A. McIlraith and Kilian Q. Weinberger},
  publisher = {AAAI Press},
}