Off-Policy Evaluation via Off-Policy Classification

Alexander Irpan, Kanishka Rao, Konstantinos Bousmalis, Chris Harris, Julian Ibarz, Sergey Levine. Off-Policy Evaluation via Off-Policy Classification. In Hanna M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d'Alché-Buc, Edward A. Fox, Roman Garnett, editors, Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, 8-14 December 2019, Vancouver, BC, Canada. pages 5438-5449, 2019. [doi]

@inproceedings{IrpanRBHIL19,
  title = {Off-Policy Evaluation via Off-Policy Classification},
  author = {Alexander Irpan and Kanishka Rao and Konstantinos Bousmalis and Chris Harris and Julian Ibarz and Sergey Levine},
  year = {2019},
  url = {http://papers.nips.cc/paper/8783-off-policy-evaluation-via-off-policy-classification},
  researchr = {https://researchr.org/publication/IrpanRBHIL19},
  cites = {0},
  citedby = {0},
  pages = {5438-5449},
  booktitle = {Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, 8-14 December 2019, Vancouver, BC, Canada},
  editor = {Hanna M. Wallach and Hugo Larochelle and Alina Beygelzimer and Florence d'Alché-Buc and Edward A. Fox and Roman Garnett},
}