Confident Off-Policy Evaluation and Selection through Self-Normalized Importance Weighting

Ilja Kuzborskij, Claire Vernade, András György 0001, Csaba Szepesvári. Confident Off-Policy Evaluation and Selection through Self-Normalized Importance Weighting. In Arindam Banerjee 0001, Kenji Fukumizu, editors, The 24th International Conference on Artificial Intelligence and Statistics, AISTATS 2021, April 13-15, 2021, Virtual Event. Volume 130 of Proceedings of Machine Learning Research, pages 640-648, PMLR, 2021. [doi]

@inproceedings{KuzborskijV0S21,
  title = {Confident Off-Policy Evaluation and Selection through Self-Normalized Importance Weighting},
  author = {Ilja Kuzborskij and Claire Vernade and András György 0001 and Csaba Szepesvári},
  year = {2021},
  url = {http://proceedings.mlr.press/v130/kuzborskij21a.html},
  researchr = {https://researchr.org/publication/KuzborskijV0S21},
  cites = {0},
  citedby = {0},
  pages = {640-648},
  booktitle = {The 24th International Conference on Artificial Intelligence and Statistics, AISTATS 2021, April 13-15, 2021, Virtual Event},
  editor = {Arindam Banerjee 0001 and Kenji Fukumizu},
  volume = {130},
  series = {Proceedings of Machine Learning Research},
  publisher = {PMLR},
}