On the Design of Estimators for Bandit Off-Policy Evaluation

Nikos Vlassis, Aurélien Bibaut, Maria Dimakopoulou, Tony Jebara. On the Design of Estimators for Bandit Off-Policy Evaluation. In Kamalika Chaudhuri, Ruslan Salakhutdinov, editors, Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA. Volume 97 of Proceedings of Machine Learning Research, pages 6468-6476, PMLR, 2019.

Abstract

Abstract is missing.