On the Design of Estimators for Bandit Off-Policy Evaluation

Nikos Vlassis, Aurélien Bibaut, Maria Dimakopoulou, Tony Jebara. On the Design of Estimators for Bandit Off-Policy Evaluation. In Kamalika Chaudhuri, Ruslan Salakhutdinov, editors, Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA. Volume 97 of Proceedings of Machine Learning Research, pages 6468-6476, PMLR, 2019. [doi]

Authors

Nikos Vlassis

This author has not been identified. Look up 'Nikos Vlassis' in Google

Aurélien Bibaut

This author has not been identified. Look up 'Aurélien Bibaut' in Google

Maria Dimakopoulou

This author has not been identified. Look up 'Maria Dimakopoulou' in Google

Tony Jebara

This author has not been identified. Look up 'Tony Jebara' in Google