Counterfactual Risk Minimization: Learning from Logged Bandit Feedback

Adith Swaminathan, Thorsten Joachims. Counterfactual Risk Minimization: Learning from Logged Bandit Feedback. In Francis R. Bach, David M. Blei, editors, Proceedings of the 32nd International Conference on Machine Learning, ICML 2015, Lille, France, 6-11 July 2015. Volume 37 of JMLR Proceedings, pages 814-823, JMLR.org, 2015. [doi]

Abstract

Abstract is missing.