Counterfactual Risk Minimization: Learning from Logged Bandit Feedback - researchr publication

researchr

You are not signed in
Sign in
Sign up

Adith Swaminathan, Thorsten Joachims. Counterfactual Risk Minimization: Learning from Logged Bandit Feedback. In Francis R. Bach, David M. Blei, editors, Proceedings of the 32nd International Conference on Machine Learning, ICML 2015, Lille, France, 6-11 July 2015. Volume 37 of JMLR Proceedings, pages 814-823, JMLR.org, 2015. [doi]

Abstract is missing.

runs on WebDSL