Researchr is a web site for finding, collecting, sharing, and reviewing scientific publications, for researchers by researchers.
Sign up for an account to create a profile with publication list, tag and review your related work, and share bibliographies with your co-authors.
Adith Swaminathan, Thorsten Joachims. Batch learning from logged bandit feedback through counterfactual risk minimization. Journal of Machine Learning Research, 16:1731-1755, 2015. [doi]
Possibly Related PublicationsThe following publications are possibly variants of this publication: Counterfactual Risk Minimization: Learning from Logged Bandit FeedbackAdith Swaminathan, Thorsten Joachims. icml 2015: 814-823 [doi] Counterfactual evaluation and learning from logged user feedbackAdith Swaminathan. PhD thesis, Cornell University, USA, 2017. Variance-Minimizing Augmentation Logging for Counterfactual Evaluation in Contextual BanditsAaron David Tucker, Thorsten Joachims. wsdm 2023: 967-975 [doi] Deep Learning with Logged Bandit FeedbackThorsten Joachims, Adith Swaminathan, Maarten de Rijke. iclr 2018: [doi]
The following publications are possibly variants of this publication: