Batch learning from logged bandit feedback through counterfactual risk minimization

Adith Swaminathan, Thorsten Joachims. Batch learning from logged bandit feedback through counterfactual risk minimization. Journal of Machine Learning Research, 16:1731-1755, 2015.

No references recorded for this publication.

No citations of this publication recorded.
