Batch learning from logged bandit feedback through counterfactual risk minimization - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

Adith Swaminathan, Thorsten Joachims. Batch learning from logged bandit feedback through counterfactual risk minimization. Journal of Machine Learning Research, 16:1731-1755, 2015. [doi]

This author has not been identified. Look up 'Adith Swaminathan' in GoogleThis author has not been identified. Look up 'Thorsten Joachims' in Google

runs on WebDSL