Contextual Multi-armed Bandit Algorithm for Semiparametric Reward Model - researchr publication

researchr

You are not signed in
Sign in
Sign up

Gi-Soo Kim, Myunghee Cho Paik. Contextual Multi-armed Bandit Algorithm for Semiparametric Reward Model. In Kamalika Chaudhuri, Ruslan Salakhutdinov, editors, Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA. Volume 97 of Proceedings of Machine Learning Research, pages 3389-3397, PMLR, 2019. [doi]

Abstract is missing.

runs on WebDSL