Contextual Multi-armed Bandit Algorithm for Semiparametric Reward Model

Gi-Soo Kim, Myunghee Cho Paik. Contextual Multi-armed Bandit Algorithm for Semiparametric Reward Model. In Kamalika Chaudhuri, Ruslan Salakhutdinov, editors, Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA. Volume 97 of Proceedings of Machine Learning Research, pages 3389-3397, PMLR, 2019. [doi]

Abstract

Abstract is missing.