OSOM: A simultaneously optimal algorithm for multi-armed and linear contextual bandits

Niladri S. Chatterji, Vidya Muthukumar, Peter L. Bartlett. OSOM: A simultaneously optimal algorithm for multi-armed and linear contextual bandits. In Silvia Chiappa, Roberto Calandra, editors, The 23rd International Conference on Artificial Intelligence and Statistics, AISTATS 2020, 26-28 August 2020, Online [Palermo, Sicily, Italy]. Volume 108 of Proceedings of Machine Learning Research, pages 1844-1854, PMLR, 2020.
