Old Dog Learns New Tricks: Randomized UCB for Bandit Problems

Sharan Vaswani, Abbas Mehrabian, Audrey Durand, Branislav Kveton. Old Dog Learns New Tricks: Randomized UCB for Bandit Problems. In Silvia Chiappa, Roberto Calandra, editors, The 23rd International Conference on Artificial Intelligence and Statistics, AISTATS 2020, 26-28 August 2020, Online [Palermo, Sicily, Italy]. Volume 108 of Proceedings of Machine Learning Research, pages 1988-1998, PMLR, 2020. [doi]

Abstract

Abstract is missing.