Tsallis-INF for Decoupled Exploration and Exploitation in Multi-armed Bandits

ChloƩ Rouyer, Yevgeny Seldin. Tsallis-INF for Decoupled Exploration and Exploitation in Multi-armed Bandits. In Jacob D. Abernethy, Shivani Agarwal 0001, editors, Conference on Learning Theory, COLT 2020, 9-12 July 2020, Virtual Event [Graz, Austria]. Volume 125 of Proceedings of Machine Learning Research, pages 3227-3249, PMLR, 2020. [doi]

Abstract

Abstract is missing.