Best-of-Three-Worlds Linear Bandit Algorithm with Variance-Adaptive Regret Bounds

Shinji Ito, Kei Takemura. Best-of-Three-Worlds Linear Bandit Algorithm with Variance-Adaptive Regret Bounds. In Gergely Neu, Lorenzo Rosasco, editors, The Thirty Sixth Annual Conference on Learning Theory, 12-15 July 2023, Bangalore, India. Volume 195 of Proceedings of Machine Learning Research, pages 2653-2677, PMLR, 2023. [doi]

Abstract

Abstract is missing.