A Thompson Sampling Algorithm With Logarithmic Regret for Unimodal Gaussian Bandit

Long Yang, Zhao Li 0007, Zehong Hu, Shasha Ruan, Gang Pan 0001. A Thompson Sampling Algorithm With Logarithmic Regret for Unimodal Gaussian Bandit. IEEE Transactions on Neural Networks, 34(9):5332-5341, September 2023. [doi]

Abstract

Abstract is missing.