Self-accelerated Thompson sampling with near-optimal regret upper bound

Zhenyu Zhu, Liusheng Huang, Hongli Xu. Self-accelerated Thompson sampling with near-optimal regret upper bound. Neurocomputing, 399:37-47, 2020.
