Gayathri R. Prabhu, Srikrishna Bhashyam, Aditya Gopalan, Rajesh Sundaresan. Sequential Multi-Hypothesis Testing in Multi-Armed Bandit Problems: An Approach for Asymptotic Optimality. IEEE Transactions on Information Theory, 68(7):4790-4817, 2022.