An Optimal Algorithm for the Stochastic Bandits with Knowing Near-optimal Mean Reward

Shangdong Yang, Hao Wang, Yang Gao, Xingguo Chen. An Optimal Algorithm for the Stochastic Bandits with Knowing Near-optimal Mean Reward. In Elisabeth André, Sven Koenig, Mehdi Dastani, Gita Sukthankar, editors, Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, AAMAS 2018, Stockholm, Sweden, July 10-15, 2018. pages 2130-2132, International Foundation for Autonomous Agents and Multiagent Systems Richland, SC, USA / ACM, 2018. [doi]

Abstract

Abstract is missing.