An Optimal Algorithm for the Stochastic Bandits with Knowing Near-optimal Mean Reward

Shangdong Yang, Hao Wang, Yang Gao, Xingguo Chen. An Optimal Algorithm for the Stochastic Bandits with Knowing Near-optimal Mean Reward. In Elisabeth André, Sven Koenig, Mehdi Dastani, Gita Sukthankar, editors, Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, AAMAS 2018, Stockholm, Sweden, July 10-15, 2018. pages 2130-2132, International Foundation for Autonomous Agents and Multiagent Systems Richland, SC, USA / ACM, 2018. [doi]

Authors

Shangdong Yang

This author has not been identified. Look up 'Shangdong Yang' in Google

Hao Wang

This author has not been identified. Look up 'Hao Wang' in Google

Yang Gao

This author has not been identified. Look up 'Yang Gao' in Google

Xingguo Chen

This author has not been identified. Look up 'Xingguo Chen' in Google