An Optimal Algorithm for the Stochastic Bandits with Knowing Near-optimal Mean Reward - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

Shangdong Yang, Hao Wang, Yang Gao, Xingguo Chen. An Optimal Algorithm for the Stochastic Bandits with Knowing Near-optimal Mean Reward. In Elisabeth André, Sven Koenig, Mehdi Dastani, Gita Sukthankar, editors, Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, AAMAS 2018, Stockholm, Sweden, July 10-15, 2018. pages 2130-2132, International Foundation for Autonomous Agents and Multiagent Systems Richland, SC, USA / ACM, 2018. [doi]

This author has not been identified. Look up 'Shangdong Yang' in GoogleThis author has not been identified. Look up 'Hao Wang' in GoogleThis author has not been identified. Look up 'Yang Gao' in GoogleThis author has not been identified. Look up 'Xingguo Chen' in Google

runs on WebDSL