Best arm identification in multi-armed bandits with delayed feedback

Aditya Grover, Todor Markov, Peter Attia, Norman Jin, Nicolas Perkins, Bryan Cheong, Michael Chen, Zi Yang, Stephen Harris, William Chueh, Stefano Ermon. Best arm identification in multi-armed bandits with delayed feedback. In Amos J. Storkey, Fernando Pérez-Cruz, editors, International Conference on Artificial Intelligence and Statistics, AISTATS 2018, 9-11 April 2018, Playa Blanca, Lanzarote, Canary Islands, Spain. Volume 84 of Proceedings of Machine Learning Research, pages 833-842, PMLR, 2018. [doi]

Abstract

Abstract is missing.