Stochastic Regret Minimization via Thompson Sampling

Sudipto Guha, Kamesh Munagala. Stochastic Regret Minimization via Thompson Sampling. In Maria-Florina Balcan, Csaba Szepesvári, editors, Proceedings of The 27th Conference on Learning Theory, COLT 2014, Barcelona, Spain, June 13-15, 2014. Volume 35 of JMLR Proceedings, pages 317-338, JMLR.org, 2014. [doi]

Abstract

Abstract is missing.