Thompson Sampling for Contextual Bandits with Linear Payoffs

Shipra Agrawal, Navin Goyal. Thompson Sampling for Contextual Bandits with Linear Payoffs. In Proceedings of the 30th International Conference on Machine Learning, ICML 2013, Atlanta, GA, USA, 16-21 June 2013. Volume 28 of JMLR Proceedings, pages 127-135, JMLR.org, 2013. [doi]

Abstract

Abstract is missing.