Shipra Agrawal, Navin Goyal. Thompson Sampling for Contextual Bandits with Linear Payoffs. In Proceedings of the 30th International Conference on Machine Learning, ICML 2013, Atlanta, GA, USA, 16-21 June 2013. Volume 28 of JMLR Proceedings, pages 127-135, JMLR.org, 2013.