Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems

Sébastien Bubeck, Nicolò Cesa-Bianchi. Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems. Foundations and Trends in Machine Learning, 5(1):1-122, 2012. [doi]

Authors

Sébastien Bubeck

This author has not been identified. Look up 'Sébastien Bubeck' in Google

Nicolò Cesa-Bianchi

This author has not been identified. Look up 'Nicolò Cesa-Bianchi' in Google