A Finite-Time Analysis of Multi-armed Bandits Problems with Kullback-Leibler Divergences

Odalric-Ambrym Maillard, Rémi Munos, Gilles Stoltz. A Finite-Time Analysis of Multi-armed Bandits Problems with Kullback-Leibler Divergences. Journal of Machine Learning Research, 19:497-514, 2011.
