A simple multi-armed bandit algorithm with optimal variation-bounded regret

Elad Hazan, Satyen Kale. A simple multi-armed bandit algorithm with optimal variation-bounded regret. Journal of Machine Learning Research, 19:817-820, 2011. [doi]

Abstract

Abstract is missing.