A simple multi-armed bandit algorithm with optimal variation-bounded regret

Elad Hazan, Satyen Kale. A simple multi-armed bandit algorithm with optimal variation-bounded regret. Journal of Machine Learning Research, 19:817-820, 2011. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.