A simple multi-armed bandit algorithm with optimal variation-bounded regret - researchr publication references

researchr

You are not signed in
Sign in
Sign up

Elad Hazan, Satyen Kale. A simple multi-armed bandit algorithm with optimal variation-bounded regret. Journal of Machine Learning Research, 19:817-820, 2011. [doi]

No references recorded for this publication.

No citations of this publication recorded.

runs on WebDSL