An asymptotically optimal policy for finite support models in the multiarmed bandit problem - researchr publication related

researchr

You are not signed in
Sign in
Sign up

Junya Honda, Akimichi Takemura. An asymptotically optimal policy for finite support models in the multiarmed bandit problem. Machine Learning, 85(3):361-391, 2011. [doi]

The following publications are possibly variants of this publication:

An Asymptotically Optimal Bandit Algorithm for Bounded Support ModelsJunya Honda, Akimichi Takemura. colt 2010: 67-79 [doi]

runs on WebDSL