Sandjai Bhulai, Ger Koole. On the value of learning for Bernoulli bandits with unknown parameters. In 39th IEEE Conference on Decision and Control, CDC 2000, Sydney, Australia, December 12-15, 2000. pages 736-741, IEEE, 2000. [doi]
Abstract is missing.