Adaptive Bandits: Towards the best history-dependent strategy

Odalric-Ambrym Maillard, RĂ©mi Munos. Adaptive Bandits: Towards the best history-dependent strategy. Journal of Machine Learning Research, 15:570-578, 2011. [doi]

Abstract

Abstract is missing.