Nonstochastic bandits: Countable decision set, unbounded costs and reactive environments

Jan Poland. Nonstochastic bandits: Countable decision set, unbounded costs and reactive environments. Theoretical Computer Science, 397(1-3):77-93, 2008.
