Automatic Discovery of Ranking Formulas for Playing with Multi-armed Bandits

Francis Maes, Louis Wehenkel, Damien Ernst. Automatic Discovery of Ranking Formulas for Playing with Multi-armed Bandits. In Scott Sanner, Marcus Hutter, editors, Recent Advances in Reinforcement Learning - 9th European Workshop, EWRL 2011, Athens, Greece, September 9-11, 2011, Revised Selected Papers. Volume 7188 of Lecture Notes in Computer Science, pages 5-17, Springer, 2011. [doi]

Abstract

Abstract is missing.