Infinite Horizon Multi-armed Bandits with Reward Vectors: Exploration/Exploitation Trade-off

Madalina M. Drugan. Infinite Horizon Multi-armed Bandits with Reward Vectors: Exploration/Exploitation Trade-off. In Béatrice Duval, H. Jaap van den Herik, Stéphane Loiseau, Joaquim Filipe, editors, Agents and Artificial Intelligence - 7th International Conference, ICAART 2015, Lisbon, Portugal, January 10-12, 2015, Revised Selected Papers. Volume 9494 of Lecture Notes in Computer Science, pages 128-144, Springer, 2015. [doi]

Abstract

Abstract is missing.