Tuning continual exploration in reinforcement learning: An optimality property of the Boltzmann strategy

Youssef Achbany, François Fouss, Luh Yen, Alain Pirotte, Marco Saerens. Tuning continual exploration in reinforcement learning: An optimality property of the Boltzmann strategy. Neurocomputing, 71(13-15):2507-2520, 2008. [doi]

Abstract

Abstract is missing.