Adaptive playouts for online learning of policies during Monte Carlo Tree Search

Tobias Graf, Marco Platzner. Adaptive playouts for online learning of policies during Monte Carlo Tree Search. Theoretical Computer Science, 644:53-62, 2016. [doi]

Abstract

Abstract is missing.