Adaptive playouts for online learning of policies during Monte Carlo Tree Search

Tobias Graf, Marco Platzner. Adaptive playouts for online learning of policies during Monte Carlo Tree Search. Theoretical Computer Science, 644:53-62, 2016.
