Adaptive playouts for online learning of policies during Monte Carlo Tree Search - researchr publication related

researchr

You are not signed in
Sign in
Sign up

Tobias Graf, Marco Platzner. Adaptive playouts for online learning of policies during Monte Carlo Tree Search. Theoretical Computer Science, 644:53-62, 2016. [doi]

The following publications are possibly variants of this publication:

Adaptive Playouts in Monte-Carlo Tree Search with Policy-Gradient Reinforcement LearningTobias Graf, Marco Platzner. acg 2015: 1-11 [doi]

runs on WebDSL