Learning Policies from Self-Play with Policy Gradients and MCTS Value Estimates - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

Dennis J. N. J. Soemers, Éric Piette, Matthew Stephenson, Cameron Browne. Learning Policies from Self-Play with Policy Gradients and MCTS Value Estimates. In IEEE Conference on Games, CoG 2019, London, United Kingdom, August 20-23, 2019. pages 1-8, IEEE, 2019. [doi]

This author has not been identified. Look up 'Dennis J. N. J. Soemers' in GoogleThis author has not been identified. Look up 'Éric Piette' in GoogleThis author has not been identified. Look up 'Matthew Stephenson' in GoogleThis author has not been identified. Look up 'Cameron Browne' in Google

runs on WebDSL