Learning Policies from Self-Play with Policy Gradients and MCTS Value Estimates - researchr publication references

researchr

You are not signed in
Sign in
Sign up

Dennis J. N. J. Soemers, Éric Piette, Matthew Stephenson, Cameron Browne. Learning Policies from Self-Play with Policy Gradients and MCTS Value Estimates. In IEEE Conference on Games, CoG 2019, London, United Kingdom, August 20-23, 2019. pages 1-8, IEEE, 2019. [doi]

No references recorded for this publication.

No citations of this publication recorded.

runs on WebDSL