Reducing Sampling Error in Policy Gradient Learning - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

Josiah P. Hanna, Peter Stone. Reducing Sampling Error in Policy Gradient Learning. In Edith Elkind, Manuela Veloso, Noa Agmon, Matthew E. Taylor, editors, Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, AAMAS '19, Montreal, QC, Canada, May 13-17, 2019. pages 1016-1024, International Foundation for Autonomous Agents and Multiagent Systems, 2019. [doi]

This author has not been identified. Look up 'Josiah P. Hanna' in GoogleThis author has not been identified. Look up 'Peter Stone' in Google

runs on WebDSL