Neural Policy Gradient Methods: Global Optimality and Rates of Convergence

Lingxiao Wang, Qi Cai, Zhuoran Yang, Zhaoran Wang. Neural Policy Gradient Methods: Global Optimality and Rates of Convergence. In 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26-30, 2020. OpenReview.net, 2020.
