Optimality and Approximation with Policy Gradient Methods in Markov Decision Processes - researchr publication references

Alekh Agarwal, Sham M. Kakade, Jason D. Lee, Gaurav Mahajan. Optimality and Approximation with Policy Gradient Methods in Markov Decision Processes. In Jacob D. Abernethy, Shivani Agarwal, editors, Conference on Learning Theory, COLT 2020, 9-12 July 2020, Virtual Event [Graz, Austria]. Volume 125 of Proceedings of Machine Learning Research, pages 64-66, PMLR, 2020.

No references recorded for this publication.

No citations of this publication recorded.
