Sample Efficient Policy Gradient Methods with Recursive Variance Reduction

Pan Xu 0002, Felicia Gao, Quanquan Gu. Sample Efficient Policy Gradient Methods with Recursive Variance Reduction. In 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26-30, 2020. OpenReview.net, 2020. [doi]

Abstract

Abstract is missing.