Regularized Policy Gradients: Direct Variance Reduction in Policy Gradient Estimation

Tingting Zhao, Gang Niu, Ning Xie 0003, Jucheng Yang, Masashi Sugiyama. Regularized Policy Gradients: Direct Variance Reduction in Policy Gradient Estimation. In Proceedings of The 7th Asian Conference on Machine Learning, ACML 2015, Hong Kong, November 20-22, 2015. Volume 45 of JMLR Workshop and Conference Proceedings, pages 333-348, JMLR.org, 2015. [doi]

Abstract

Abstract is missing.