Regularized Policy Gradients: Direct Variance Reduction in Policy Gradient Estimation

Tingting Zhao, Gang Niu, Ning Xie 0003, Jucheng Yang, Masashi Sugiyama. Regularized Policy Gradients: Direct Variance Reduction in Policy Gradient Estimation. In Proceedings of The 7th Asian Conference on Machine Learning, ACML 2015, Hong Kong, November 20-22, 2015. Volume 45 of JMLR Workshop and Conference Proceedings, pages 333-348, JMLR.org, 2015.

Authors

Tingting Zhao

Gang Niu

Ning Xie 0003

Jucheng Yang

Masashi Sugiyama