Analysis and Improvement of Policy Gradient Estimation

Tingting Zhao, Hirotaka Hachiya, Gang Niu, Masashi Sugiyama. Analysis and Improvement of Policy Gradient Estimation. In John Shawe-Taylor, Richard S. Zemel, Peter L. Bartlett, Fernando C. N. Pereira, Kilian Q. Weinberger, editors, Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, Granada, Spain. pages 262-270, 2011.

Abstract

Abstract is missing.