An Improved Analysis of (Variance-Reduced) Policy Gradient and Natural Policy Gradient Methods

Yanli Liu 0003, Kaiqing Zhang, Tamer Basar, Wotao Yin. An Improved Analysis of (Variance-Reduced) Policy Gradient and Natural Policy Gradient Methods. In Hugo Larochelle, Marc'Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, Hsuan-Tien Lin, editors, Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual. 2020. [doi]

Authors

Yanli Liu 0003

This author has not been identified. Look up 'Yanli Liu 0003' in Google

Kaiqing Zhang

This author has not been identified. Look up 'Kaiqing Zhang' in Google

Tamer Basar

This author has not been identified. Look up 'Tamer Basar' in Google

Wotao Yin

This author has not been identified. Look up 'Wotao Yin' in Google