PAGE-PG: A Simple and Loopless Variance-Reduced Policy Gradient Method with Probabilistic Gradient Estimation

Matilde Gargiani, Andrea Zanelli, Andrea Martinelli, Tyler H. Summers, John Lygeros. PAGE-PG: A Simple and Loopless Variance-Reduced Policy Gradient Method with Probabilistic Gradient Estimation. In Kamalika Chaudhuri, Stefanie Jegelka, Le Song, Csaba Szepesvári, Gang Niu 0001, Sivan Sabato, editors, International Conference on Machine Learning, ICML 2022, 17-23 July 2022, Baltimore, Maryland, USA. Volume 162 of Proceedings of Machine Learning Research, pages 7223-7240, PMLR, 2022. [doi]

Authors

Matilde Gargiani

This author has not been identified. Look up 'Matilde Gargiani' in Google

Andrea Zanelli

This author has not been identified. Look up 'Andrea Zanelli' in Google

Andrea Martinelli

This author has not been identified. Look up 'Andrea Martinelli' in Google

Tyler H. Summers

This author has not been identified. Look up 'Tyler H. Summers' in Google

John Lygeros

This author has not been identified. Look up 'John Lygeros' in Google