A Temporal-Difference Approach to Policy Gradient Estimation

Samuele Tosatto, Andrew Patterson, Martha White, Rupam Mahmood. A Temporal-Difference Approach to Policy Gradient Estimation. In Kamalika Chaudhuri, Stefanie Jegelka, Le Song, Csaba Szepesvári, Gang Niu, Sivan Sabato, editors, International Conference on Machine Learning, ICML 2022, 17-23 July 2022, Baltimore, Maryland, USA. Volume 162 of Proceedings of Machine Learning Research, pages 21609-21632, PMLR, 2022.
