Sample Complexity of Estimating the Policy Gradient for Nearly Deterministic Dynamical Systems

Osbert Bastani. Sample Complexity of Estimating the Policy Gradient for Nearly Deterministic Dynamical Systems. In Silvia Chiappa, Roberto Calandra, editors, The 23rd International Conference on Artificial Intelligence and Statistics, AISTATS 2020, 26-28 August 2020, Online [Palermo, Sicily, Italy]. Volume 108 of Proceedings of Machine Learning Research, pages 3858-3869, PMLR, 2020. [doi]

Abstract

Abstract is missing.