A finite-sample analysis of multi-step temporal difference estimates

Yaqi Duan, Martin J. Wainwright. A finite-sample analysis of multi-step temporal difference estimates. In Nikolai Matni, Manfred Morari, George J. Pappas, editors, Learning for Dynamics and Control Conference, L4DC 2023, 15-16 June 2023, Philadelphia, PA, USA. Volume 211 of Proceedings of Machine Learning Research, pages 612-624, PMLR, 2023.
