Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation - researchr publication

researchr

You are not signed in
Sign in
Sign up

Gandharv Patil, Prashanth L. A., Dheeraj Nagaraj, Doina Precup. Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation. In Francisco J. R. Ruiz, Jennifer G. Dy, Jan-Willem van de Meent, editors, International Conference on Artificial Intelligence and Statistics, 25-27 April 2023, Palau de Congressos, Valencia, Spain. Volume 206 of Proceedings of Machine Learning Research, pages 5438-5448, PMLR, 2023. [doi]

Abstract is missing.

runs on WebDSL