Modified Retrace for Off-Policy Temporal Difference Learning

Xingguo Chen, Xingzhou Ma, Yang Li, Guang Yang, Shangdong Yang, Yang Gao. Modified Retrace for Off-Policy Temporal Difference Learning. In Robin J. Evans, Ilya Shpitser, editors, Uncertainty in Artificial Intelligence, UAI 2023, July 31 - August 4, 2023, Pittsburgh, PA, USA. Volume 216 of Proceedings of Machine Learning Research, pages 303-312, PMLR, 2023.
