Policy Evaluation in Continuous MDPs With Efficient Kernelized Gradient Temporal Difference

Alec Koppel, Garrett Warnell, Ethan Stump, Peter Stone, Alejandro Ribeiro. Policy Evaluation in Continuous MDPs With Efficient Kernelized Gradient Temporal Difference. IEEE Trans. Automat. Contr., 66(4):1856-1863, 2021.

Abstract

Abstract is missing.