On-Line Policy Gradient Estimation with Multi-Step Sampling

Yan-Jie Li, Fang Cao, Xi-Ren Cao. On-Line Policy Gradient Estimation with Multi-Step Sampling. Discrete Event Dynamic Systems, 20(1):3-17, 2010. [doi]

Abstract

Abstract is missing.