On-Line Policy Gradient Estimation with Multi-Step Sampling

Yan-Jie Li, Fang Cao, Xi-Ren Cao. On-Line Policy Gradient Estimation with Multi-Step Sampling. Discrete Event Dynamic Systems, 20(1):3-17, 2010. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.