Huizhen Yu. A Function Approximation Approach to Estimation of Policy Gradient for POMDP with Structured Policies. In UAI 05, Proceedings of the 21st Conference in Uncertainty in Artificial Intelligence, July 26-29 2005, Edinburgh, Scotland. pages 642-657, AUAI Press, 2005. [doi]
Abstract is missing.