Approximate Gradient Methods in Policy-Space Optimization of Markov Reward Processes

Peter Marbach, John N. Tsitsiklis. Approximate Gradient Methods in Policy-Space Optimization of Markov Reward Processes. Discrete Event Dynamic Systems, 13(1-2):111-148, 2003.
