Approximate Gradient Methods in Policy-Space Optimization of Markov Reward Processes

Peter Marbach, John N. Tsitsiklis. Approximate Gradient Methods in Policy-Space Optimization of Markov Reward Processes. Discrete Event Dynamic Systems, 13(1-2):111-148, 2003. [doi]

Authors

Peter Marbach

This author has not been identified. Look up 'Peter Marbach' in Google

John N. Tsitsiklis

This author has not been identified. Look up 'John N. Tsitsiklis' in Google