Policy gradient stochastic approximation algorithms for adaptive control of constrained time varying Markov decision processes - researchr publication

researchr

You are not signed in
Sign in
Sign up

Felisa J. Vázquez-Abad, Vikram Krishnamurthy. Policy gradient stochastic approximation algorithms for adaptive control of constrained time varying Markov decision processes. In 42nd IEEE Conference on Decision and Control, CDC 2003, Maui, Hawaii, USA, December 9-12, 2003. pages 2823-2828, IEEE, 2003. [doi]

Abstract is missing.

runs on WebDSL