Felisa J. Vázquez-Abad, Vikram Krishnamurthy. Policy gradient stochastic approximation algorithms for adaptive control of constrained time varying Markov decision processes. In 42nd IEEE Conference on Decision and Control, CDC 2003, Maui, Hawaii, USA, December 9-12, 2003. pages 2823-2828, IEEE, 2003. [doi]
No references recorded for this publication.
No citations of this publication recorded.