An Online Actor-Critic Algorithm with Function Approximation for Constrained Markov Decision Processes

Shalabh Bhatnagar, K. Lakshmanan. An Online Actor-Critic Algorithm with Function Approximation for Constrained Markov Decision Processes. J. Optimization Theory and Applications, 153(3):688-708, 2012. [doi]

Abstract

Abstract is missing.