A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients

Ivo Grondman, Lucian Busoniu, Gabriel A. D. Lopes, Robert Babuska. A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients. IEEE Transactions on Systems, Man, and Cybernetics, Part A, 42(6):1291-1307, 2012. [doi]

Abstract

Abstract is missing.