Policy Gradient Reinforcement Learning with Environmental Dynamics and Action-Values in Policies

Seiji Ishihara, Harukazu Igarashi. Policy Gradient Reinforcement Learning with Environmental Dynamics and Action-Values in Policies. In Andreas König, Andreas Dengel, Knut Hinkelmann, Koichi Kise, Robert J. Howlett, Lakhmi C. Jain, editors, Knowledge-Based and Intelligent Information and Engineering Systems - 15th International Conference, KES 2011, Kaiserslautern, Germany, September 12-14, 2011, Proceedings, Part I. Volume 6881 of Lecture Notes in Computer Science, pages 120-130, Springer, 2011. [doi]

Abstract

Abstract is missing.