An Off-Policy Natural Policy Gradient Method for a Partial Observable Markov Decision Process

Yutaka Nakamura, Takeshi Mori, Shin Ishii. An Off-Policy Natural Policy Gradient Method for a Partial Observable Markov Decision Process. In Wlodzislaw Duch, Janusz Kacprzyk, Erkki Oja, Slawomir Zadrozny, editors, Artificial Neural Networks: Formal Models and Their Applications - ICANN 2005, 15th International Conference, Warsaw, Poland, September 11-15, 2005, Proceedings, Part II. Volume 3697 of Lecture Notes in Computer Science, pages 431-436, Springer, 2005.
