Reducing policy degradation in neuro-dynamic programming

Thomas Gabel, Martin Riedmiller. Reducing policy degradation in neuro-dynamic programming. In ESANN 2006, 14th European Symposium on Artificial Neural Networks, Bruges, Belgium, April 26-28, 2006, Proceedings. Pages 653-658, 2006.
