Neural Rewards Regression for near-optimal policy identification in Markovian and partial observable environments

Daniel Schneegaß, Steffen Udluft, Thomas Martinetz. Neural Rewards Regression for near-optimal policy identification in Markovian and partial observable environments. In ESANN 2007, 15th European Symposium on Artificial Neural Networks, Bruges, Belgium, April 25-27, 2007, Proceedings. pages 301-306, 2007. [doi]

Abstract

Abstract is missing.