Neural Rewards Regression for near-optimal policy identification in Markovian and partial observable environments

Daniel Schneegaß, Steffen Udluft, Thomas Martinetz. Neural Rewards Regression for near-optimal policy identification in Markovian and partial observable environments. In ESANN 2007, 15th European Symposium on Artificial Neural Networks, Bruges, Belgium, April 25-27, 2007, Proceedings. pages 301-306, 2007. [doi]

@inproceedings{SchneegassUM07,
  title = {Neural Rewards Regression for near-optimal policy identification in Markovian and partial observable environments},
  author = {Daniel Schneegaß and Steffen Udluft and Thomas Martinetz},
  year = {2007},
  url = {http://www.dice.ucl.ac.be/Proceedings/esann/esannpdf/es2007-24.pdf},
  tags = {Meta-Environment},
  researchr = {https://researchr.org/publication/SchneegassUM07},
  cites = {0},
  citedby = {0},
  pages = {301-306},
  booktitle = {ESANN 2007, 15th European Symposium on Artificial Neural Networks, Bruges, Belgium, April 25-27, 2007, Proceedings},
}