Planning and Learning in Environments with Delayed Feedback

Thomas J. Walsh, Ali Nouri, Lihong Li, Michael L. Littman. Planning and Learning in Environments with Delayed Feedback. In Joost N. Kok, Jacek Koronacki, Ramon López de Mántaras, Stan Matwin, Dunja Mladenic, Andrzej Skowron, editors, Machine Learning: ECML 2007, 18th European Conference on Machine Learning, Warsaw, Poland, September 17-21, 2007, Proceedings. Volume 4701 of Lecture Notes in Computer Science, pages 442-453, Springer, 2007. [doi]

@inproceedings{WalshNLL07,
  title = {Planning and Learning in Environments with Delayed Feedback},
  author = {Thomas J. Walsh and Ali Nouri and Lihong Li and Michael L. Littman},
  year = {2007},
  doi = {10.1007/978-3-540-74958-5_41},
  url = {http://dx.doi.org/10.1007/978-3-540-74958-5_41},
  tags = {meta-model, Meta-Environment},
  researchr = {https://researchr.org/publication/WalshNLL07},
  cites = {0},
  citedby = {0},
  pages = {442-453},
  booktitle = {Machine Learning: ECML 2007, 18th European Conference on Machine Learning, Warsaw, Poland, September 17-21, 2007, Proceedings},
  editor = {Joost N. Kok and Jacek Koronacki and Ramon López de Mántaras and Stan Matwin and Dunja Mladenic and Andrzej Skowron},
  volume = {4701},
  series = {Lecture Notes in Computer Science},
  publisher = {Springer},
  isbn = {978-3-540-74957-8},
}