On Generalized Bellman Equations and Temporal-Difference Learning

Huizhen Yu, Ashique Rupam Mahmood, Richard S. Sutton. On Generalized Bellman Equations and Temporal-Difference Learning. In Malek Mouhoub, Philippe Langlais, editors, Advances in Artificial Intelligence - 30th Canadian Conference on Artificial Intelligence, Canadian AI 2017, Edmonton, AB, Canada, May 16-19, 2017, Proceedings. Volume 10233 of Lecture Notes in Computer Science, pages 3-14, 2017.

@inproceedings{YuMS17,
  title = {On Generalized Bellman Equations and Temporal-Difference Learning},
  author = {Huizhen Yu and Ashique Rupam Mahmood and Richard S. Sutton},
  year = {2017},
  doi = {10.1007/978-3-319-57351-9_1},
  url = {https://doi.org/10.1007/978-3-319-57351-9_1},
  researchr = {https://researchr.org/publication/YuMS17},
  cites = {0},
  citedby = {0},
  pages = {3--14},
  booktitle = {Advances in Artificial Intelligence - 30th Canadian Conference on Artificial Intelligence, Canadian AI 2017, Edmonton, AB, Canada, May 16-19, 2017, Proceedings},
  editor = {Malek Mouhoub and Philippe Langlais},
  volume = {10233},
  series = {Lecture Notes in Computer Science},
  isbn = {978-3-319-57351-9},
}