Characterizing the Exact Behaviors of Temporal Difference Learning Algorithms Using Markov Jump Linear System Theory

Bin Hu, Usman Ahmed Syed. Characterizing the Exact Behaviors of Temporal Difference Learning Algorithms Using Markov Jump Linear System Theory. In Hanna M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d'Alché-Buc, Edward A. Fox, Roman Garnett, editors, Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, 8-14 December 2019, Vancouver, BC, Canada. pages 8477-8488, 2019. [doi]

@inproceedings{HuS19-5,
  title = {Characterizing the Exact Behaviors of Temporal Difference Learning Algorithms Using Markov Jump Linear System Theory},
  author = {Bin Hu and Usman Ahmed Syed},
  year = {2019},
  url = {http://papers.nips.cc/paper/9055-characterizing-the-exact-behaviors-of-temporal-difference-learning-algorithms-using-markov-jump-linear-system-theory},
  researchr = {https://researchr.org/publication/HuS19-5},
  cites = {0},
  citedby = {0},
  pages = {8477-8488},
  booktitle = {Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, 8-14 December 2019, Vancouver, BC, Canada},
  editor = {Hanna M. Wallach and Hugo Larochelle and Alina Beygelzimer and Florence d'Alché-Buc and Edward A. Fox and Roman Garnett},
}