Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism

Wang Chi Cheung, David Simchi-Levi, Ruihao Zhu. Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism. In Proceedings of the 37th International Conference on Machine Learning, ICML 2020, 13-18 July 2020, Virtual Event. Volume 119 of Proceedings of Machine Learning Research, pages 1843-1854, PMLR, 2020.

@inproceedings{CheungSZ20,
  title = {Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism},
  author = {Wang Chi Cheung and David Simchi-Levi and Ruihao Zhu},
  year = {2020},
  url = {http://proceedings.mlr.press/v119/cheung20a.html},
  researchr = {https://researchr.org/publication/CheungSZ20},
  cites = {0},
  citedby = {0},
  pages = {1843--1854},
  booktitle = {Proceedings of the 37th International Conference on Machine Learning, ICML 2020, 13-18 July 2020, Virtual Event},
  volume = {119},
  series = {Proceedings of Machine Learning Research},
  publisher = {PMLR},
}