Minimax Regret Bounds for Reinforcement Learning

Mohammad Gheshlaghi Azar, Ian Osband, Rémi Munos. Minimax Regret Bounds for Reinforcement Learning. In Doina Precup, Yee Whye Teh, editors, Proceedings of the 34th International Conference on Machine Learning, ICML 2017, Sydney, NSW, Australia, 6-11 August 2017. Volume 70 of JMLR Workshop and Conference Proceedings, pages 263-272, JMLR.org, 2017.

@inproceedings{AzarOM17,
  title = {Minimax Regret Bounds for Reinforcement Learning},
  author = {Mohammad Gheshlaghi Azar and Ian Osband and Rémi Munos},
  year = {2017},
  url = {http://proceedings.mlr.press/v70/azar17a.html},
  researchr = {https://researchr.org/publication/AzarOM17},
  pages = {263--272},
  booktitle = {Proceedings of the 34th International Conference on Machine Learning, ICML 2017, Sydney, NSW, Australia, 6-11 August 2017},
  editor = {Doina Precup and Yee Whye Teh},
  volume = {70},
  series = {JMLR Workshop and Conference Proceedings},
  publisher = {JMLR.org},
}