Safe Policy Improvement with Baseline Bootstrapping

Romain Laroche, Paul Trichelair, Remi Tachet des Combes. Safe Policy Improvement with Baseline Bootstrapping. In Kamalika Chaudhuri, Ruslan Salakhutdinov, editors, Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA. Volume 97 of Proceedings of Machine Learning Research, pages 3652-3661, PMLR, 2019.

@inproceedings{LarocheTC19,
  title = {Safe Policy Improvement with Baseline Bootstrapping},
  author = {Romain Laroche and Paul Trichelair and Remi Tachet des Combes},
  year = {2019},
  url = {http://proceedings.mlr.press/v97/laroche19a.html},
  researchr = {https://researchr.org/publication/LarocheTC19},
  cites = {0},
  citedby = {0},
  pages = {3652--3661},
  booktitle = {Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA},
  editor = {Kamalika Chaudhuri and Ruslan Salakhutdinov},
  volume = {97},
  series = {Proceedings of Machine Learning Research},
  publisher = {PMLR},
}