Safe Policy Improvement with Baseline Bootstrapping

researchr

You are not signed in
Sign in
Sign up

Romain Laroche, Paul Trichelair, Remi Tachet des Combes. Safe Policy Improvement with Baseline Bootstrapping. In Kamalika Chaudhuri, Ruslan Salakhutdinov, editors, Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA. Volume 97 of Proceedings of Machine Learning Research, pages 3652-3661, PMLR, 2019. [doi]

@inproceedings{LarocheTC19,
  title = {Safe Policy Improvement with Baseline Bootstrapping},
  author = {Romain Laroche and Paul Trichelair and Remi Tachet des Combes},
  year = {2019},
  url = {http://proceedings.mlr.press/v97/laroche19a.html},
  researchr = {https://researchr.org/publication/LarocheTC19},
  cites = {0},
  citedby = {0},
  pages = {3652-3661},
  booktitle = {Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA},
  editor = {Kamalika Chaudhuri and Ruslan Salakhutdinov},
  volume = {97},
  series = {Proceedings of Machine Learning Research},
  publisher = {PMLR},
}

External Links

Cite Key

Statistics

PDF

Researchr

Safe Policy Improvement with Baseline Bootstrapping