Safe Policy Improvement with Baseline Bootstrapping

Romain Laroche, Paul Trichelair, Remi Tachet des Combes. Safe Policy Improvement with Baseline Bootstrapping. In Kamalika Chaudhuri, Ruslan Salakhutdinov, editors, Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA. Volume 97 of Proceedings of Machine Learning Research, pages 3652-3661, PMLR, 2019. [doi]

Abstract

Abstract is missing.