Safe Policy Improvement with Baseline Bootstrapping

Romain Laroche, Paul Trichelair, Remi Tachet des Combes. Safe Policy Improvement with Baseline Bootstrapping. In Kamalika Chaudhuri, Ruslan Salakhutdinov, editors, Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA. Volume 97 of Proceedings of Machine Learning Research, pages 3652-3661, PMLR, 2019.

Authors

Romain Laroche

Paul Trichelair

Remi Tachet des Combes
