Conditional Importance Sampling for Off-Policy Learning

Mark Rowland, Anna Harutyunyan, Hado van Hasselt, Diana Borsa, Tom Schaul, Rémi Munos, Will Dabney. Conditional Importance Sampling for Off-Policy Learning. In Silvia Chiappa, Roberto Calandra, editors, The 23rd International Conference on Artificial Intelligence and Statistics, AISTATS 2020, 26-28 August 2020, Online [Palermo, Sicily, Italy]. Volume 108 of Proceedings of Machine Learning Research, pages 45-55, PMLR, 2020.
