Josiah P. Hanna, Scott Niekum, Peter Stone. Importance sampling in reinforcement learning with an estimated behavior policy. Machine Learning, 110(6):1267-1317, 2021.
Possibly Related Publications
The following publications are possibly variants of this publication:
Josiah Hanna, Scott Niekum, Peter Stone. Importance Sampling Policy Evaluation with an Estimated Behavior Policy. ICML 2019: 2605-2613.