Importance sampling in reinforcement learning with an estimated behavior policy


Josiah P. Hanna, Scott Niekum, Peter Stone. Importance sampling in reinforcement learning with an estimated behavior policy. Machine Learning, 110(6):1267-1317, 2021.

The following publications are possibly variants of this publication:

Josiah Hanna, Scott Niekum, Peter Stone. Importance Sampling Policy Evaluation with an Estimated Behavior Policy. ICML 2019: 2605-2613.
