Importance sampling in reinforcement learning with an estimated behavior policy

Josiah P. Hanna, Scott Niekum, Peter Stone. Importance sampling in reinforcement learning with an estimated behavior policy. Machine Learning, 110(6):1267-1317, 2021.
