Importance sampling in reinforcement learning with an estimated behavior policy

Josiah P. Hanna, Scott Niekum, Peter Stone. Importance sampling in reinforcement learning with an estimated behavior policy. Machine Learning, 110(6):1267-1317, 2021.

Authors

Josiah P. Hanna

Scott Niekum

Peter Stone
