Reusing Historical Trajectories in Natural Policy Gradient via Importance Sampling: Convergence and Convergence Rate


Yifan Lin, Yuhao Wang, Enlu Zhou. Reusing Historical Trajectories in Natural Policy Gradient via Importance Sampling: Convergence and Convergence Rate. Operations Research, 73(6):3010-3026, 2025.

