Fast Global Convergence of Natural Policy Gradient Methods with Entropy Regularization

Shicong Cen, Chen Cheng, Yuxin Chen 0002, Yuting Wei, Yuejie Chi. Fast Global Convergence of Natural Policy Gradient Methods with Entropy Regularization. Operations Research, 70(4):2563-2578, 2022. [doi]

Possibly Related Publications

The following publications are possibly variants of this publication: