Shicong Cen, Chen Cheng, Yuxin Chen 0002, Yuting Wei, Yuejie Chi. Fast Global Convergence of Natural Policy Gradient Methods with Entropy Regularization. Operations Research, 70(4):2563-2578, 2022. [doi]
No references recorded for this publication.
No citations of this publication recorded.