Fast Global Convergence of Natural Policy Gradient Methods with Entropy Regularization

Shicong Cen, Chen Cheng, Yuxin Chen 0002, Yuting Wei, Yuejie Chi. Fast Global Convergence of Natural Policy Gradient Methods with Entropy Regularization. Operations Research, 70(4):2563-2578, 2022. [doi]

Authors

Shicong Cen

This author has not been identified. Look up 'Shicong Cen' in Google

Chen Cheng

This author has not been identified. Look up 'Chen Cheng' in Google

Yuxin Chen 0002

This author has not been identified. Look up 'Yuxin Chen 0002' in Google

Yuting Wei

This author has not been identified. Look up 'Yuting Wei' in Google

Yuejie Chi

This author has not been identified. Look up 'Yuejie Chi' in Google