Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies

Kaiqing Zhang, Alec Koppel, Hao Zhu 0001, Tamer Basar. Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies. SIAM J. Control and Optimization, 58(6):3586-3612, 2020. [doi]

Authors

Kaiqing Zhang

This author has not been identified. Look up 'Kaiqing Zhang' in Google

Alec Koppel

This author has not been identified. Look up 'Alec Koppel' in Google

Hao Zhu 0001

This author has not been identified. Look up 'Hao Zhu 0001' in Google

Tamer Basar

This author has not been identified. Look up 'Tamer Basar' in Google