Toward a Theoretical Foundation of Policy Optimization for Learning Control Policies

Bin Hu, Kaiqing Zhang, Na Li 0002, Mehran Mesbahi, Maryam Fazel, Tamer Basar. Toward a Theoretical Foundation of Policy Optimization for Learning Control Policies. Annu. Rev. Control. Robotics Auton. Syst., 6:123-158, May 2023. [doi]

Possibly Related Publications

The following publications are possibly variants of this publication: