Toward a Theoretical Foundation of Policy Optimization for Learning Control Policies

Bin Hu, Kaiqing Zhang, Na Li 0002, Mehran Mesbahi, Maryam Fazel, Tamer Basar. Toward a Theoretical Foundation of Policy Optimization for Learning Control Policies. Annu. Rev. Control. Robotics Auton. Syst., 6:123-158, May 2023. [doi]

Abstract

Abstract is missing.