Toward a Theoretical Foundation of Policy Optimization for Learning Control Policies

Bin Hu, Kaiqing Zhang, Na Li 0002, Mehran Mesbahi, Maryam Fazel, Tamer Basar. Toward a Theoretical Foundation of Policy Optimization for Learning Control Policies. Annu. Rev. Control. Robotics Auton. Syst., 6:123-158, May 2023. [doi]

Authors

Bin Hu

This author has not been identified. Look up 'Bin Hu' in Google

Kaiqing Zhang

This author has not been identified. Look up 'Kaiqing Zhang' in Google

Na Li 0002

This author has not been identified. Look up 'Na Li 0002' in Google

Mehran Mesbahi

This author has not been identified. Look up 'Mehran Mesbahi' in Google

Maryam Fazel

This author has not been identified. Look up 'Maryam Fazel' in Google

Tamer Basar

This author has not been identified. Look up 'Tamer Basar' in Google