Toward a Theoretical Foundation of Policy Optimization for Learning Control Policies

Bin Hu, Kaiqing Zhang, Na Li, Mehran Mesbahi, Maryam Fazel, Tamer Başar. Toward a Theoretical Foundation of Policy Optimization for Learning Control Policies. Annu. Rev. Control. Robotics Auton. Syst., 6:123-158, May 2023.

@article{HuZLMFB23,
  title = {Toward a Theoretical Foundation of Policy Optimization for Learning Control Policies},
  author = {Bin Hu and Kaiqing Zhang and Na Li and Mehran Mesbahi and Maryam Fazel and Tamer Ba{\c{s}}ar},
  year = {2023},
  month = {May},
  doi = {10.1146/annurev-control-042920-020021},
  url = {https://doi.org/10.1146/annurev-control-042920-020021},
  researchr = {https://researchr.org/publication/HuZLMFB23},
  journal = {Annual Review of Control, Robotics, and Autonomous Systems},
  volume = {6},
  pages = {123--158},
}