Reinforcement Learning in Nonzero-sum Linear Quadratic Deep Structured Games: Global Convergence of Policy Optimization

Masoud Roudneshin, Jalal Arabneydi, Amir G. Aghdam. Reinforcement Learning in Nonzero-sum Linear Quadratic Deep Structured Games: Global Convergence of Policy Optimization. In 59th IEEE Conference on Decision and Control, CDC 2020, Jeju Island, South Korea, December 14-18, 2020. Pages 512-517, IEEE, 2020. doi:10.1109/CDC42340.2020.9303950

@inproceedings{RoudneshinAA20,
  title = {Reinforcement Learning in Nonzero-sum Linear Quadratic Deep Structured Games: Global Convergence of Policy Optimization},
  author = {Masoud Roudneshin and Jalal Arabneydi and Amir G. Aghdam},
  year = {2020},
  doi = {10.1109/CDC42340.2020.9303950},
  url = {https://doi.org/10.1109/CDC42340.2020.9303950},
  researchr = {https://researchr.org/publication/RoudneshinAA20},
  cites = {0},
  citedby = {0},
  pages = {512--517},
  booktitle = {59th IEEE Conference on Decision and Control, CDC 2020, Jeju Island, South Korea, December 14-18, 2020},
  publisher = {IEEE},
  isbn = {978-1-7281-7447-1},
}