Structured-policy Q-learning: an LMI-based Design Strategy for Distributed Reinforcement Learning

Lorenzo Sforni, Andrea Camisa, Giuseppe Notarstefano. Structured-policy Q-learning: an LMI-based Design Strategy for Distributed Reinforcement Learning. In 61st IEEE Conference on Decision and Control, CDC 2022, Cancun, Mexico, December 6-9, 2022. pages 4059-4064, IEEE, 2022. [doi]

@inproceedings{SforniCN22,
  title = {Structured-policy Q-learning: an LMI-based Design Strategy for Distributed Reinforcement Learning},
  author = {Lorenzo Sforni and Andrea Camisa and Giuseppe Notarstefano},
  year = {2022},
  doi = {10.1109/CDC51059.2022.9992584},
  url = {https://doi.org/10.1109/CDC51059.2022.9992584},
  researchr = {https://researchr.org/publication/SforniCN22},
  cites = {0},
  citedby = {0},
  pages = {4059-4064},
  booktitle = {61st IEEE Conference on Decision and Control, CDC 2022, Cancun, Mexico, December 6-9, 2022},
  publisher = {IEEE},
  isbn = {978-1-6654-6761-2},
}