Mean-Field Controls with Q-Learning for Cooperative MARL: Convergence and Complexity Analysis

Haotian Gu, Xin Guo, Xiaoli Wei, Renyuan Xu. Mean-Field Controls with Q-Learning for Cooperative MARL: Convergence and Complexity Analysis. SIMODS, 3(4):1168-1196, 2021. doi:10.1137/20M1360700

@article{GuGWX21,
  title = {Mean-Field Controls with Q-Learning for Cooperative MARL: Convergence and Complexity Analysis},
  author = {Haotian Gu and Xin Guo and Xiaoli Wei and Renyuan Xu},
  year = {2021},
  doi = {10.1137/20M1360700},
  url = {https://doi.org/10.1137/20M1360700},
  researchr = {https://researchr.org/publication/GuGWX21},
  cites = {0},
  citedby = {0},
  journal = {SIAM Journal on Mathematics of Data Science},
  volume = {3},
  number = {4},
  pages = {1168--1196},
}