Neural Replicator Dynamics: Multiagent Learning via Hedging Policy Gradients

Daniel Hennes, Dustin Morrill, Shayegan Omidshafiei, Rémi Munos, Julien Pérolat, Marc Lanctot, Audrunas Gruslys, Jean-Baptiste Lespiau, Paavo Parmas, Edgar A. Duéñez-Guzmán, Karl Tuyls. Neural Replicator Dynamics: Multiagent Learning via Hedging Policy Gradients. In Amal El Fallah-Seghrouchni, Gita Sukthankar, Bo An 0001, Neil Yorke-Smith, editors, Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS '20, Auckland, New Zealand, May 9-13, 2020. pages 492-501, International Foundation for Autonomous Agents and Multiagent Systems, 2020. [doi]

@inproceedings{HennesMOMPLGLPD20,
  title = {Neural Replicator Dynamics: Multiagent Learning via Hedging Policy Gradients},
  author = {Daniel Hennes and Dustin Morrill and Shayegan Omidshafiei and Rémi Munos and Julien Pérolat and Marc Lanctot and Audrunas Gruslys and Jean-Baptiste Lespiau and Paavo Parmas and Edgar A. Duéñez-Guzmán and Karl Tuyls},
  year = {2020},
  url = {https://dl.acm.org/doi/abs/10.5555/3398761.3398822},
  researchr = {https://researchr.org/publication/HennesMOMPLGLPD20},
  cites = {0},
  citedby = {0},
  pages = {492-501},
  booktitle = {Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS '20, Auckland, New Zealand, May 9-13, 2020},
  editor = {Amal El Fallah-Seghrouchni and Gita Sukthankar and Bo An 0001 and Neil Yorke-Smith},
  publisher = {International Foundation for Autonomous Agents and Multiagent Systems},
  isbn = {978-1-4503-7518-4},
}