Neural Replicator Dynamics: Multiagent Learning via Hedging Policy Gradients

researchr

You are not signed in
Sign in
Sign up

Daniel Hennes, Dustin Morrill, Shayegan Omidshafiei, Rémi Munos, Julien Pérolat, Marc Lanctot, Audrunas Gruslys, Jean-Baptiste Lespiau, Paavo Parmas, Edgar A. Duéñez-Guzmán, Karl Tuyls. Neural Replicator Dynamics: Multiagent Learning via Hedging Policy Gradients. In Amal El Fallah-Seghrouchni, Gita Sukthankar, Bo An 0001, Neil Yorke-Smith, editors, Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS '20, Auckland, New Zealand, May 9-13, 2020. pages 492-501, International Foundation for Autonomous Agents and Multiagent Systems, 2020. [doi]

@inproceedings{HennesMOMPLGLPD20,
  title = {Neural Replicator Dynamics: Multiagent Learning via Hedging Policy Gradients},
  author = {Daniel Hennes and Dustin Morrill and Shayegan Omidshafiei and Rémi Munos and Julien Pérolat and Marc Lanctot and Audrunas Gruslys and Jean-Baptiste Lespiau and Paavo Parmas and Edgar A. Duéñez-Guzmán and Karl Tuyls},
  year = {2020},
  url = {https://dl.acm.org/doi/abs/10.5555/3398761.3398822},
  researchr = {https://researchr.org/publication/HennesMOMPLGLPD20},
  cites = {0},
  citedby = {0},
  pages = {492-501},
  booktitle = {Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS '20, Auckland, New Zealand, May 9-13, 2020},
  editor = {Amal El Fallah-Seghrouchni and Gita Sukthankar and Bo An 0001 and Neil Yorke-Smith},
  publisher = {International Foundation for Autonomous Agents and Multiagent Systems},
  isbn = {978-1-4503-7518-4},
}

External Links

Cite Key

Statistics

PDF

Researchr

Neural Replicator Dynamics: Multiagent Learning via Hedging Policy Gradients