Neural Replicator Dynamics: Multiagent Learning via Hedging Policy Gradients

Daniel Hennes, Dustin Morrill, Shayegan Omidshafiei, Rémi Munos, Julien Pérolat, Marc Lanctot, Audrunas Gruslys, Jean-Baptiste Lespiau, Paavo Parmas, Edgar A. Duéñez-Guzmán, Karl Tuyls. Neural Replicator Dynamics: Multiagent Learning via Hedging Policy Gradients. In Amal El Fallah-Seghrouchni, Gita Sukthankar, Bo An 0001, Neil Yorke-Smith, editors, Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS '20, Auckland, New Zealand, May 9-13, 2020. pages 492-501, International Foundation for Autonomous Agents and Multiagent Systems, 2020. [doi]

Authors

Daniel Hennes

This author has not been identified. Look up 'Daniel Hennes' in Google

Dustin Morrill

This author has not been identified. Look up 'Dustin Morrill' in Google

Shayegan Omidshafiei

This author has not been identified. Look up 'Shayegan Omidshafiei' in Google

Rémi Munos

This author has not been identified. Look up 'Rémi Munos' in Google

Julien Pérolat

This author has not been identified. Look up 'Julien Pérolat' in Google

Marc Lanctot

This author has not been identified. Look up 'Marc Lanctot' in Google

Audrunas Gruslys

This author has not been identified. Look up 'Audrunas Gruslys' in Google

Jean-Baptiste Lespiau

This author has not been identified. Look up 'Jean-Baptiste Lespiau' in Google

Paavo Parmas

This author has not been identified. Look up 'Paavo Parmas' in Google

Edgar A. Duéñez-Guzmán

This author has not been identified. Look up 'Edgar A. Duéñez-Guzmán' in Google

Karl Tuyls

This author has not been identified. Look up 'Karl Tuyls' in Google