Simon Vanneste, Astrid Vanneste, Tom De Schepper, Siegfried Mercelis, Peter Hellinckx, Kevin Mets. Multi-Agent Counterfactual Communication Using Difference Rewards Policy Gradients. In Frans A. Oliehoek, Manon Kok, Sicco Verwer, editors, Artificial Intelligence and Machine Learning - 35th Benelux Conference, BNAIC/Benelearn 2023, Delft, The Netherlands, November 8-10, 2023, Revised Selected Papers. Volume 2187 of Communications in Computer and Information Science, pages 82-100, Springer, 2023. [doi]
Abstract is missing.