Exploration for Free: How Does Reward Heterogeneity Improve Regret in Cooperative Multi-agent Bandits?

Xuchuang Wang, Lin Yang, Yu-Zhen Janice Chen, Xutong Liu, Mohammad Hajiesmaili, Don Towsley, John C. S. Lui. Exploration for Free: How Does Reward Heterogeneity Improve Regret in Cooperative Multi-agent Bandits?. In Robin J. Evans 0002, Ilya Shpitser, editors, Uncertainty in Artificial Intelligence, UAI 2023, July 31 - 4 August 2023, Pittsburgh, PA, USA. Volume 216 of Proceedings of Machine Learning Research, pages 2192-2202, PMLR, 2023. [doi]

Abstract

Abstract is missing.