Exploration for Free: How Does Reward Heterogeneity Improve Regret in Cooperative Multi-agent Bandits?

Xuchuang Wang, Lin Yang, Yu-Zhen Janice Chen, Xutong Liu, Mohammad Hajiesmaili, Don Towsley, John C. S. Lui. Exploration for Free: How Does Reward Heterogeneity Improve Regret in Cooperative Multi-agent Bandits?. In Robin J. Evans 0002, Ilya Shpitser, editors, Uncertainty in Artificial Intelligence, UAI 2023, July 31 - 4 August 2023, Pittsburgh, PA, USA. Volume 216 of Proceedings of Machine Learning Research, pages 2192-2202, PMLR, 2023. [doi]

Authors

Xuchuang Wang

This author has not been identified. Look up 'Xuchuang Wang' in Google

Lin Yang

This author has not been identified. Look up 'Lin Yang' in Google

Yu-Zhen Janice Chen

This author has not been identified. Look up 'Yu-Zhen Janice Chen' in Google

Xutong Liu

This author has not been identified. Look up 'Xutong Liu' in Google

Mohammad Hajiesmaili

This author has not been identified. Look up 'Mohammad Hajiesmaili' in Google

Don Towsley

This author has not been identified. Look up 'Don Towsley' in Google

John C. S. Lui

This author has not been identified. Look up 'John C. S. Lui' in Google