Coordination without communication: optimal regret in two players multi-armed bandits

Sébastien Bubeck, Thomas Budzinski. Coordination without communication: optimal regret in two players multi-armed bandits. In Jacob D. Abernethy, Shivani Agarwal 0001, editors, Conference on Learning Theory, COLT 2020, 9-12 July 2020, Virtual Event [Graz, Austria]. Volume 125 of Proceedings of Machine Learning Research, pages 916-939, PMLR, 2020. [doi]

Abstract

Abstract is missing.