Coordination without communication: optimal regret in two players multi-armed bandits

Sébastien Bubeck, Thomas Budzinski. Coordination without communication: optimal regret in two players multi-armed bandits. In Jacob D. Abernethy, Shivani Agarwal 0001, editors, Conference on Learning Theory, COLT 2020, 9-12 July 2020, Virtual Event [Graz, Austria]. Volume 125 of Proceedings of Machine Learning Research, pages 916-939, PMLR, 2020. [doi]

Authors

Sébastien Bubeck

This author has not been identified. Look up 'Sébastien Bubeck' in Google

Thomas Budzinski

This author has not been identified. Look up 'Thomas Budzinski' in Google