On Regret-Optimal Learning in Decentralized Multiplayer Multiarmed Bandits

Naumaan Nayyar, Dileep M. Kalathil, Rahul Jain. On Regret-Optimal Learning in Decentralized Multiplayer Multiarmed Bandits. IEEE Trans. Control of Network Systems, 5(1):597-606, 2018.
