Thodoris Lykouris, Éva Tardos, Drishti Wali. Feedback graph regret bounds for Thompson Sampling and UCB. In Aryeh Kontorovich, Gergely Neu, editors, Algorithmic Learning Theory, ALT 2020, 8-11 February 2020, San Diego, CA, USA. Volume 117 of Proceedings of Machine Learning Research, pages 592-614, PMLR, 2020.