Improved High-Probability Regret for Adversarial Bandits with Time-Varying Feedback Graphs

Haipeng Luo, Hanghang Tong, Mengxiao Zhang, Yuheng Zhang. Improved High-Probability Regret for Adversarial Bandits with Time-Varying Feedback Graphs. In Shipra Agrawal 0001, Francesco Orabona, editors, International Conference on Algorithmic Learning Theory, February 20-23, 2023, Singapore. Volume 201 of Proceedings of Machine Learning Research, pages 1074-1100, PMLR, 2023. [doi]

Abstract

Abstract is missing.