A Sharp Memory-Regret Trade-off for Multi-Pass Streaming Bandits

Arpit Agarwal, Sanjeev Khanna, Prathamesh Patil. A Sharp Memory-Regret Trade-off for Multi-Pass Streaming Bandits. In Po-Ling Loh, Maxim Raginsky, editors, Conference on Learning Theory, 2-5 July 2022, London, UK. Volume 178 of Proceedings of Machine Learning Research, pages 1423-1462, PMLR, 2022. [doi]

Abstract

Abstract is missing.