Bandit Learning with Delayed Impact of Actions

Wei Tang, Chien-Ju Ho, Yang Liu 0018. Bandit Learning with Delayed Impact of Actions. In Marc'Aurelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, Jennifer Wortman Vaughan, editors, Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual. pages 26804-26817, 2021. [doi]

Abstract

Abstract is missing.