Learning Neural Contextual Bandits through Perturbed Rewards

Yiling Jia, Weitong Zhang, Dongruo Zhou, Quanquan Gu, Hongning Wang. Learning Neural Contextual Bandits through Perturbed Rewards. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net, 2022.
