Learning Neural Contextual Bandits through Perturbed Rewards

Yiling Jia, Weitong Zhang, Dongruo Zhou, Quanquan Gu, Hongning Wang. Learning Neural Contextual Bandits through Perturbed Rewards. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net, 2022.

Authors

Yiling Jia

Weitong Zhang

Dongruo Zhou

Quanquan Gu

Hongning Wang
