Learning Neural Contextual Bandits through Perturbed Rewards

Yiling Jia, Weitong Zhang, Dongruo Zhou, Quanquan Gu, Hongning Wang. Learning Neural Contextual Bandits through Perturbed Rewards. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net, 2022. [doi]

@inproceedings{JiaZZGW22,
  title = {Learning Neural Contextual Bandits through Perturbed Rewards},
  author = {Yiling Jia and Weitong Zhang and Dongruo Zhou and Quanquan Gu and Hongning Wang},
  year = {2022},
  url = {https://openreview.net/forum?id=7inCJ3MhXt3},
  researchr = {https://researchr.org/publication/JiaZZGW22},
  cites = {0},
  citedby = {0},
  booktitle = {The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022},
  publisher = {OpenReview.net},
}