Researchr is a web site for finding, collecting, sharing, and reviewing scientific publications, for researchers by researchers.
Tong Zhang 0001. Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning. SIMODS, 4(2):834-857, June 2022. [doi]
Possibly Related Publications
The following publications are possibly variants of this publication:
Cairong Yan, Hualu Xu, Haixia Han, Yanting Zhang 0001, Zijian Wang. Thompson Sampling with Time-Varying Reward for Contextual Bandits. DASFAA 2023: 54-63 [doi]