Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning - researchr publication related

researchr

You are not signed in
Sign in
Sign up

Tong Zhang 0001. Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning. SIMODS, 4(2):834-857, June 2022. [doi]

The following publications are possibly variants of this publication:

Thompson Sampling with Time-Varying Reward for Contextual BanditsCairong Yan, Hualu Xu, Haixia Han, Yanting Zhang 0001, Zijian Wang. dasfaa 2023: 54-63 [doi]

runs on WebDSL