Linearizing contextual bandits with latent state dynamics

Elliot Nelson, Debarun Bhattacharjya, Tian Gao, Miao Liu, Djallel Bouneffouf 0001, Pascal Poupart. Linearizing contextual bandits with latent state dynamics. In James Cussens, Kun Zhang 0001, editors, Uncertainty in Artificial Intelligence, Proceedings of the Thirty-Eighth Conference on Uncertainty in Artificial Intelligence, UAI 2022, 1-5 August 2022, Eindhoven, The Netherlands. Volume 180 of Proceedings of Machine Learning Research, pages 1477-1487, PMLR, 2022. [doi]

Abstract

Abstract is missing.