Stochastic Constrained Contextual Bandits via Lyapunov Optimization Based Estimation to Decision Framework

Hengquan Guo, Xin Liu. Stochastic Constrained Contextual Bandits via Lyapunov Optimization Based Estimation to Decision Framework. In Shipra Agrawal 0001, Aaron Roth 0001, editors, The Thirty Seventh Annual Conference on Learning Theory, June 30 - July 3, 2023, Edmonton, Canada. Volume 247 of Proceedings of Machine Learning Research, pages 2204-2231, PMLR, 2024. [doi]

Abstract

Abstract is missing.