Rectified Pessimistic-Optimistic Learning for Stochastic Continuum-armed Bandit with Constraints

Hengquan Guo, Zhu Qi, Xin Liu. Rectified Pessimistic-Optimistic Learning for Stochastic Continuum-armed Bandit with Constraints. In Nikolai Matni, Manfred Morari, George J. Pappas, editors, Learning for Dynamics and Control Conference, L4DC 2023, 15-16 June 2023, Philadelphia, PA, USA. Volume 211 of Proceedings of Machine Learning Research, pages 1333-1344, PMLR, 2023. [doi]

Abstract

Abstract is missing.