Smooth Contextual Bandits: Bridging the Parametric and Non-differentiable Regret Regimes

Yichun Hu, Nathan Kallus, Xiaojie Mao. Smooth Contextual Bandits: Bridging the Parametric and Non-differentiable Regret Regimes. In Jacob D. Abernethy, Shivani Agarwal 0001, editors, Conference on Learning Theory, COLT 2020, 9-12 July 2020, Virtual Event [Graz, Austria]. Volume 125 of Proceedings of Machine Learning Research, pages 2007-2010, PMLR, 2020. [doi]

Abstract

Abstract is missing.