Contextual bandits with continuous actions: Smoothing, zooming, and adapting

Akshay Krishnamurthy, John Langford 0001, Aleksandrs Slivkins, Chicheng Zhang. Contextual bandits with continuous actions: Smoothing, zooming, and adapting. In Alina Beygelzimer, Daniel Hsu 0001, editors, Conference on Learning Theory, COLT 2019, 25-28 June 2019, Phoenix, AZ, USA. Volume 99 of Proceedings of Machine Learning Research, pages 2025-2027, PMLR, 2019. [doi]

Abstract

Abstract is missing.