Optimal Contextual Bandits with Knapsacks under Realizability via Regression Oracles

Yuxuan Han, Jialin Zeng, Yang Wang, Yang Xiang, Jiheng Zhang. Optimal Contextual Bandits with Knapsacks under Realizability via Regression Oracles. In Francisco J. R. Ruiz, Jennifer G. Dy, Jan-Willem van de Meent, editors, International Conference on Artificial Intelligence and Statistics, 25-27 April 2023, Palau de Congressos, Valencia, Spain. Volume 206 of Proceedings of Machine Learning Research, pages 5011-5035, PMLR, 2023. [doi]

Abstract

Abstract is missing.