Sharp bounds on the price of bandit feedback for several models of mistake-bounded online learning

Raymond Feng, Jesse Geneson, Andrew Lee, Espen Slettnes. Sharp bounds on the price of bandit feedback for several models of mistake-bounded online learning. Theoretical Computer Science, 965:113980, 2023. [doi]

Abstract

Abstract is missing.