New bounds on the price of bandit feedback for mistake-bounded online multiclass learning

Philip M. Long. New bounds on the price of bandit feedback for mistake-bounded online multiclass learning. In International Conference on Algorithmic Learning Theory, ALT 2017, 15-17 October 2017, Kyoto University, Kyoto, Japan. Volume 76 of Proceedings of Machine Learning Research, pages 3-10, PMLR, 2017.
