Learning to trade off between exploration and exploitation in multiclass bandit prediction

Hamed Valizadegan, Rong Jin, Shijun Wang. Learning to trade off between exploration and exploitation in multiclass bandit prediction. In Chid Apté, Joydeep Ghosh, Padhraic Smyth, editors, Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, CA, USA, August 21-24, 2011. pages 204-212, ACM, 2011. [doi]

Abstract

Abstract is missing.