Preference-based Online Learning with Dueling Bandits: A Survey

Viktor Bengs, Róbert Busa-Fekete, Adil El Mesaoudi-Paul, Eyke Hüllermeier. Preference-based Online Learning with Dueling Bandits: A Survey. Journal of Machine Learning Research, 22, 2021. [doi]

Abstract

Abstract is missing.