Learning Multi-Objective Rewards and User Utility Function in Contextual Bandits for Personalized Ranking

Nirandika Wanigasekara, Yuxuan Liang, Siong-Thye Goh, Ye Liu, Joseph Jay Williams, David S. Rosenblum. Learning Multi-Objective Rewards and User Utility Function in Contextual Bandits for Personalized Ranking. In Sarit Kraus, editor, Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI 2019, Macao, China, August 10-16, 2019. pages 3835-3841, ijcai.org, 2019. [doi]

Abstract

Abstract is missing.