Interactive Multi-objective Reinforcement Learning in Multi-armed Bandits with Gaussian Process Utility Models

Diederik M. Roijers, Luisa M. Zintgraf, Pieter Libin, Mathieu Reymond, Eugenio Bargiacchi, Ann Nowé. Interactive Multi-objective Reinforcement Learning in Multi-armed Bandits with Gaussian Process Utility Models. In Frank Hutter, Kristian Kersting, Jefrey Lijffijt, Isabel Valera, editors, Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2020, Ghent, Belgium, September 14-18, 2020, Proceedings, Part III. Volume 12459 of Lecture Notes in Computer Science, pages 463-478, Springer, 2020.
