Preference-Based Policy Iteration: Leveraging Preference Learning for Reinforcement Learning

Weiwei Cheng, Johannes Fürnkranz, Eyke Hüllermeier, Sang-Hyeun Park. Preference-Based Policy Iteration: Leveraging Preference Learning for Reinforcement Learning. In Dimitrios Gunopulos, Thomas Hofmann, Donato Malerba, Michalis Vazirgiannis, editors, Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2011, Athens, Greece, September 5-9, 2011. Proceedings, Part I. Volume 6911 of Lecture Notes in Computer Science, pages 312-327, Springer, 2011. [doi]

Abstract

Abstract is missing.