Reinforcement learning from simultaneous human and MDP reward

W. Bradley Knox, Peter Stone. Reinforcement learning from simultaneous human and MDP reward. In Wiebe van der Hoek, Lin Padgham, Vincent Conitzer, Michael Winikoff, editors, International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2012, Valencia, Spain, June 4-8, 2012 (3 Volumes). pages 475-482, IFAAMAS, 2012. [doi]

Abstract

Abstract is missing.