Reinforcement learning from simultaneous human and MDP reward

W. Bradley Knox, Peter Stone. Reinforcement learning from simultaneous human and MDP reward. In Wiebe van der Hoek, Lin Padgham, Vincent Conitzer, Michael Winikoff, editors, International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2012, Valencia, Spain, June 4-8, 2012 (3 Volumes). Pages 475-482, IFAAMAS, 2012.
