Reinforcement learning from simultaneous human and MDP reward

W. Bradley Knox, Peter Stone. Reinforcement learning from simultaneous human and MDP reward. In Wiebe van der Hoek, Lin Padgham, Vincent Conitzer, Michael Winikoff, editors, International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2012, Valencia, Spain, June 4-8, 2012 (3 Volumes), pages 475-482. IFAAMAS, 2012.

Authors

W. Bradley Knox

Peter Stone
