Combining manual feedback with subsequent MDP reward signals for reinforcement learning

W. Bradley Knox, Peter Stone. Combining manual feedback with subsequent MDP reward signals for reinforcement learning. In Wiebe van der Hoek, Gal A. Kaminka, Yves Lespérance, Michael Luck, Sandip Sen, editors, 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), Toronto, Canada, May 10-14, 2010, Volume 1-3. pages 5-12, IFAAMAS, 2010. [doi]

Abstract

Abstract is missing.