Combining manual feedback with subsequent MDP reward signals for reinforcement learning

W. Bradley Knox, Peter Stone. Combining manual feedback with subsequent MDP reward signals for reinforcement learning. In Wiebe van der Hoek, Gal A. Kaminka, Yves Lespérance, Michael Luck, Sandip Sen, editors, 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), Toronto, Canada, May 10-14, 2010, Volume 1-3. pages 5-12, IFAAMAS, 2010. [doi]

Authors

W. Bradley Knox

This author has not been identified. Look up 'W. Bradley Knox' in Google

Peter Stone

This author has not been identified. Look up 'Peter Stone' in Google