Combining manual feedback with subsequent MDP reward signals for reinforcement learning

W. Bradley Knox, Peter Stone. Combining manual feedback with subsequent MDP reward signals for reinforcement learning. In Wiebe van der Hoek, Gal A. Kaminka, Yves Lespérance, Michael Luck, Sandip Sen, editors, 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), Toronto, Canada, May 10-14, 2010, Volume 1-3, pages 5-12. IFAAMAS, 2010.