Learning is planning: near Bayes-optimal reinforcement learning via Monte-Carlo tree search

John Asmuth, Michael L. Littman. Learning is planning: near Bayes-optimal reinforcement learning via Monte-Carlo tree search. In Fabio Gagliardi Cozman, Avi Pfeffer, editors, UAI 2011, Proceedings of the Twenty-Seventh Conference on Uncertainty in Artificial Intelligence, Barcelona, Spain, July 14-17, 2011, pages 19-26. AUAI Press, 2011.

Authors

John Asmuth

Michael L. Littman
