On-line Policy Improvement using Monte-Carlo Search

Gerald Tesauro, Gregory R. Galperin. On-line Policy Improvement using Monte-Carlo Search. In Michael Mozer, Michael I. Jordan, Thomas Petsche, editors, Advances in Neural Information Processing Systems 9, NIPS, Denver, CO, USA, December 2-5, 1996. pages 1068-1074, MIT Press, 1996. [doi]

Abstract

Abstract is missing.