On-line Policy Improvement using Monte-Carlo Search

Gerald Tesauro, Gregory R. Galperin. On-line Policy Improvement using Monte-Carlo Search. In Michael Mozer, Michael I. Jordan, Thomas Petsche, editors, Advances in Neural Information Processing Systems 9, NIPS, Denver, CO, USA, December 2-5, 1996. pages 1068-1074, MIT Press, 1996. [doi]

Authors

Gerald Tesauro

This author has not been identified. Look up 'Gerald Tesauro' in Google

Gregory R. Galperin

This author has not been identified. Look up 'Gregory R. Galperin' in Google