From Q(lambda) to Average Q-learning: Efficient Implementation of an Asymptotic Approximation

Frédérick Garcia, Florent Serre. From Q(lambda) to Average Q-learning: Efficient Implementation of an Asymptotic Approximation. In Bernhard Nebel, editor, Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, IJCAI 2001, Seattle, Washington, USA, August 4-10, 2001. pages 959-964, Morgan Kaufmann, 2001.

Abstract

Abstract is missing.