From Q(lambda) to Average Q-learning: Efficient Implementation of an Asymptotic Approximation

Frédérick Garcia, Florent Serre. From Q(lambda) to Average Q-learning: Efficient Implementation of an Asymptotic Approximation. In Bernhard Nebel, editor, Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, IJCAI 2001, Seattle, Washington, USA, August 4-10, 2001, pages 959-964. Morgan Kaufmann, 2001.

Authors

Frédérick Garcia


Florent Serre
