Fast Reinforcement Learning with Large Action Sets Using Error-Correcting Output Codes for MDP Factorization

Gabriel Dulac-Arnold, Ludovic Denoyer, Philippe Preux, Patrick Gallinari. Fast Reinforcement Learning with Large Action Sets Using Error-Correcting Output Codes for MDP Factorization. In Peter A. Flach, Tijl De Bie, Nello Cristianini, editors, Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2012, Bristol, UK, September 24-28, 2012. Proceedings, Part II. Volume 7524 of Lecture Notes in Computer Science, pages 180-194, Springer, 2012. [doi]

Authors

Gabriel Dulac-Arnold

This author has not been identified. Look up 'Gabriel Dulac-Arnold' in Google

Ludovic Denoyer

This author has not been identified. Look up 'Ludovic Denoyer' in Google

Philippe Preux

This author has not been identified. Look up 'Philippe Preux' in Google

Patrick Gallinari

This author has not been identified. Look up 'Patrick Gallinari' in Google