Fast Reinforcement Learning with Large Action Sets Using Error-Correcting Output Codes for MDP Factorization

Gabriel Dulac-Arnold, Ludovic Denoyer, Philippe Preux, Patrick Gallinari. Fast Reinforcement Learning with Large Action Sets Using Error-Correcting Output Codes for MDP Factorization. In Peter A. Flach, Tijl De Bie, Nello Cristianini, editors, Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2012, Bristol, UK, September 24-28, 2012. Proceedings, Part II. Volume 7524 of Lecture Notes in Computer Science, pages 180-194, Springer, 2012.