A learning algorithm for Markov decision processes with adaptive state aggregation

J. S. Baras, V. S. Borkar. A learning algorithm for Markov decision processes with adaptive state aggregation. In 39th IEEE Conference on Decision and Control, CDC 2000, Sydney, Australia, December 12-15, 2000. pages 3351-3356, IEEE, 2000. [doi]

Abstract

Abstract is missing.