A learning algorithm for Markov decision processes with adaptive state aggregation

J. S. Baras, V. S. Borkar. A learning algorithm for Markov decision processes with adaptive state aggregation. In 39th IEEE Conference on Decision and Control, CDC 2000, Sydney, Australia, December 12-15, 2000. pages 3351-3356, IEEE, 2000. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.