Pseudometrics for State Aggregation in Average Reward Markov Decision Processes

Ronald Ortner. Pseudometrics for State Aggregation in Average Reward Markov Decision Processes. In Marcus Hutter, Rocco A. Servedio, Eiji Takimoto, editors, Algorithmic Learning Theory, 18th International Conference, ALT 2007, Sendai, Japan, October 1-4, 2007, Proceedings. Volume 4754 of Lecture Notes in Computer Science, pages 373-387, Springer, 2007. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.