Learning Curve Bounds for a Markov Decision Process with Undiscounted Rewards

Lawrence K. Saul, Satinder P. Singh. Learning Curve Bounds for a Markov Decision Process with Undiscounted Rewards. In COLT. pages 147-156, 1996. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.