Rommert Dekker, Arie Hordijk. Average, Sensitive and Blackwell Optimal Policies in Denumerable Markov Decision Chains with Unbounded Rewards. Math. Oper. Res., 13(3):395-420, 1988. [doi]
No references recorded for this publication.
No citations of this publication recorded.