Average, Sensitive and Blackwell Optimal Policies in Denumerable Markov Decision Chains with Unbounded Rewards

Rommert Dekker, Arie Hordijk. Average, Sensitive and Blackwell Optimal Policies in Denumerable Markov Decision Chains with Unbounded Rewards. Math. Oper. Res., 13(3):395-420, 1988. [doi]

Abstract

Abstract is missing.