Undiscounted Markov decision chains with partial information; an algorithm for computing a locally optimal periodic policy

Arie Hordijk, J. A. Loeve. Undiscounted Markov decision chains with partial information; an algorithm for computing a locally optimal periodic policy. Math. Meth. of OR, 40(2):163-181, 1994. [doi]

Abstract

Abstract is missing.