On the Convergence of Policy Iteration in Finite State Undiscounted Markov Decision Processes: The Unichain Case

Arie Hordijk, Martin L. Puterman. On the Convergence of Policy Iteration in Finite State Undiscounted Markov Decision Processes: The Unichain Case. Math. Oper. Res., 12(1):163-176, 1987.
