An empirical study of policy convergence in Markov decision process value iteration

Christopher W. Zobel, William T. Scherer. An empirical study of policy convergence in Markov decision process value iteration. Computers & OR, 32:127-142, 2005.

Abstract

Abstract is missing.