The policy iteration algorithm for average reward Markov decision processes with general state space

Sean P. Meyn. The policy iteration algorithm for average reward Markov decision processes with general state space. IEEE Trans. Automat. Contr., 42(12):1663-1680, 1997. [doi]

Abstract

Abstract is missing.