The policy iteration algorithm for average reward Markov decision processes with general state space


Sean P. Meyn. The policy iteration algorithm for average reward Markov decision processes with general state space. IEEE Trans. Automat. Contr., 42(12):1663-1680, 1997.

