Potential-based online policy iteration algorithms for Markov decision processes

Haitao Fang, Xi-Ren Cao. Potential-based online policy iteration algorithms for Markov decision processes. IEEE Trans. Automat. Contr., 49(4):493-505, 2004. [doi]

Abstract

Abstract is missing.