Temporal Difference-Based Policy Iteration for Optimal Control of Stochastic Systems

Kang Cheng, Shumin Fei, Kanjian Zhang, Xiaomei Liu, Haikun Wei. Temporal Difference-Based Policy Iteration for Optimal Control of Stochastic Systems. J. Optimization Theory and Applications, 163(1):165-180, 2014. [doi]

Abstract

Abstract is missing.