Reinforcement Learning for Partially Observable Dynamic Processes: Adaptive Dynamic Programming Using Measured Output Data

Frank L. Lewis, Kyriakos G. Vamvoudakis. Reinforcement Learning for Partially Observable Dynamic Processes: Adaptive Dynamic Programming Using Measured Output Data. IEEE Transactions on Systems, Man, and Cybernetics, Part A, 41(1):14-25, 2011. [doi]

Abstract

Abstract is missing.