Reinforcement Learning for Partially Observable Dynamic Processes: Adaptive Dynamic Programming Using Measured Output Data

Frank L. Lewis, Kyriakos G. Vamvoudakis. Reinforcement Learning for Partially Observable Dynamic Processes: Adaptive Dynamic Programming Using Measured Output Data. IEEE Transactions on Systems, Man, and Cybernetics, Part A, 41(1):14-25, 2011.

Authors

Frank L. Lewis

Kyriakos G. Vamvoudakis