Reinforcement Learning for POMDPs Based on Action Values and Stochastic Optimization

Theodore J. Perkins. Reinforcement Learning for POMDPs Based on Action Values and Stochastic Optimization. In AAAI/IAAI. pages 199-204, 2002.

Authors

Theodore J. Perkins

This author has not been identified. Look up 'Theodore J. Perkins' in Google