Reinforcement learning for long-run average cost

Abhijit Gosavi. Reinforcement learning for long-run average cost. European Journal of Operational Research, 155(3):654-674, 2004.

Authors

Abhijit Gosavi
