Adaptive computation of optimal nonrandomized policies in constrained average-reward MDPs

Eugene A. Feinberg. Adaptive computation of optimal nonrandomized policies in constrained average-reward MDPs. In IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2009, Nashville, TN, USA, March 31 - April 1, 2009, pages 96-100. IEEE, 2009.
