Exploiting the Structural Properties of the Underlying Markov Decision Problem in the Q-Learning Algorithm

Sumit Kunnumkal, Huseyin Topaloglu. Exploiting the Structural Properties of the Underlying Markov Decision Problem in the Q-Learning Algorithm. INFORMS Journal on Computing, 20(2):288-301, 2008. [doi]

Abstract

Abstract is missing.