A new Q-learning algorithm based on the metropolis criterion

Maozu Guo, Yang Liu, Jacek Malec. A new Q-learning algorithm based on the metropolis criterion. IEEE Transactions on Systems, Man, and Cybernetics, Part A, 34(5):2140-2143, 2004. [doi]

Abstract

Abstract is missing.