A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm

Sumit Kunnumkal, Huseyin Topaloglu. A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm. ACM Trans. Model. Comput. Simul., 20(3), 2010.

Abstract

Abstract is missing.