Adaptive policy for two finite Markov chains zero-sum stochastic game with unknown transition matrices and average payoffs

Kaddour Najim, Alexander S. Poznyak, E. Gomez. Adaptive policy for two finite Markov chains zero-sum stochastic game with unknown transition matrices and average payoffs. Automatica, 37(7):1007-1018, 2001. [doi]

Abstract

Abstract is missing.