Adaptive Adversarial Multi-Armed Bandit Approach to Two-Person Zero-Sum Markov Games

Hyeong Soo Chang, Jiaqiao Hu, Michael C. Fu, Steven I. Marcus. Adaptive Adversarial Multi-Armed Bandit Approach to Two-Person Zero-Sum Markov Games. IEEE Trans. Automat. Contr., 55(2):463-468, 2010. [doi]

Abstract

Abstract is missing.