Policy Iteration Q-Learning for Data-Based Two-Player Zero-Sum Game of Linear Discrete-Time Systems

Biao Luo, Yin Yang 0001, Derong Liu 0001. Policy Iteration Q-Learning for Data-Based Two-Player Zero-Sum Game of Linear Discrete-Time Systems. IEEE T. Cybernetics, 51(7):3630-3640, 2021. [doi]

Abstract

Abstract is missing.