A Generalized Minimax Q-Learning Algorithm for Two-Player Zero-Sum Stochastic Games

Raghuram Bharadwaj Diddigi, Chandramouli Kamanchi, Shalabh Bhatnagar. A Generalized Minimax Q-Learning Algorithm for Two-Player Zero-Sum Stochastic Games. IEEE Trans. Automat. Contr., 67(9):4816-4823, 2022.
