Learning Nash Equilibria in Zero-Sum Stochastic Games via Entropy-Regularized Policy Approximation

Yue Guan, Qifan Zhang, Panagiotis Tsiotras. Learning Nash Equilibria in Zero-Sum Stochastic Games via Entropy-Regularized Policy Approximation. In Zhi-Hua Zhou, editor, Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, IJCAI 2021, Virtual Event / Montreal, Canada, 19-27 August 2021. pages 2462-2468, ijcai.org, 2021.
