Learning Nash Equilibria in Zero-Sum Stochastic Games via Entropy-Regularized Policy Approximation

Yue Guan, Qifan Zhang, Panagiotis Tsiotras. Learning Nash Equilibria in Zero-Sum Stochastic Games via Entropy-Regularized Policy Approximation. In Zhi-Hua Zhou, editor, Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, IJCAI 2021, Virtual Event / Montreal, Canada, 19-27 August 2021. pages 2462-2468, ijcai.org, 2021. [doi]

Authors

Yue Guan

This author has not been identified. Look up 'Yue Guan' in Google

Qifan Zhang

This author has not been identified. Look up 'Qifan Zhang' in Google

Panagiotis Tsiotras

This author has not been identified. Look up 'Panagiotis Tsiotras' in Google