Reinforcement Learning in Nonzero-sum Linear Quadratic Deep Structured Games: Global Convergence of Policy Optimization

Masoud Roudneshin, Jalal Arabneydi, Amir G. Aghdam. Reinforcement Learning in Nonzero-sum Linear Quadratic Deep Structured Games: Global Convergence of Policy Optimization. In 59th IEEE Conference on Decision and Control, CDC 2020, Jeju Island, South Korea, December 14-18, 2020. pages 512-517, IEEE, 2020. [doi]

Abstract

Abstract is missing.