Policy iteration based Q-learning for linear nonzero-sum quadratic differential games

Xinxing Li, Zhihong Peng, Li Liang, Wenzhong Zha. Policy iteration based Q-learning for linear nonzero-sum quadratic differential games. Science in China Series F: Information Sciences, 62(5):52204, 2019. [doi]

Abstract

Abstract is missing.