Off-Policy Reinforcement Learning for Partially Unknown Nonzero-Sum Games

Qichao Zhang, Dongbin Zhao, Sibo Zhang. Off-Policy Reinforcement Learning for Partially Unknown Nonzero-Sum Games. In Derong Liu, Shengli Xie, Yuanqing Li, Dongbin Zhao, El-Sayed M. El-Alfy, editors, Neural Information Processing - 24th International Conference, ICONIP 2017, Guangzhou, China, November 14-18, 2017, Proceedings, Part I. Volume 10634 of Lecture Notes in Computer Science, pages 822-830, Springer, 2017.

Abstract

Abstract is missing.