Off-Policy Reinforcement Learning for Partially Unknown Nonzero-Sum Games

Qichao Zhang, Dongbin Zhao, Sibo Zhang. Off-Policy Reinforcement Learning for Partially Unknown Nonzero-Sum Games. In Derong Liu, Shengli Xie, Yuanqing Li, Dongbin Zhao, El-Sayed M. El-Alfy, editors, Neural Information Processing - 24th International Conference, ICONIP 2017, Guangzhou, China, November 14-18, 2017, Proceedings, Part I. Volume 10634 of Lecture Notes in Computer Science, pages 822-830, Springer, 2017.

Authors

Qichao Zhang

Dongbin Zhao

Sibo Zhang
