Off-Policy Reinforcement Learning for Partially Unknown Nonzero-Sum Games - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

Qichao Zhang, Dongbin Zhao, Sibo Zhang. Off-Policy Reinforcement Learning for Partially Unknown Nonzero-Sum Games. In Derong Liu, Shengli Xie, Yuanqing Li, Dongbin Zhao, El-Sayed M. El-Alfy, editors, Neural Information Processing - 24th International Conference, ICONIP 2017, Guangzhou, China, November 14-18, 2017, Proceedings, Part I. Volume 10634 of Lecture Notes in Computer Science, pages 822-830, Springer, 2017. [doi]

This author has not been identified. Look up 'Qichao Zhang' in GoogleThis author has not been identified. Look up 'Dongbin Zhao' in GoogleThis author has not been identified. Look up 'Sibo Zhang' in Google

runs on WebDSL