Policy Optimization with Second-Order Advantage Information

Jiajin Li, Baoxiang Wang, Shengyu Zhang. Policy Optimization with Second-Order Advantage Information. In Jérôme Lang, editor, Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, IJCAI 2018, July 13-19, 2018, Stockholm, Sweden. Pages 5038-5044, ijcai.org, 2018.
