Exploiting the Sign of the Advantage Function to Learn Deterministic Policies in Continuous Domains

Matthieu Zimmer, Paul Weng. Exploiting the Sign of the Advantage Function to Learn Deterministic Policies in Continuous Domains. In Sarit Kraus, editor, Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI 2019, Macao, China, August 10-16, 2019. pages 4496-4502, ijcai.org, 2019. [doi]

Abstract

Abstract is missing.