Soft Policy Gradient Method for Maximum Entropy Deep Reinforcement Learning

Wenjie Shi, Shiji Song, Cheng Wu. Soft Policy Gradient Method for Maximum Entropy Deep Reinforcement Learning. In Sarit Kraus, editor, Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI 2019, Macao, China, August 10-16, 2019. pages 3425-3431, ijcai.org, 2019. [doi]

Abstract

Abstract is missing.