Training neural networks with policy gradient

Sourabh Bose, Manfred Huber. Training neural networks with policy gradient. In 2017 International Joint Conference on Neural Networks, IJCNN 2017, Anchorage, AK, USA, May 14-19, 2017. pages 3998-4005, IEEE, 2017. [doi]

Abstract

Abstract is missing.