Policy Gradient using Weak Derivatives for Reinforcement Learning

Sujay Bhatt, Alec Koppel, Vikram Krishnamurthy. Policy Gradient using Weak Derivatives for Reinforcement Learning. In 53rd Annual Conference on Information Sciences and Systems, CISS 2019, Baltimore, MD, USA, March 20-22, 2019. pages 1-3, IEEE, 2019.
