Sujay Bhatt, Alec Koppel, Vikram Krishnamurthy. Policy Gradient using Weak Derivatives for Reinforcement Learning. In 53rd Annual Conference on Information Sciences and Systems, CISS 2019, Baltimore, MD, USA, March 20-22, 2019. pages 1-3, IEEE, 2019. [doi]
Abstract is missing.