Policy Gradient using Weak Derivatives for Reinforcement Learning

Sujay Bhatt, Alec Koppel, Vikram Krishnamurthy. Policy Gradient using Weak Derivatives for Reinforcement Learning. In 58th IEEE Conference on Decision and Control, CDC 2019, Nice, France, December 11-13, 2019. pages 5531-5537, IEEE, 2019. [doi]

Abstract

Abstract is missing.