Sujay Bhatt, Alec Koppel, Vikram Krishnamurthy. Policy Gradient using Weak Derivatives for Reinforcement Learning. In 58th IEEE Conference on Decision and Control, CDC 2019, Nice, France, December 11-13, 2019, pages 5531-5537. IEEE, 2019.