Policy Gradient using Weak Derivatives for Reinforcement Learning

Sujay Bhatt, Alec Koppel, Vikram Krishnamurthy. Policy Gradient using Weak Derivatives for Reinforcement Learning. In 58th IEEE Conference on Decision and Control, CDC 2019, Nice, France, December 11-13, 2019. pages 5531-5537, IEEE, 2019. [doi]

Authors

Sujay Bhatt

This author has not been identified. Look up 'Sujay Bhatt' in Google

Alec Koppel

This author has not been identified. Look up 'Alec Koppel' in Google

Vikram Krishnamurthy

This author has not been identified. Look up 'Vikram Krishnamurthy' in Google