Risk-sensitive REINFORCE: A Monte Carlo Policy Gradient Algorithm for Exponential Performance Criteria

Erfaun Noorani, John S. Baras. Risk-sensitive REINFORCE: A Monte Carlo Policy Gradient Algorithm for Exponential Performance Criteria. In 60th IEEE Conference on Decision and Control, CDC 2021, Austin, TX, USA, December 14-17, 2021, pages 1522-1527. IEEE, 2021.

Authors

Erfaun Noorani

John S. Baras
