Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning

Shangtong Zhang, Bo Liu, Shimon Whiteson. Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning. In Thirty-Fifth AAAI Conference on Artificial Intelligence, AAAI 2021, Thirty-Third Conference on Innovative Applications of Artificial Intelligence, IAAI 2021, The Eleventh Symposium on Educational Advances in Artificial Intelligence, EAAI 2021, Virtual Event, February 2-9, 2021. Pages 10905-10913, AAAI Press, 2021.

Authors

Shangtong Zhang

Bo Liu

Shimon Whiteson
