Average-Reward Reinforcement Learning with Trust Region Methods

Xiaoteng Ma, Xiaohang Tang, Li Xia, Jun Yang 0028, Qianchuan Zhao. Average-Reward Reinforcement Learning with Trust Region Methods. In Zhi-Hua Zhou, editor, Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, IJCAI 2021, Virtual Event / Montreal, Canada, 19-27 August 2021. pages 2797-2803, ijcai.org, 2021. [doi]

Authors

Xiaoteng Ma

This author has not been identified. Look up 'Xiaoteng Ma' in Google

Xiaohang Tang

This author has not been identified. Look up 'Xiaohang Tang' in Google

Li Xia

This author has not been identified. Look up 'Li Xia' in Google

Jun Yang 0028

This author has not been identified. Look up 'Jun Yang 0028' in Google

Qianchuan Zhao

This author has not been identified. Look up 'Qianchuan Zhao' in Google