Average-Reward Reinforcement Learning with Trust Region Methods

Xiaoteng Ma, Xiaohang Tang, Li Xia, Jun Yang, Qianchuan Zhao. Average-Reward Reinforcement Learning with Trust Region Methods. In Zhi-Hua Zhou, editor, Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, IJCAI 2021, Virtual Event / Montreal, Canada, 19-27 August 2021, pages 2797-2803. ijcai.org, 2021.

Abstract

Abstract is missing.