BiES: Adaptive Policy Optimization for Model-Based Offline Reinforcement Learning

Yijun Yang, Jing Jiang, Zhuowei Wang, Qiqi Duan, Yuhui Shi. BiES: Adaptive Policy Optimization for Model-Based Offline Reinforcement Learning. In Guodong Long, Xinghuo Yu, Sen Wang, editors, AI 2021: Advances in Artificial Intelligence - 34th Australasian Joint Conference, AI 2021, Sydney, NSW, Australia, February 2-4, 2022, Proceedings. Volume 13151 of Lecture Notes in Computer Science, pages 570-581, Springer, 2022.
