Pessimistic value iteration for multi-task data sharing in Offline Reinforcement Learning

Chenjia Bai, Lingxiao Wang, Jianye Hao, Zhuoran Yang, Bin Zhao, Zhen Wang, Xuelong Li. Pessimistic value iteration for multi-task data sharing in Offline Reinforcement Learning. Artificial Intelligence, 326:104048, January 2024.
