The following publications are possibly variants of this publication:
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement LearningChenjia Bai, Lingxiao Wang 0003, Zhuoran Yang, Zhi-Hong Deng, Animesh Garg, Peng Liu 0008, Zhaoran Wang. iclr 2022: [doi]
- Neural Network Approximation for Pessimistic Offline Reinforcement LearningDi Wu, Yuling Jiao, Li Shen, Haizhao Yang, Xiliang Lu. AAAI 2024: 15868-15877 [doi]
- Pessimistic value iteration for multi-task data sharing in Offline Reinforcement LearningChenjia Bai, Lingxiao Wang 0003, Jianye Hao, Zhuoran Yang, Bin Zhao 0001, Zhen Wang, Xuelong Li 0001. ai, 326:104048, January 2024. [doi]
- Offline Constrained Multi-Objective Reinforcement Learning via Pessimistic Dual Value IterationRunzhe Wu, Yufeng Zhang 0007, Zhuoran Yang, Zhaoran Wang. nips 2021: 25439-25451 [doi]