Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism

Ming Yin, Yaqi Duan, Mengdi Wang, Yu-Xiang Wang 0003. Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net, 2022. [doi]

Abstract

Abstract is missing.