Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Weighting

Zhang-Wei Hong, Pulkit Agrawal, Remi Tachet des Combes, Romain Laroche. Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Weighting. In The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. OpenReview.net, 2023. [doi]

Abstract

Abstract is missing.