Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Weighting

Zhang-Wei Hong, Pulkit Agrawal, Remi Tachet des Combes, Romain Laroche. Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Weighting. In The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. OpenReview.net, 2023.

Authors

Zhang-Wei Hong

Pulkit Agrawal

Remi Tachet des Combes

Romain Laroche
