Offline Reinforcement Learning with On-Policy Q-Function Regularization

Laixi Shi, Robert Dadashi, Yuejie Chi, Pablo Samuel Castro, Matthieu Geist. Offline Reinforcement Learning with On-Policy Q-Function Regularization. In Danai Koutra, Claudia Plant, Manuel Gomez-Rodriguez, Elena Baralis, Francesco Bonchi, editors, Machine Learning and Knowledge Discovery in Databases: Research Track - European Conference, ECML PKDD 2023, Turin, Italy, September 18-22, 2023, Proceedings, Part IV. Volume 14172 of Lecture Notes in Computer Science, pages 455-471, Springer, 2023. [doi]

Abstract

Abstract is missing.