Offline Reinforcement Learning with On-Policy Q-Function Regularization

Laixi Shi, Robert Dadashi, Yuejie Chi, Pablo Samuel Castro, Matthieu Geist. Offline Reinforcement Learning with On-Policy Q-Function Regularization. In Danai Koutra, Claudia Plant, Manuel Gomez-Rodriguez, Elena Baralis, Francesco Bonchi, editors, Machine Learning and Knowledge Discovery in Databases: Research Track - European Conference, ECML PKDD 2023, Turin, Italy, September 18-22, 2023, Proceedings, Part IV. Volume 14172 of Lecture Notes in Computer Science, pages 455-471, Springer, 2023. [doi]

Authors

Laixi Shi

This author has not been identified. Look up 'Laixi Shi' in Google

Robert Dadashi

This author has not been identified. Look up 'Robert Dadashi' in Google

Yuejie Chi

This author has not been identified. Look up 'Yuejie Chi' in Google

Pablo Samuel Castro

This author has not been identified. Look up 'Pablo Samuel Castro' in Google

Matthieu Geist

This author has not been identified. Look up 'Matthieu Geist' in Google