Offline RL in Regular Decision Processes: Sample Efficiency via Language Metrics

Ahana Deb, Roberto Cipollone, Anders Jonsson, Alessandro Ronca, Mohammad Sadegh Talebi. Offline RL in Regular Decision Processes: Sample Efficiency via Language Metrics. In The Thirteenth International Conference on Learning Representations, ICLR 2025, Singapore, April 24-28, 2025. OpenReview.net, 2025.
