Offline reinforcement learning under value and density-ratio realizability: The power of gaps

Jinglin Chen, Nan Jiang 0008. Offline reinforcement learning under value and density-ratio realizability: The power of gaps. In James Cussens, Kun Zhang 0001, editors, Uncertainty in Artificial Intelligence, Proceedings of the Thirty-Eighth Conference on Uncertainty in Artificial Intelligence, UAI 2022, 1-5 August 2022, Eindhoven, The Netherlands. Volume 180 of Proceedings of Machine Learning Research, pages 378-388, PMLR, 2022. [doi]

Abstract

Abstract is missing.