The following publications are possibly variants of this publication:
- GradientDICE: Rethinking Generalized Offline Estimation of Stationary ValuesShangtong Zhang, Bo Liu, Shimon Whiteson. icml 2020: 11194-11203 [doi]
- OptiDICE: Offline Policy Optimization via Stationary Distribution Correction EstimationJongmin Lee 0004, Wonseok Jeon, Byung-Jun Lee 0001, Joelle Pineau, Kee-Eung Kim. icml 2021: 6120-6130 [doi]
- Conservative State Value Estimation for Offline Reinforcement LearningLiting Chen, Jie Yan, Zhengdao Shao, Lu Wang, Qingwei Lin, Saravanakumar Rajmohan, Thomas Moscibroda, Dongmei Zhang. nips 2023: [doi]