The following publications are possibly variants of this publication:
- Doubly Robust Off-policy Value Evaluation for Reinforcement Learning. Nan Jiang, Lihong Li. icml 2016: 652-661 [doi]
- Towards Robust and Safe Reinforcement Learning with Benign Off-policy Data. Zuxin Liu, Zijian Guo, Zhepeng Cen, Huan Zhang, Yihang Yao, Hanjiang Hu, Ding Zhao. icml 2023: 21586-21610 [doi]
- Learning Optimal Compact Codebook for Efficient Object Categorization. Teng Li, Tao Mei, In-So Kweon. wacv 2008: 1-6 [doi]
- Off-Policy Reinforcement Learning for Efficient and Effective GAN Architecture Search. Yuan Tian, Qin Wang, Zhiwu Huang, Wen Li, Dengxin Dai, Minghao Yang, Jun Wang 0012, Olga Fink. eccv 2020: 175-192 [doi]