Wenjie Shang, Qingyang Li, Zhiwei Qin, Yang Yu, Yiping Meng, Jieping Ye. Partially observable environment estimation with uplift inference for reinforcement learning based recommendation. Machine Learning, 110(9):2603-2640, 2021.