Partially observable environment estimation with uplift inference for reinforcement learning based recommendation

Wenjie Shang, Qingyang Li, Zhiwei Qin, Yang Yu, Yiping Meng, Jieping Ye. Partially observable environment estimation with uplift inference for reinforcement learning based recommendation. Machine Learning, 110(9):2603-2640, 2021.
