Partially observable environment estimation with uplift inference for reinforcement learning based recommendation

Wenjie Shang, Qingyang Li, Zhiwei Qin, Yang Yu 0001, Yiping Meng, Jieping Ye. Partially observable environment estimation with uplift inference for reinforcement learning based recommendation. Machine Learning, 110(9):2603-2640, 2021. [doi]

Authors

Wenjie Shang

This author has not been identified. Look up 'Wenjie Shang' in Google

Qingyang Li

This author has not been identified. Look up 'Qingyang Li' in Google

Zhiwei Qin

This author has not been identified. Look up 'Zhiwei Qin' in Google

Yang Yu 0001

This author has not been identified. Look up 'Yang Yu 0001' in Google

Yiping Meng

This author has not been identified. Look up 'Yiping Meng' in Google

Jieping Ye

This author has not been identified. Look up 'Jieping Ye' in Google