Value Penalized Q-Learning for Recommender Systems

Chengqian Gao, Ke Xu, Kuangqi Zhou, Lanqing Li, Xueqian Wang, Bo Yuan, Peilin Zhao. Value Penalized Q-Learning for Recommender Systems. In Enrique Amigó, Pablo Castells, Julio Gonzalo, Ben Carterette, J. Shane Culpepper, Gabriella Kazai, editors, SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11-15, 2022. Pages 2008-2012, ACM, 2022.

Authors

Chengqian Gao

Ke Xu

Kuangqi Zhou

Lanqing Li

Xueqian Wang

Bo Yuan

Peilin Zhao
