Sequential Search with Off-Policy Reinforcement Learning

Dadong Miao, Yanan Wang, Guoyu Tang, Lin Liu, Sulong Xu, Bo Long, Yun Xiao, Lingfei Wu, Yunjiang Jiang. Sequential Search with Off-Policy Reinforcement Learning. In Gianluca Demartini, Guido Zuccon, J. Shane Culpepper, Zi Huang, Hanghang Tong, editors, CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1-5, 2021, pages 4006-4015. ACM, 2021.

Authors

Dadong Miao

Yanan Wang

Guoyu Tang

Lin Liu

Sulong Xu

Bo Long

Yun Xiao

Lingfei Wu

Yunjiang Jiang