Sequential Search with Off-Policy Reinforcement Learning

Dadong Miao, Yanan Wang, Guoyu Tang, Lin Liu, Sulong Xu, Bo Long, Yun Xiao, Lingfei Wu, Yunjiang Jiang. Sequential Search with Off-Policy Reinforcement Learning. In Gianluca Demartini, Guido Zuccon, J. Shane Culpepper, Zi Huang, Hanghang Tong, editors, CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1-5, 2021, pages 4006-4015. ACM, 2021.
