Zheng Yuan, Qian Wan, Tao Zhang, Chengfu Huo. PISDR: Page and Item Sequential Decision for Re-ranking Based on Offline Reinforcement Learning. In Vu Nguyen, Hsuan-Tien Lin, editors, Asian Conference on Machine Learning, 5-8 December 2024, Hanoi, Vietnam. Volume 260 of Proceedings of Machine Learning Research, pages 829-844, PMLR, 2024.