Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction

Haruka Kiyohara, Masahiro Nomura, Yuta Saito. Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction. In Tat-Seng Chua, Chong-Wah Ngo, Ravi Kumar 0001, Hady W. Lauw, Roy Ka-Wei Lee, editors, Proceedings of the ACM on Web Conference 2024, WWW 2024, Singapore, May 13-17, 2024. pages 3150-3161, ACM, 2024. [doi]

Bibliographies