Reducing Human Effort to Validate LLM Relevance Judgements via Stratified Sampling

Simone Merlo, Stefano Marchesin 0001, Guglielmo Faggioli, Nicola Ferro 0001. Reducing Human Effort to Validate LLM Relevance Judgements via Stratified Sampling. In Ricardo Campos 0001, Adam Jatowt, Yanyan Lan, Mohammad Aliannejadi, Christine Bauer 0001, Sean MacAvaney, Avishek Anand, Zhaochun Ren, Suzan Verberne, Nan Bai, Masoud Mansoury, editors, Advances in Information Retrieval - 48th European Conference on Information Retrieval, ECIR 2026, Delft, The Netherlands, March 29 - April 2, 2026, Proceedings, Part I. Volume 16483 of Lecture Notes in Computer Science, pages 418-433, Springer, 2026. [doi]

Abstract

Abstract is missing.