Anas Himmi, Ekhine Irurozki, Nathan Noiry, Stéphan Clémençon, Pierre Colombo. Towards More Robust NLP System Evaluation: Handling Missing Scores in Benchmarks. In Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen, editors, Findings of the Association for Computational Linguistics: EMNLP 2024, Miami, Florida, USA, November 12-16, 2024. pages 11759-11785, Association for Computational Linguistics, 2024. [doi]
Abstract is missing.