BoN Appetit Team at LeWiDi-2025: Best-of-N Test-time Scaling Can Not Stomach Annotation Disagreements (Yet)

Tomás Ruiz, Siyao Peng, Barbara Plank, Carsten Schwemmer. BoN Appetit Team at LeWiDi-2025: Best-of-N Test-time Scaling Can Not Stomach Annotation Disagreements (Yet). In Gavin Abercrombie, Valerio Basile, Simona Frenda, Sara Tonelli, Shiran Dudy, editors, Proceedings of the The 4th Workshop on Perspectivist Approaches to NLP, NLPerspectives 2025, Suzhou, China, November 8, 2025. pages 153-170, Association for Computational Linguistics, 2025. [doi]

Abstract

Abstract is missing.