Rethinking Ground Truth: A Case Study on Human Label Variation in MLLM Benchmarking

Tomás Ruiz, Tanalp Agustoslu, Carsten Schwemmer. Rethinking Ground Truth: A Case Study on Human Label Variation in MLLM Benchmarking. In IEEE International Conference on Big Data, BigData 2025, Macau, China, December 8-11, 2025. pages 5224-5229, IEEE, 2025. [doi]

Authors

Tomás Ruiz

This author has not been identified. Look up 'Tomás Ruiz' in Google

Tanalp Agustoslu

This author has not been identified. Look up 'Tanalp Agustoslu' in Google

Carsten Schwemmer

This author has not been identified. Look up 'Carsten Schwemmer' in Google