The Need for Robust and Inclusive Benchmarks in Evaluating LLMs on Arabic Text

Lubana Al Rayes, Ashraf Elnagar. The Need for Robust and Inclusive Benchmarks in Evaluating LLMs on Arabic Text. In Mourad Abbas, Tariq Yousef, Lukas Galke, editors, Proceedings of the 8th International Conference on Natural Language and Speech Processing, ICNLSP 2025, Southern Denmark University, Odense, Denmark, August 25-27, 2025. Pages 196-207, Association for Computational Linguistics, 2025.

Authors

Lubana Al Rayes


Ashraf Elnagar
