Lubana Al Rayes, Ashraf Elnagar. The Need for Robust and Inclusive Benchmarks in Evaluating LLMs on Arabic Text. In Mourad Abbas, Tariq Yousef, Lukas Galke, editors, Proceedings of the 8th International Conference on Natural Language and Speech Processing, ICNLSP 2025, Southern Denmark University, Odense, Denmark, August 25-27, 2025. pages 196-207, Association for Computational Linguistics, 2025. [doi]
Abstract is missing.