The Need for Robust and Inclusive Benchmarks in Evaluating LLMs on Arabic Text

Lubana Al Rayes, Ashraf Elnagar. The Need for Robust and Inclusive Benchmarks in Evaluating LLMs on Arabic Text. In Mourad Abbas, Tariq Yousef, Lukas Galke, editors, Proceedings of the 8th International Conference on Natural Language and Speech Processing, ICNLSP 2025, Southern Denmark University, Odense, Denmark, August 25-27, 2025. pages 196-207, Association for Computational Linguistics, 2025. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.