Generating Q&A Benchmarks for RAG Evaluation in Enterprise Settings

Simone Filice, Guy Horowitz, David Carmel, Zohar S. Karnin, Liane Lewin-Eytan, Yoelle Maarek. Generating Q&A Benchmarks for RAG Evaluation in Enterprise Settings. In Georg Rehm, Yunyao Li 0001, editors, Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track), ACL 2025, Vienna, Austria, July 27 - August 1, 2025. pages 469-484, Association for Computational Linguistics, 2025. [doi]

Authors

Simone Filice

This author has not been identified. Look up 'Simone Filice' in Google

Guy Horowitz

This author has not been identified. Look up 'Guy Horowitz' in Google

David Carmel

This author has not been identified. It may be one of the following persons: Look up 'David Carmel' in Google

Zohar S. Karnin

This author has not been identified. Look up 'Zohar S. Karnin' in Google

Liane Lewin-Eytan

This author has not been identified. Look up 'Liane Lewin-Eytan' in Google

Yoelle Maarek

This author has not been identified. Look up 'Yoelle Maarek' in Google