Martín Santillán Cooper, Zahra Ashktorab, Hyo Jin Do, Erik Miehling, Werner Geyer, Jasmina Gajcin, Elizabeth M. Daly, Qian Pan, Michael Desmond. Synthetic Data for Evaluation: Supporting LLM-as-a-Judge Workflows with EvalAssist. In Ivan Habernal, Peter Schulam 0001, Jörg Tiedemann, editors, Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, EMNLP 2025 - System Demonstrations, Suzhou, China, November 4-9, 2025. pages 1-11, Association for Computational Linguistics, 2025. [doi]
Abstract is missing.