Are LLM-Judges Robust to Expressions of Uncertainty? Investigating the effect of Epistemic Markers on LLM-based Evaluation

Dongryeol Lee, Yerin Hwang, Yongil Kim, Joonsuk Park, Kyomin Jung. Are LLM-Judges Robust to Expressions of Uncertainty? Investigating the effect of Epistemic Markers on LLM-based Evaluation. In Luis Chiruzzo, Alan Ritter, Lu Wang, editors, Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2025 - Volume 1: Long Papers, Albuquerque, New Mexico, USA, April 29 - May 4, 2025. pages 8962-8984, Association for Computational Linguistics, 2025. [doi]

@inproceedings{LeeHKPJ25,
  title = {Are LLM-Judges Robust to Expressions of Uncertainty? Investigating the effect of Epistemic Markers on LLM-based Evaluation},
  author = {Dongryeol Lee and Yerin Hwang and Yongil Kim and Joonsuk Park and Kyomin Jung},
  year = {2025},
  url = {https://aclanthology.org/2025.naacl-long.452/},
  researchr = {https://researchr.org/publication/LeeHKPJ25},
  cites = {0},
  citedby = {0},
  pages = {8962-8984},
  booktitle = {Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2025 - Volume 1: Long Papers, Albuquerque, New Mexico, USA, April 29 - May 4, 2025},
  editor = {Luis Chiruzzo and Alan Ritter and Lu Wang},
  publisher = {Association for Computational Linguistics},
  isbn = {979-8-89176-189-6},
}