Qualitative Diagnosis of LLMs as Judges Using LevelEval

Mallika Boyapati, Lokesh Meesala, Ramazan Aygun, Bill Franks, Hansook Choi, Sereres Riordan, Girish Modgil. Qualitative Diagnosis of LLMs as Judges Using LevelEval. In M. Arif Wani, Plamen Angelov 0001, Feng Luo, Mitsunori Ogihara, Xintao Wu, Radu-Emil Precup, Ramin Ramezani, Xiaowei Gu, editors, International Conference on Machine Learning and Applications, ICMLA 2024, Miami, FL, USA, December 18-20, 2024. pages 206-213, IEEE, 2024. [doi]

Abstract

Abstract is missing.