A Survey on Evaluation of Large Language Models

Yupeng Chang, Xu Wang, Jindong Wang, Yuan Wu, Linyi Yang, Kaijie Zhu, Hao Chen, Xiaoyuan Yi, Cunxiang Wang, Yidong Wang, Wei Ye, Yue Zhang, Yi Chang, Philip S. Yu, Qiang Yang, Xing Xie. A Survey on Evaluation of Large Language Models. ACM TIST, 15(3), June 2024.
