Jinchang Hou, Chang Ao, Haihong Wu, Xiangtao Kong, Zhigang Zheng, Daijia Tang, Chengming Li, Xiping Hu, Ruifeng Xu, Shiwen Ni, Min Yang. E-EVAL: A Comprehensive Chinese K-12 Education Evaluation Benchmark for Large Language Models. In Lun-Wei Ku, Andre Martins, Vivek Srikumar, editors, Findings of the Association for Computational Linguistics, ACL 2024, Bangkok, Thailand and virtual meeting, August 11-16, 2024, pages 7753-7774. Association for Computational Linguistics, 2024.