MMLU-CF: A Contamination-free Multi-task Language Understanding Benchmark

QiHao Zhao, Yangyu Huang, Tengchao Lv, Lei Cui 0001, Qinzheng Sun, Shaoguang Mao, Xin Zhang, Ying Xin, Qiufeng Yin, Scarlett Li, Furu Wei. MMLU-CF: A Contamination-free Multi-task Language Understanding Benchmark. In Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar, editors, Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2025, Vienna, Austria, July 27 - August 1, 2025. pages 13371-13391, Association for Computational Linguistics, 2025. [doi]

Abstract

Abstract is missing.