CodeCleaner: Mitigating Data Contamination for LLM Benchmarking

Jialun Cao, Songqiang Chen, Wuqi Zhang, Hau Ching Lo, Yeting Li, Shing-Chi Cheung. CodeCleaner: Mitigating Data Contamination for LLM Benchmarking. In Hong Mei 0001, Jian Lv 0001, Zhi Jin 0001, Xuandong Li, Thomas Zimmermann 0001, Ge Li 0001, Lei Bu, Xin Xia 0001, editors, Proceedings of the 16th International Conference on Internetware, Internetware 2025, Trondheim, Norway, June 20-22, 2025. pages 71-83, ACM, 2025. [doi]

Abstract

Abstract is missing.