Case2Code: Scalable Synthetic Data for Code Generation

Yunfan Shao, Linyang Li, Yichuan Ma, Peiji Li, Demin Song, Qinyuan Cheng, Shimin Li, Xiaonan Li, Pengyu Wang, Qipeng Guo, Hang Yan, Xipeng Qiu, Xuanjing Huang, Dahua Lin. Case2Code: Scalable Synthetic Data for Code Generation. In Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert, editors, Proceedings of the 31st International Conference on Computational Linguistics, COLING 2025, Abu Dhabi, UAE, January 19-24, 2025, pages 11056-11069. Association for Computational Linguistics, 2025.
