Is C4 Dataset Optimal for Pruning? An Investigation of Calibration Data for LLM Pruning

Abhinav Bandari, Lu Yin 0006, Cheng-Yu Hsieh, Ajay Jaiswal, Tianlong Chen, Li Shen 0008, Ranjay Krishna, Shiwei Liu 0003. Is C4 Dataset Optimal for Pruning? An Investigation of Calibration Data for LLM Pruning. In Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen, editors, Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024, Miami, FL, USA, November 12-16, 2024. pages 18089-18099, Association for Computational Linguistics, 2024. [doi]

Abstract

Abstract is missing.