Structured Optimal Brain Pruning for Large Language Models

Jiateng Wei, Quan Lu, Ning Jiang, Siqi Li, Jingyang Xiang, Jun Chen 0023, Yong Liu 0007. Structured Optimal Brain Pruning for Large Language Models. In Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen, editors, Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024, Miami, FL, USA, November 12-16, 2024. pages 13991-14007, Association for Computational Linguistics, 2024. [doi]

Abstract

Abstract is missing.