ZipLM: Inference-Aware Structured Pruning of Language Models

Eldar Kurtic, Elias Frantar, Dan Alistarh. ZipLM: Inference-Aware Structured Pruning of Language Models. In Alice Oh, Tristan Naumann, Amir Globerson, Kate Saenko, Moritz Hardt, Sergey Levine, editors, Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, NeurIPS 2023, New Orleans, LA, USA, December 10 - 16, 2023. 2023. [doi]

Authors

Eldar Kurtic

This author has not been identified. Look up 'Eldar Kurtic' in Google

Elias Frantar

This author has not been identified. Look up 'Elias Frantar' in Google

Dan Alistarh

This author has not been identified. Look up 'Dan Alistarh' in Google