Minimal Distillation Schedule for Extreme Language Model Compression

Chen Zhang, Yang Yang, Qifan Wang, Jiahao Liu, Jingang Wang, Wei Wu 0014, Dawei Song 0001. Minimal Distillation Schedule for Extreme Language Model Compression. In Yvette Graham, Matthew Purver, editors, Findings of the Association for Computational Linguistics: EACL 2024, St. Julian's, Malta, March 17-22, 2024. pages 1378-1394, Association for Computational Linguistics, 2024. [doi]

Abstract

Abstract is missing.