Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression

Jiduan Liu, Jiahao Liu, Qifan Wang, Jingang Wang, Xunliang Cai, Dongyan Zhao, Ran Wang, Rui Yan. Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression. In Houda Bouamor, Juan Pino, Kalika Bali, editors, Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023, pages 8643-8657. Association for Computational Linguistics, 2023.
