Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression

Jiduan Liu, Jiahao Liu, Qifan Wang, Jingang Wang, Xunliang Cai, Dongyan Zhao 0001, Ran Wang, Rui Yan 0001. Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression. In Houda Bouamor, Juan Pino 0001, Kalika Bali, editors, Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023. pages 8643-8657, Association for Computational Linguistics, 2023. [doi]

Authors

Jiduan Liu

This author has not been identified. Look up 'Jiduan Liu' in Google

Jiahao Liu

This author has not been identified. Look up 'Jiahao Liu' in Google

Qifan Wang

This author has not been identified. Look up 'Qifan Wang' in Google

Jingang Wang

This author has not been identified. Look up 'Jingang Wang' in Google

Xunliang Cai

This author has not been identified. Look up 'Xunliang Cai' in Google

Dongyan Zhao 0001

This author has not been identified. Look up 'Dongyan Zhao 0001' in Google

Ran Wang

This author has not been identified. Look up 'Ran Wang' in Google

Rui Yan 0001

This author has not been identified. Look up 'Rui Yan 0001' in Google