The following publications are possibly variants of this publication:
- Explanation Guided Knowledge Distillation for Pre-trained Language Model Compression. Zhao Yang, Yuanzhe Zhang, Dianbo Sui, Yiming Ju, Jun Zhao 0001, Kang Liu 0001. talip, 23(2), February 2024. [doi]
- ReAugKD: Retrieval-Augmented Knowledge Distillation For Pre-trained Language Models. Jianyi Zhang, Aashiq Muhamed, Aditya Anantharaman, Guoyin Wang 0002, Changyou Chen, Kai Zhong, Qingjun Cui, Yi Xu, Belinda Zeng, Trishul Chilimbi, Yiran Chen 0001. acl 2023: 1128-1136. [doi]
- KroneckerBERT: Significant Compression of Pre-trained Language Models Through Kronecker Decomposition and Knowledge Distillation. Marzieh S. Tahaei, Ella Charlaix, Vahid Partovi Nia, Ali Ghodsi 0001, Mehdi Rezagholizadeh. naacl 2022: 2116-2127. [doi]
- PANLP at MEDIQA 2019: Pre-trained Language Models, Transfer Learning and Knowledge Distillation. Wei Zhu, Xiaofeng Zhou, Keqiang Wang, Xun Luo, Xiepeng Li, Yuan Ni, Guotong Xie. bionlp 2019: 380-388. [doi]