Towards Non-task-specific Distillation of BERT via Sentence Representation Approximation

Bowen Wu, Huan Zhang, Mengyuan Li, Zongsheng Wang, Qihang Feng, Junhong Huang, Baoxun Wang. Towards Non-task-specific Distillation of BERT via Sentence Representation Approximation. In Kam-Fai Wong, Kevin Knight, Hua Wu, editors, Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, AACL/IJCNLP 2020, Suzhou, China, December 4-7, 2020. pages 70-79, Association for Computational Linguistics, 2020. [doi]

Abstract

Abstract is missing.