BERT-EMD: Many-to-Many Layer Mapping for BERT Compression with Earth Mover's Distance

Jianquan Li, Xiaokang Liu, Honghong Zhao, Ruifeng Xu, Min Yang, Yaohong Jin. BERT-EMD: Many-to-Many Layer Mapping for BERT Compression with Earth Mover's Distance. In Bonnie Webber, Trevor Cohn, Yulan He, Yang Liu, editors, Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, EMNLP 2020, Online, November 16-20, 2020. pages 3009-3018, Association for Computational Linguistics, 2020. [doi]

Abstract

Abstract is missing.