Efficient Transformer-based Large Scale Language Representations using Hardware-friendly Block Structured Pruning

Bingbing Li, Zhenglun Kong, Tianyun Zhang, Ji Li, Zhengang Li, Hang Liu, Caiwen Ding. Efficient Transformer-based Large Scale Language Representations using Hardware-friendly Block Structured Pruning. In Trevor Cohn, Yulan He, Yang Liu, editors, Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings, EMNLP 2020, Online Event, 16-20 November 2020, pages 3187-3199. Association for Computational Linguistics, 2020.
