Re-compact: Structured Pruning and SpMM Kernel Co-design for Accelerating DNNs on GPUs

Yuling Zhang, Ao Ren, Xianzhang Chen, Qiu Lin, Yujuan Tan, Duo Liu. Re-compact: Structured Pruning and SpMM Kernel Co-design for Accelerating DNNs on GPUs. In 41st IEEE International Conference on Computer Design, ICCD 2023, Washington, DC, USA, November 6-8, 2023. pages 399-406, IEEE, 2023. [doi]

Abstract

Abstract is missing.