CNNBooster: Accelerating CNN Inference with Latency-aware Channel Pruning for GPU

Yuting Zhu, Hongxu Jiang, Runhua Zhang, Yonghua Zhang, Dong Dong. CNNBooster: Accelerating CNN Inference with Latency-aware Channel Pruning for GPU. In IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking, ISPA/BDCloud/SocialCom/SustainCom 2022, Melbourne, Australia, December 17-19, 2022. pages 355-362, IEEE, 2022. [doi]

Abstract

Abstract is missing.