CNNBooster: Accelerating CNN Inference with Latency-aware Channel Pruning for GPU

Yuting Zhu, Hongxu Jiang, Runhua Zhang, Yonghua Zhang, Dong Dong. CNNBooster: Accelerating CNN Inference with Latency-aware Channel Pruning for GPU. In IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking, ISPA/BDCloud/SocialCom/SustainCom 2022, Melbourne, Australia, December 17-19, 2022. pages 355-362, IEEE, 2022.

Authors

Yuting Zhu

Hongxu Jiang

Runhua Zhang

Yonghua Zhang

Dong Dong
