LOCP: Latency-optimized channel pruning for CNN inference acceleration on GPUs

Yonghua Zhang, Hongxu Jiang, Yuting Zhu, Runhua Zhang, Yongxiang Cao, Chenhui Zhu, Wei Wang, Dong Dong, Xiaobin Li. LOCP: Latency-optimized channel pruning for CNN inference acceleration on GPUs. The Journal of Supercomputing, 79(13):14313-14341, September 2023.

Authors

Yonghua Zhang

Hongxu Jiang

Yuting Zhu

Runhua Zhang

Yongxiang Cao

Chenhui Zhu

Wei Wang

Dong Dong

Xiaobin Li
