LOCP: Latency-optimized channel pruning for CNN inference acceleration on GPUs

Yonghua Zhang, Hongxu Jiang, Yuting Zhu, Runhua Zhang, Yongxiang Cao, Chenhui Zhu, Wei Wang, Dong Dong, Xiaobin Li. LOCP: Latency-optimized channel pruning for CNN inference acceleration on GPUs. The Journal of Supercomputing, 79(13):14313-14341, September 2023. [doi]

Abstract

Abstract is missing.