CNNBooster: Accelerating CNN Inference with Latency-aware Channel Pruning for GPU

Yuting Zhu, Hongxu Jiang, Runhua Zhang, Yonghua Zhang, Dong Dong. CNNBooster: Accelerating CNN Inference with Latency-aware Channel Pruning for GPU. In IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking, ISPA/BDCloud/SocialCom/SustainCom 2022, Melbourne, Australia, December 17-19, 2022. pages 355-362, IEEE, 2022. [doi]

@inproceedings{ZhuJZZD22,
  title = {CNNBooster: Accelerating CNN Inference with Latency-aware Channel Pruning for GPU},
  author = {Yuting Zhu and Hongxu Jiang and Runhua Zhang and Yonghua Zhang and Dong Dong},
  year = {2022},
  doi = {10.1109/ISPA-BDCloud-SocialCom-SustainCom57177.2022.00052},
  url = {https://doi.org/10.1109/ISPA-BDCloud-SocialCom-SustainCom57177.2022.00052},
  researchr = {https://researchr.org/publication/ZhuJZZD22},
  cites = {0},
  citedby = {0},
  pages = {355-362},
  booktitle = {IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking, ISPA/BDCloud/SocialCom/SustainCom 2022, Melbourne, Australia, December 17-19, 2022},
  publisher = {IEEE},
  isbn = {978-1-6654-6497-0},
}