Towards Efficient Sparse Deep Neural Network Inference via Multi-level Concurrency Orchestration

Ming Dun, Jie Zhou, Huawei Cao, Shuhan Song, Yiming Sun, Mingyu Yan, Xiaochun Ye. Towards Efficient Sparse Deep Neural Network Inference via Multi-level Concurrency Orchestration. In IEEE High Performance Extreme Computing Conference, HPEC 2025, Wakefield, MA, USA, September 15-19, 2025, pages 1-7. IEEE, 2025.

@inproceedings{DunZCSSYY25,
  title = {Towards Efficient Sparse Deep Neural Network Inference via Multi-level Concurrency Orchestration},
  author = {Ming Dun and Jie Zhou and Huawei Cao and Shuhan Song and Yiming Sun and Mingyu Yan and Xiaochun Ye},
  year = {2025},
  doi = {10.1109/HPEC67600.2025.11196684},
  url = {https://doi.org/10.1109/HPEC67600.2025.11196684},
  researchr = {https://researchr.org/publication/DunZCSSYY25},
  cites = {0},
  citedby = {0},
  pages = {1--7},
  booktitle = {IEEE High Performance Extreme Computing Conference, HPEC 2025, Wakefield, MA, USA, September 15-19, 2025},
  publisher = {IEEE},
  isbn = {979-8-3315-7844-2},
}