Towards Efficient Sparse Deep Neural Network Inference via Multi-level Concurrency Orchestration

Ming Dun, Jie Zhou, Huawei Cao, Shuhan Song, Yiming Sun, Mingyu Yan, Xiaochun Ye. Towards Efficient Sparse Deep Neural Network Inference via Multi-level Concurrency Orchestration. In IEEE High Performance Extreme Computing Conference, HPEC 2025, Wakefield, MA, USA, September 15-19, 2025. pages 1-7, IEEE, 2025. [doi]

Authors

Ming Dun

This author has not been identified. Look up 'Ming Dun' in Google

Jie Zhou

This author has not been identified. Look up 'Jie Zhou' in Google

Huawei Cao

This author has not been identified. Look up 'Huawei Cao' in Google

Shuhan Song

This author has not been identified. Look up 'Shuhan Song' in Google

Yiming Sun

This author has not been identified. Look up 'Yiming Sun' in Google

Mingyu Yan

This author has not been identified. Look up 'Mingyu Yan' in Google

Xiaochun Ye

This author has not been identified. Look up 'Xiaochun Ye' in Google