Shfl-BW: accelerating deep neural network inference with tensor-core aware weight pruning

Guyue Huang, Haoran Li, Minghai Qin, Fei Sun, Yufei Ding, Yuan Xie. Shfl-BW: accelerating deep neural network inference with tensor-core aware weight pruning. In Rob Oshana, editor, DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10 - 14, 2022. pages 1153-1158, ACM, 2022. [doi]

Authors

Guyue Huang

This author has not been identified. Look up 'Guyue Huang' in Google

Haoran Li

This author has not been identified. Look up 'Haoran Li' in Google

Minghai Qin

This author has not been identified. Look up 'Minghai Qin' in Google

Fei Sun

This author has not been identified. Look up 'Fei Sun' in Google

Yufei Ding

This author has not been identified. Look up 'Yufei Ding' in Google

Yuan Xie

This author has not been identified. Look up 'Yuan Xie' in Google