Probing the Efficacy of Hardware-Aware Weight Pruning to Optimize the SpMM Routine on Ampere GPUs

Roberto L. Castro, Diego Andrade, Basilio B. Fraguela. Probing the Efficacy of Hardware-Aware Weight Pruning to Optimize the SpMM Routine on Ampere GPUs. In Andreas Klöckner, José Moreira, editors, Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, PACT 2022, Chicago, Illinois, October 8-12, 2022. pages 135-147, ACM, 2022. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.