Probing the Efficacy of Hardware-Aware Weight Pruning to Optimize the SpMM Routine on Ampere GPUs

Roberto L. Castro, Diego Andrade, Basilio B. Fraguela. Probing the Efficacy of Hardware-Aware Weight Pruning to Optimize the SpMM Routine on Ampere GPUs. In Andreas Klöckner, José Moreira, editors, Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, PACT 2022, Chicago, Illinois, October 8-12, 2022, pages 135-147. ACM, 2022. doi: 10.1145/3559009.3569691

@inproceedings{CastroAF22,
  title = {Probing the Efficacy of Hardware-Aware Weight Pruning to Optimize the SpMM Routine on Ampere GPUs},
  author = {Roberto L. Castro and Diego Andrade and Basilio B. Fraguela},
  year = {2022},
  doi = {10.1145/3559009.3569691},
  url = {https://doi.org/10.1145/3559009.3569691},
  researchr = {https://researchr.org/publication/CastroAF22},
  pages = {135--147},
  booktitle = {Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, PACT 2022, Chicago, Illinois, October 8-12, 2022},
  editor = {Andreas Klöckner and José Moreira},
  publisher = {ACM},
  isbn = {978-1-4503-9868-8},
}