A Portable and High-Performance General Matrix-Multiply (GEMM) Library for GPUs and Single-Chip CPU/GPU Systems

Rahul Garg, Laurie J. Hendren. A Portable and High-Performance General Matrix-Multiply (GEMM) Library for GPUs and Single-Chip CPU/GPU Systems. In 22nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, PDP 2014, Torino, Italy, February 12-14, 2014. Pages 672-680, IEEE, 2014. doi:10.1109/PDP.2014.40

@inproceedings{GargH14,
  title = {A Portable and High-Performance General Matrix-Multiply (GEMM) Library for GPUs and Single-Chip CPU/GPU Systems},
  author = {Rahul Garg and Laurie J. Hendren},
  year = {2014},
  doi = {10.1109/PDP.2014.40},
  url = {http://doi.ieeecomputersociety.org/10.1109/PDP.2014.40},
  researchr = {https://researchr.org/publication/GargH14},
  cites = {0},
  citedby = {0},
  pages = {672--680},
  booktitle = {22nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, PDP 2014, Torino, Italy, February 12-14, 2014},
  publisher = {IEEE},
}