A Portable and High-Performance General Matrix-Multiply (GEMM) Library for GPUs and Single-Chip CPU/GPU Systems

Rahul Garg, Laurie J. Hendren. A Portable and High-Performance General Matrix-Multiply (GEMM) Library for GPUs and Single-Chip CPU/GPU Systems. In 22nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, PDP 2014, Torino, Italy, February 12-14, 2014. pages 672-680, IEEE, 2014.

Authors

Rahul Garg

Laurie J. Hendren
