Auto-tuning Dense Matrix Multiplication for GPGPU with Cache

Xiang Cui, Yifeng Chen, Changyou Zhang, Hong Mei. Auto-tuning Dense Matrix Multiplication for GPGPU with Cache. In IEEE 16th International Conference on Parallel and Distributed Systems, ICPADS 2010, 8-10 Dec. 2010, Shanghai, China. pages 237-242, IEEE, 2010. [doi]

Abstract

Abstract is missing.