Auto-tuning Dense Matrix Multiplication for GPGPU with Cache

Xiang Cui, Yifeng Chen, Changyou Zhang, Hong Mei. Auto-tuning Dense Matrix Multiplication for GPGPU with Cache. In IEEE 16th International Conference on Parallel and Distributed Systems, ICPADS 2010, 8-10 Dec. 2010, Shanghai, China. pages 237-242, IEEE, 2010. [doi]

Authors

Xiang Cui

This author has not been identified. Look up 'Xiang Cui' in Google

Yifeng Chen

This author has not been identified. Look up 'Yifeng Chen' in Google

Changyou Zhang

This author has not been identified. Look up 'Changyou Zhang' in Google

Hong Mei

This author has not been identified. Look up 'Hong Mei' in Google