Performance Engineering for a Tall & Skinny Matrix Multiplication Kernels on GPUs

Dominik Ernst, Georg Hager, Jonas Thies, Gerhard Wellein. Performance Engineering for a Tall & Skinny Matrix Multiplication Kernels on GPUs. In Roman Wyrzykowski, Ewa Deelman, Jack J. Dongarra, Konrad Karczewski, editors, Parallel Processing and Applied Mathematics - 13th International Conference, PPAM 2019, Bialystok, Poland, September 8-11, 2019, Revised Selected Papers, Part I. Volume 12043 of Lecture Notes in Computer Science, pages 505-515, Springer, 2019. [doi]

Authors

Dominik Ernst

This author has not been identified. Look up 'Dominik Ernst' in Google

Georg Hager

This author has not been identified. Look up 'Georg Hager' in Google

Jonas Thies

This author has not been identified. Look up 'Jonas Thies' in Google

Gerhard Wellein

This author has not been identified. Look up 'Gerhard Wellein' in Google