Performance Engineering for a Tall & Skinny Matrix Multiplication Kernels on GPUs

Dominik Ernst, Georg Hager, Jonas Thies, Gerhard Wellein. Performance Engineering for a Tall & Skinny Matrix Multiplication Kernels on GPUs. In Roman Wyrzykowski, Ewa Deelman, Jack J. Dongarra, Konrad Karczewski, editors, Parallel Processing and Applied Mathematics - 13th International Conference, PPAM 2019, Bialystok, Poland, September 8-11, 2019, Revised Selected Papers, Part I. Volume 12043 of Lecture Notes in Computer Science, pages 505-515, Springer, 2019. [doi]

Abstract

Abstract is missing.