Tensile: Auto-Tuning GEMM GPU Assembly for All Problem Sizes

David E. Tanner. Tensile: Auto-Tuning GEMM GPU Assembly for All Problem Sizes. In 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPS Workshops 2018, Vancouver, BC, Canada, May 21-25, 2018. pages 1066-1075, IEEE Computer Society, 2018. [doi]

Abstract

Abstract is missing.