Accelerated Auto-Tuning of GPU Kernels for Tensor Computations

Chendi Li, Yufan Xu, Sina Mahdipour Saravani, Ponnuswamy Sadayappan. Accelerated Auto-Tuning of GPU Kernels for Tensor Computations. In Kenji Kise, Valentina Salapura, Murali Annavaram, Ana Lucia Varbanescu, editors, Proceedings of the 38th ACM International Conference on Supercomputing, ICS 2024, Kyoto, Japan, June 4-7, 2024. pages 549-561, ACM, 2024. [doi]

Abstract

Abstract is missing.