Hong Guo, Nianhui Guo, Christoph Meinel, Haojin Yang. Low-bit CUTLASS GEMM Template Auto-tuning using Neural Network. In IEEE International Symposium on Parallel and Distributed Processing with Applications, ISPA 2024, Kaifeng, China, October 30 - Nov. 2, 2024. pages 394-401, IEEE, 2024. [doi]
Abstract is missing.