Romou: rapidly generate high-performance tensor kernels for mobile GPUs

Rendong Liang, Ting Cao, Jicheng Wen, Manni Wang, Yang Wang, Jianhua Zou, Yunxin Liu. Romou: rapidly generate high-performance tensor kernels for mobile GPUs. In ACM MobiCom '22: The 28th Annual International Conference on Mobile Computing and Networking, Sydney, NSW, Australia, October 17-21, 2022, pages 487-500. ACM, 2022.
