Latency-Based Inter-Operator Scheduling for CNN Inference Acceleration on GPU

Yukai Ping, He Jiang, Xingxiang Liu, Zhenyang Zhao, Zhide Zhou, Xin Chen. Latency-Based Inter-Operator Scheduling for CNN Inference Acceleration on GPU. IEEE Transactions on Services Computing, 17(1):277-290, January-February 2024.

Abstract

Abstract is missing.