Latency-Based Inter-Operator Scheduling for CNN Inference Acceleration on GPU

Yukai Ping, He Jiang, Xingxiang Liu, Zhenyang Zhao, Zhide Zhou, Xin Chen. Latency-Based Inter-Operator Scheduling for CNN Inference Acceleration on GPU. IEEE Transactions on Services Computing, 17(1):277-290, January-February 2024. doi:10.1109/TSC.2023.3345952

@article{PingJLZZC24,
  title = {Latency-Based Inter-Operator Scheduling for CNN Inference Acceleration on GPU},
  author = {Yukai Ping and He Jiang and Xingxiang Liu and Zhenyang Zhao and Zhide Zhou and Xin Chen},
  year = {2024},
  month = {January - February},
  doi = {10.1109/TSC.2023.3345952},
  url = {https://doi.org/10.1109/TSC.2023.3345952},
  researchr = {https://researchr.org/publication/PingJLZZC24},
  cites = {0},
  citedby = {0},
  journal = {IEEE Transactions on Services Computing},
  volume = {17},
  number = {1},
  pages = {277--290},
}