Accurate Latency Prediction of Deep Learning Model Inference Under Dynamic Runtime Resource

Haihong She, Yigui Luo, Zhaohong Xiang, Weiming Liang, Yin Xie. Accurate Latency Prediction of Deep Learning Model Inference Under Dynamic Runtime Resource. In Biao Luo, Long Cheng 0001, Zheng-Guang Wu, Hongyi Li 0001, Chaojie Li, editors, Neural Information Processing - 30th International Conference, ICONIP 2023, Changsha, China, November 20-23, 2023, Proceedings, Part VII. Volume 1961 of Communications in Computer and Information Science, pages 495-510, Springer, 2023. [doi]

Abstract

Abstract is missing.