Optimum: Runtime optimization for multiple mixed model deployment deep learning inference

Kaicheng Guo, Yixiao Xu, Zhengwei Qi, Haibing Guan. Optimum: Runtime optimization for multiple mixed model deployment deep learning inference. Journal of Systems Architecture, 141:102901, 2023. [doi]

Abstract

Abstract is missing.