Optimum: Runtime optimization for multiple mixed model deployment deep learning inference

Kaicheng Guo, Yixiao Xu, Zhengwei Qi, Haibing Guan. Optimum: Runtime optimization for multiple mixed model deployment deep learning inference. Journal of Systems Architecture, 141:102901, 2023. [doi]

@article{GuoXQG23,
  title = {Optimum: Runtime optimization for multiple mixed model deployment deep learning inference},
  author = {Kaicheng Guo and Yixiao Xu and Zhengwei Qi and Haibing Guan},
  year = {2023},
  doi = {10.1016/j.sysarc.2023.102901},
  url = {https://doi.org/10.1016/j.sysarc.2023.102901},
  researchr = {https://researchr.org/publication/GuoXQG23},
  cites = {0},
  citedby = {0},
  journal = {Journal of Systems Architecture},
  volume = {141},
  pages = {102901},
}