Automated Backend Allocation for Multi-Model, On-Device AI Inference

Venkatraman Iyer, Sungho Lee, Semun Lee, Juitem Joonwoo Kim, Hyunjun Kim, Youngjae Shin. Automated Backend Allocation for Multi-Model, On-Device AI Inference. In Michele Garetto, Andrea Marin, Florin Ciucu, Giulia Fanti, Rhonda Righter, editors, Abstracts of the 2024 ACM SIGMETRICS/IFIP PERFORMANCE Joint International Conference on Measurement and Modeling of Computer Systems, SIGMETRICS/PERFORMANCE 2024, Venice, Italy, June 10-14, 2024. pages 27-28, ACM, 2024. [doi]

@inproceedings{IyerLLKKS24,
  title = {Automated Backend Allocation for Multi-Model, On-Device AI Inference},
  author = {Venkatraman Iyer and Sungho Lee and Semun Lee and Juitem Joonwoo Kim and Hyunjun Kim and Youngjae Shin},
  year = {2024},
  doi = {10.1145/3652963.3655046},
  url = {https://doi.org/10.1145/3652963.3655046},
  researchr = {https://researchr.org/publication/IyerLLKKS24},
  cites = {0},
  citedby = {0},
  pages = {27-28},
  booktitle = {Abstracts of the 2024 ACM SIGMETRICS/IFIP PERFORMANCE Joint International Conference on Measurement and Modeling of Computer Systems, SIGMETRICS/PERFORMANCE 2024, Venice, Italy, June 10-14, 2024},
  editor = {Michele Garetto and Andrea Marin and Florin Ciucu and Giulia Fanti and Rhonda Righter},
  publisher = {ACM},
}