Venkatraman Iyer, Sungho Lee, Semun Lee, Juitem Joonwoo Kim, Hyunjun Kim, Youngjae Shin. Automated Backend Allocation for Multi-Model, On-Device AI Inference. In Michele Garetto, Andrea Marin, Florin Ciucu, Giulia Fanti, Rhonda Righter, editors, Abstracts of the 2024 ACM SIGMETRICS/IFIP PERFORMANCE Joint International Conference on Measurement and Modeling of Computer Systems, SIGMETRICS/PERFORMANCE 2024, Venice, Italy, June 10-14, 2024. pages 27-28, ACM, 2024. [doi]
Abstract is missing.