Ziyi Han, Ruiting Zhou, Chengzhong Xu, Yifan Zeng, Renli Zhang. InSS: An Intelligent Scheduling Orchestrator for Multi-GPU Inference With Spatio-Temporal Sharing. IEEE Trans. Parallel Distrib. Syst., 35(10):1735-1748, October 2024.