Towards Optimal Preemptive GPU Time-Sharing for Edge Model Serving

Zhengxu Xia, Yitian Hao, Jun Duan, Chen Wang, Junchen Jiang. Towards Optimal Preemptive GPU Time-Sharing for Edge Model Serving. In Proceedings of the 9th International Workshop on Container Technologies and Container Clouds, WoC 2023, Bologna, Italy, December 11-15, 2023. pages 13-18, ACM, 2023. [doi]

Authors

Zhengxu Xia

This author has not been identified. Look up 'Zhengxu Xia' in Google

Yitian Hao

This author has not been identified. Look up 'Yitian Hao' in Google

Jun Duan

This author has not been identified. Look up 'Jun Duan' in Google

Chen Wang

This author has not been identified. Look up 'Chen Wang' in Google

Junchen Jiang

This author has not been identified. Look up 'Junchen Jiang' in Google