Cocktail: A Multidimensional Optimization for Model Serving in Cloud

Jashwant Raj Gunasekaran, Cyan Subhra Mishra, Prashanth Thinakaran, Bikash Sharma, Mahmut Taylan Kandemir, Chita R. Das. Cocktail: A Multidimensional Optimization for Model Serving in Cloud. In Amar Phanishayee, Vyas Sekar, editors, 19th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2022, Renton, WA, USA, April 4-6, 2022. pages 1041-1057, USENIX Association, 2022.
