Swayam: distributed autoscaling to meet SLAs of machine learning inference services with resource efficiency

Arpan Gujarati, Sameh Elnikety, Yuxiong He, Kathryn S. McKinley, Björn B. Brandenburg. Swayam: distributed autoscaling to meet SLAs of machine learning inference services with resource efficiency. In K. R. Jayaram, Anshul Gandhi, Bettina Kemme, Peter R. Pietzuch, editors, Proceedings of the 18th ACM/IFIP/USENIX Middleware Conference, Las Vegas, NV, USA, December 11-15, 2017, pages 109-120. ACM, 2017.
