Olympian: Scheduling GPU Usage in a Deep Neural Network Model Serving System

Yitao Hu, Swati Rallapalli, Bongjun Ko, Ramesh Govindan. Olympian: Scheduling GPU Usage in a Deep Neural Network Model Serving System. In Paulo Ferreira 0001, Liuba Shrira, editors, Proceedings of the 19th International Middleware Conference, Middleware 2018, Rennes, France, December 10-14, 2018. pages 53-65, ACM, 2018. [doi]

Abstract

Abstract is missing.