Batch: machine learning inference serving on serverless platforms with adaptive batching

Ahsan Ali, Riccardo Pinciroli, Feng Yan 0001, Evgenia Smirni. Batch: machine learning inference serving on serverless platforms with adaptive batching. In Christine Cuicchi, Irene Qualters, William T. Kramer, editors, SC '20: The International Conference for High Performance Computing, Networking, Storage and Analysis, Virtual Event / Atlanta, Georgia, USA, November 9-19, 2020. pages 69, IEEE/ACM, 2020. [doi]

Abstract

Abstract is missing.