RIBBON: cost-effective and qos-aware deep learning model inference using a diverse pool of cloud computing instances

Baolin Li, Rohan Basu Roy, Tirthak Patel, Vijay Gadepally, Karen Gettings, Devesh Tiwari. RIBBON: cost-effective and qos-aware deep learning model inference using a diverse pool of cloud computing instances. In Bronis R. de Supinski, Mary W. Hall, Todd Gamblin, editors, SC '21: The International Conference for High Performance Computing, Networking, Storage and Analysis, St. Louis, Missouri, USA, November 14 - 19, 2021. pages 24, ACM, 2021. [doi]

Abstract

Abstract is missing.