RIBBON: cost-effective and qos-aware deep learning model inference using a diverse pool of cloud computing instances

Baolin Li, Rohan Basu Roy, Tirthak Patel, Vijay Gadepally, Karen Gettings, Devesh Tiwari. RIBBON: cost-effective and qos-aware deep learning model inference using a diverse pool of cloud computing instances. In Bronis R. de Supinski, Mary W. Hall, Todd Gamblin, editors, SC '21: The International Conference for High Performance Computing, Networking, Storage and Analysis, St. Louis, Missouri, USA, November 14 - 19, 2021. pages 24, ACM, 2021. [doi]

@inproceedings{LiRPGGT21,
  title = {RIBBON: cost-effective and qos-aware deep learning model inference using a diverse pool of cloud computing instances},
  author = {Baolin Li and Rohan Basu Roy and Tirthak Patel and Vijay Gadepally and Karen Gettings and Devesh Tiwari},
  year = {2021},
  doi = {10.1145/3458817.3476168},
  url = {https://doi.org/10.1145/3458817.3476168},
  researchr = {https://researchr.org/publication/LiRPGGT21},
  cites = {0},
  citedby = {0},
  pages = {24},
  booktitle = {SC '21: The International Conference for High Performance Computing, Networking, Storage and Analysis, St. Louis, Missouri, USA, November 14 - 19, 2021},
  editor = {Bronis R. de Supinski and Mary W. Hall and Todd Gamblin},
  publisher = {ACM},
  isbn = {978-1-4503-8442-1},
}