VELTAIR: towards high-performance multi-tenant deep learning services via adaptive compilation and scheduling

Zihan Liu, Jingwen Leng, Zhihui Zhang, Quan Chen, Chao Li, Minyi Guo. VELTAIR: towards high-performance multi-tenant deep learning services via adaptive compilation and scheduling. In Babak Falsafi, Michael Ferdman, Shan Lu 0001, Thomas F. Wenisch, editors, ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022 - 4 March 2022. pages 388-401, ACM, 2022. [doi]

@inproceedings{LiuLZCLG22,
  title = {VELTAIR: towards high-performance multi-tenant deep learning services via adaptive compilation and scheduling},
  author = {Zihan Liu and Jingwen Leng and Zhihui Zhang and Quan Chen and Chao Li and Minyi Guo},
  year = {2022},
  doi = {10.1145/3503222.3507752},
  url = {https://doi.org/10.1145/3503222.3507752},
  researchr = {https://researchr.org/publication/LiuLZCLG22},
  cites = {0},
  citedby = {0},
  pages = {388-401},
  booktitle = {ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022 - 4 March 2022},
  editor = {Babak Falsafi and Michael Ferdman and Shan Lu 0001 and Thomas F. Wenisch},
  publisher = {ACM},
  isbn = {978-1-4503-9205-1},
}