Tetris: Memory-efficient Serverless Inference through Tensor Sharing

Jie Li, Laiping Zhao, Yanan Yang, Kunlin Zhan, Keqiu Li. Tetris: Memory-efficient Serverless Inference through Tensor Sharing. In Jiri Schindler, Noa Zilberman, editors, 2022 USENIX Annual Technical Conference, USENIX ATC 2022, Carlsbad, CA, USA, July 11-13, 2022. USENIX Association, 2022. [doi]

Abstract

Abstract is missing.