Tetris: Memory-efficient Serverless Inference through Tensor Sharing

Jie Li, Laiping Zhao, Yanan Yang, Kunlin Zhan, Keqiu Li. Tetris: Memory-efficient Serverless Inference through Tensor Sharing. In Jiri Schindler, Noa Zilberman, editors, 2022 USENIX Annual Technical Conference, USENIX ATC 2022, Carlsbad, CA, USA, July 11-13, 2022. USENIX Association, 2022. [doi]

Authors

Jie Li

This author has not been identified. Look up 'Jie Li' in Google

Laiping Zhao

This author has not been identified. Look up 'Laiping Zhao' in Google

Yanan Yang

This author has not been identified. Look up 'Yanan Yang' in Google

Kunlin Zhan

This author has not been identified. Look up 'Kunlin Zhan' in Google

Keqiu Li

This author has not been identified. Look up 'Keqiu Li' in Google