Jenga: Effective Memory Management for Serving LLM with Heterogeneity

Chen Zhang, Kuntai Du, Shu Liu, Woosuk Kwon, Xiangxi Mo, Yufeng Wang, Xiaoxuan Liu, Kaichao You, Zhuohan Li, Mingsheng Long, Jidong Zhai, Joseph Gonzalez, Ion Stoica. Jenga: Effective Memory Management for Serving LLM with Heterogeneity. In Youjip Won, Youngjin Kwon, Ding Yuan, Rebecca Isaacs, editors, Proceedings of the ACM SIGOPS 31st Symposium on Operating Systems Principles, SOSP 2025, Lotte Hotel World, Seoul, Republic of Korea, October 13-16, 2025, pages 446-461. ACM, 2025.
