Yongjun He 0004, Haofeng Yang, Yao Lu, Ana Klimovic, Gustavo Alonso. Resource Multiplexing in Tuning and Serving Large Language Models. In Deniz Altinbüken, Ryan Stutsman, editors, Proceedings of the 2025 USENIX Annual Technical Conference, USENIX ATC 2025, Boston, MA, USA, July 7-9, 2025. pages 1639-1655, USENIX Association, 2025. [doi]
Abstract is missing.