Sohaib Ahmad, Qizheng Yang, Haoliang Wang, Ramesh K. Sitaraman, Hui Guan. DiffServe: Efficiently Serving Text-to-Image Diffusion Models with Query-Aware Model Scaling. In Matei Zaharia, Gauri Joshi, Yingyan (Celine) Lin, editors, Proceedings of the Eighth Conference on Machine Learning and Systems, MLSys 2025, Santa Clara, CA, USA, May 12-15, 2025. OpenReview.net/mlsys.org, 2025.