Fairness in Serving Large Language Models

Ying Sheng 0007, Shiyi Cao, Dacheng Li, Banghua Zhu, Zhuohan Li 0001, Danyang Zhuo, Joseph E. Gonzalez, Ion Stoica. Fairness in Serving Large Language Models. In Ada Gavrilovska, Douglas B. Terry, editors, 18th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2024, Santa Clara, CA, USA, July 10-12, 2024. pages 965-988, USENIX Association, 2024. [doi]

Abstract

Abstract is missing.