Fairness in Serving Large Language Models

Ying Sheng 0007, Shiyi Cao, Dacheng Li, Banghua Zhu, Zhuohan Li 0001, Danyang Zhuo, Joseph E. Gonzalez, Ion Stoica. Fairness in Serving Large Language Models. In Ada Gavrilovska, Douglas B. Terry, editors, 18th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2024, Santa Clara, CA, USA, July 10-12, 2024. pages 965-988, USENIX Association, 2024. [doi]

@inproceedings{0007CLZ0ZGS24,
  title = {Fairness in Serving Large Language Models},
  author = {Ying Sheng 0007 and Shiyi Cao and Dacheng Li and Banghua Zhu and Zhuohan Li 0001 and Danyang Zhuo and Joseph E. Gonzalez and Ion Stoica},
  year = {2024},
  url = {https://www.usenix.org/conference/osdi24/presentation/sheng},
  researchr = {https://researchr.org/publication/0007CLZ0ZGS24},
  cites = {0},
  citedby = {0},
  pages = {965-988},
  booktitle = {18th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2024, Santa Clara, CA, USA, July 10-12, 2024},
  editor = {Ada Gavrilovska and Douglas B. Terry},
  publisher = {USENIX Association},
}