Fairness in Serving Large Language Models

Ying Sheng 0007, Shiyi Cao, Dacheng Li, Banghua Zhu, Zhuohan Li 0001, Danyang Zhuo, Joseph E. Gonzalez, Ion Stoica. Fairness in Serving Large Language Models. In Ada Gavrilovska, Douglas B. Terry, editors, 18th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2024, Santa Clara, CA, USA, July 10-12, 2024. pages 965-988, USENIX Association, 2024. [doi]

Authors

Ying Sheng 0007

This author has not been identified. Look up 'Ying Sheng 0007' in Google

Shiyi Cao

This author has not been identified. Look up 'Shiyi Cao' in Google

Dacheng Li

This author has not been identified. Look up 'Dacheng Li' in Google

Banghua Zhu

This author has not been identified. Look up 'Banghua Zhu' in Google

Zhuohan Li 0001

This author has not been identified. Look up 'Zhuohan Li 0001' in Google

Danyang Zhuo

This author has not been identified. Look up 'Danyang Zhuo' in Google

Joseph E. Gonzalez

This author has not been identified. Look up 'Joseph E. Gonzalez' in Google

Ion Stoica

This author has not been identified. Look up 'Ion Stoica' in Google