SyncIntellects: Orchestrating LLM Inference with Progressive Prediction and QoS-Friendly Control

Xue Lin, Zhibo Zhang, Peining Yue, Haoran Li, Jin Zhang 0003, Baoyu Fan, Huayou Su, Xiaoli Gong. SyncIntellects: Orchestrating LLM Inference with Progressive Prediction and QoS-Friendly Control. In 32nd IEEE/ACM International Symposium on Quality of Service, IWQoS 2024, Guangzhou, China, June 19-21, 2024. Pages 1-10, IEEE, 2024.

Authors

Xue Lin

Zhibo Zhang

Peining Yue

Haoran Li

Jin Zhang 0003

Baoyu Fan

Huayou Su

Xiaoli Gong
