Quality-of-Service Aware LLM Routing for Edge Computing With Multiple Experts

Jin Yang, Qiong Wu 0009, Zhiying Feng, Zhi Zhou 0006, Deke Guo, Xu Chen 0004. Quality-of-Service Aware LLM Routing for Edge Computing With Multiple Experts. IEEE Trans. Mob. Comput., 24(12):13648-13662, December 2025. [doi]

Abstract

Abstract is missing.