Compute-Optimal Resource Allocation for Distributed Large Language Model Inference in Cloud-Scale Intelligent Systems

Tejas Pravinbhai Patel, Sandeep Shivam, Viswanathan Ranganathan. Compute-Optimal Resource Allocation for Distributed Large Language Model Inference in Cloud-Scale Intelligent Systems. In 2026 14th International Symposium on Digital Forensics and Security (ISDFS), Boston, MA, USA, March 19-20, 2026. pages 1-6, IEEE, 2026. [doi]

Abstract

Abstract is missing.