Optimizing resource allocation for geographically-distributed inference by large language models

Tingyang Sun, Ting He, Bo Ji, Parimal Parag. Optimizing resource allocation for geographically-distributed inference by large language models. Perform. Eval., 170:102527, 2025.
