Queueing analysis of GPU-based inference servers with dynamic batching: A closed-form characterization

Yoshiaki Inoue. Queueing analysis of GPU-based inference servers with dynamic batching: A closed-form characterization. Perform. Eval., 147:102183, 2021. [doi]

Abstract

Abstract is missing.