BatOpt: Optimizing GPU-Based Deep Learning Inference Using Dynamic Batch Processing

Deyu Zhang, Yunzhen Luo, Yaobo Wang, Xiaoyan Kui, Ju Ren. BatOpt: Optimizing GPU-Based Deep Learning Inference Using Dynamic Batch Processing. IEEE T. Cloud Computing, 12(1):174-185, January - March 2024. [doi]

Abstract

Abstract is missing.