Delay-Aware DNN Inference Throughput Maximization in Edge Computing via Jointly Exploring Partitioning and Parallelism

Jing Li, Weifa Liang, Yuchen Li, Zichuan Xu, Xiaohua Jia. Delay-Aware DNN Inference Throughput Maximization in Edge Computing via Jointly Exploring Partitioning and Parallelism. In 46th IEEE Conference on Local Computer Networks, LCN 2021, Edmonton, AB, Canada, October 4-7, 2021. pages 193-200, IEEE, 2021. [doi]

Abstract

Abstract is missing.