Performance Efficient Layer-aware DNN Inference Task Scheduling in GPU Cluster

Hongmin Geng, Deze Zeng, Yuepeng Li. Performance Efficient Layer-aware DNN Inference Task Scheduling in GPU Cluster. In IEEE Global Communications Conference, GLOBECOM 2022, Rio de Janeiro, Brazil, December 4-8, 2022. pages 2242-2247, IEEE, 2022.

Abstract

Abstract is missing.