Non-Clairvoyant Scheduling of Distributed Machine Learning With Inter-Job and Intra-Job Parallelism on Heterogeneous GPUs

Fahao Chen, Peng Li, Celimuge Wu, Song Guo. Non-Clairvoyant Scheduling of Distributed Machine Learning With Inter-Job and Intra-Job Parallelism on Heterogeneous GPUs. IEEE Transactions on Cloud Computing, 12(4):1011-1025, October-December 2024.
