Tiresias: A GPU Cluster Manager for Distributed Deep Learning

Juncheng Gu, Mosharaf Chowdhury, Kang G. Shin, Yibo Zhu, Myeongjae Jeon, Junjie Qian, Hongqiang Harry Liu, Chuanxiong Guo. Tiresias: A GPU Cluster Manager for Distributed Deep Learning. In Jay R. Lorch, Minlan Yu, editors, 16th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2019, Boston, MA, February 26-28, 2019. pages 485-500, USENIX Association, 2019. [doi]

Abstract

Abstract is missing.