Modeling and Optimizing the Scaling Performance in Distributed Deep Learning Training

Ting Liu, Tianhao Miao, Qinghua Wu, Zhenyu Li 0001, Guangxin He, Jiaoren Wu, Shengzhuo Zhang, Xingwu Yang, Gareth Tyson, Gaogang Xie. Modeling and Optimizing the Scaling Performance in Distributed Deep Learning Training. In Frédérique Laforest, Raphaël Troncy, Elena Simperl, Deepak Agarwal, Aristides Gionis, Ivan Herman, Lionel Médini, editors, WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25 - 29, 2022. pages 1764-1773, ACM, 2022. [doi]

Abstract

Abstract is missing.