MG-WFBP: Merging Gradients Wisely for Efficient Communication in Distributed Deep Learning

Shaohuai Shi, Xiaowen Chu, Bo Li. MG-WFBP: Merging Gradients Wisely for Efficient Communication in Distributed Deep Learning. IEEE Trans. Parallel Distrib. Syst., 32(8):1903-1917, 2021. [doi]

Abstract

Abstract is missing.