Optimizing DNN Compilation for Distributed Training With Joint OP and Tensor Fusion

Xiaodong Yi 0001, Shiwei Zhang, Lansong Diao, Chuan Wu 0001, Zhen Zheng, Shiqing Fan, Siyu Wang, Jun Yang, Wei Lin 0016. Optimizing DNN Compilation for Distributed Training With Joint OP and Tensor Fusion. IEEE Trans. Parallel Distrib. Syst., 33(12):4694-4706, 2022. [doi]

Abstract

Abstract is missing.