In-Network Aggregation with Transport Transparency for Distributed Training

Shuo Liu, Qiaoling Wang, Junyi Zhang, Wenfei Wu, Qinliang Lin, Yao Liu, Meng Xu, Marco Canini, Ray C. C. Cheung, Jianfei He. In-Network Aggregation with Transport Transparency for Distributed Training. In Tor M. Aamodt, Natalie D. Enright Jerger, Michael M. Swift, editors, Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3, ASPLOS 2023, Vancouver, BC, Canada, March 25-29, 2023. pages 376-391, ACM, 2023. [doi]

Abstract

Abstract is missing.