Yitang Yang, Junhong Liu, Jiapeng Chen, Xiaoyang Sun, Tianyu Wo, Chunming Hu, Chengru Song, Jin Ouyang, Renyu Yang. Kair: A Statistical and Causal Approach to Pinpointing Stragglers in Distributed Model Training. In 40th IEEE/ACM International Conference on Automated Software Engineering, ASE 2025, Seoul, Korea, Republic of, November 16-20, 2025. pages 3754-3759, IEEE, 2025. [doi]
Abstract is missing.