Holmes: Towards Distributed Training Across Clusters with Heterogeneous NIC Environment

Fei Yang, Shuang Peng, Ning Sun, Fangyu Wang, Yuanyuan Wang, Fu Wu, Jiezhong Qiu, Aimin Pan. Holmes: Towards Distributed Training Across Clusters with Heterogeneous NIC Environment. In Proceedings of the 53rd International Conference on Parallel Processing, ICPP 2024, Gotland, Sweden, August 12-15, 2024, pages 514-523. ACM, 2024.

Authors

Fei Yang

Shuang Peng

Ning Sun

Fangyu Wang

Yuanyuan Wang

Fu Wu

Jiezhong Qiu

Aimin Pan
