Accelerating Distributed Training in Heterogeneous Clusters via a Straggler-Aware Parameter Server

Huihuang Yu, Zongwei Zhu, Xianglan Chen, Yuming Cheng, Yahui Hu, Xi Li 0003. Accelerating Distributed Training in Heterogeneous Clusters via a Straggler-Aware Parameter Server. In Zheng Xiao, Laurence T. Yang, Pavan Balaji, Tao Li, Keqin Li 0001, Albert Y. Zomaya, editors, 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, HPCC/SmartCity/DSS 2019, Zhangjiajie, China, August 10-12, 2019. pages 200-207, IEEE, 2019.

Authors

Huihuang Yu

Zongwei Zhu

Xianglan Chen

Yuming Cheng

Yahui Hu

Xi Li 0003