Optimizing distributed training deployment in heterogeneous GPU clusters

Xiaodong Yi 0001, Shiwei Zhang, Ziyue Luo, Guoping Long, Lansong Diao, Chuan Wu, Zhen Zheng, Jun Yang, Wei Lin. Optimizing distributed training deployment in heterogeneous GPU clusters. In Dongsu Han, Anja Feldmann, editors, CoNEXT '20: The 16th International Conference on emerging Networking EXperiments and Technologies, Barcelona, Spain, December, 2020. pages 93-107, ACM, 2020. [doi]

@inproceedings{0001ZLLDWZYL20,
  title = {Optimizing distributed training deployment in heterogeneous GPU clusters},
  author = {Xiaodong Yi 0001 and Shiwei Zhang and Ziyue Luo and Guoping Long and Lansong Diao and Chuan Wu and Zhen Zheng and Jun Yang and Wei Lin},
  year = {2020},
  doi = {10.1145/3386367.3432728},
  url = {https://doi.org/10.1145/3386367.3432728},
  researchr = {https://researchr.org/publication/0001ZLLDWZYL20},
  cites = {0},
  citedby = {0},
  pages = {93-107},
  booktitle = {CoNEXT '20: The 16th International Conference on emerging Networking EXperiments and Technologies, Barcelona, Spain, December, 2020},
  editor = {Dongsu Han and Anja Feldmann},
  publisher = {ACM},
  isbn = {978-1-4503-7948-9},
}