An Allreduce Algorithm and Network Co-design for Large-Scale Training of Distributed Deep Learning

Truong Thao Nguyen, Mohamed Wahib. An Allreduce Algorithm and Network Co-design for Large-Scale Training of Distributed Deep Learning. In Laurent Lefèvre, Stacy Patterson, Young Choon Lee, Haiying Shen, Shashikant Ilager, Mohammad Goudarzi, Adel Nadjaran Toosi, Rajkumar Buyya, editors, 21st IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing, CCGrid 2021, Melbourne, Australia, May 10-13, 2021, pages 396-405. IEEE, 2021. doi:10.1109/CCGrid51090.2021.00049

@inproceedings{NguyenW21,
  title = {An Allreduce Algorithm and Network Co-design for Large-Scale Training of Distributed Deep Learning},
  author = {Truong Thao Nguyen and Mohamed Wahib},
  year = {2021},
  doi = {10.1109/CCGrid51090.2021.00049},
  url = {https://doi.org/10.1109/CCGrid51090.2021.00049},
  researchr = {https://researchr.org/publication/NguyenW21},
  pages = {396--405},
  booktitle = {21st IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing, CCGrid 2021, Melbourne, Australia, May 10-13, 2021},
  editor = {Laurent Lefèvre and Stacy Patterson and Young Choon Lee and Haiying Shen and Shashikant Ilager and Mohammad Goudarzi and Adel Nadjaran Toosi and Rajkumar Buyya},
  publisher = {IEEE},
  isbn = {978-1-7281-9586-5},
}