Efficient Management and Intelligent Fault Tolerance for HPC Interconnect Networks

Jijun Cao, Mingche Lai, Zhang Luo, Jiaqing Xu, Zhengbin Pang. Efficient Management and Intelligent Fault Tolerance for HPC Interconnect Networks. In 25th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2019, Tianjin, China, December 4-6, 2019, pages 343-351. IEEE, 2019.

@inproceedings{CaoLLXP19,
  title = {Efficient Management and Intelligent Fault Tolerance for HPC Interconnect Networks},
  author = {Jijun Cao and Mingche Lai and Zhang Luo and Jiaqing Xu and Zhengbin Pang},
  year = {2019},
  doi = {10.1109/ICPADS47876.2019.00055},
  url = {https://doi.org/10.1109/ICPADS47876.2019.00055},
  researchr = {https://researchr.org/publication/CaoLLXP19},
  pages = {343--351},
  booktitle = {25th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2019, Tianjin, China, December 4-6, 2019},
  publisher = {IEEE},
  isbn = {978-1-7281-2583-1},
}