Reducing False Node Failure Predictions in HPC

Alvaro Frank, Dai Yang, André Brinkmann, Martin Schulz 0001, Tim Süß. Reducing False Node Failure Predictions in HPC. In 26th IEEE International Conference on High Performance Computing, Data, and Analytics, HiPC 2019, Hyderabad, India, December 17-20, 2019. pages 323-332, IEEE, 2019. [doi]

@inproceedings{FrankYBSS19,
  title = {Reducing False Node Failure Predictions in HPC},
  author = {Alvaro Frank and Dai Yang and André Brinkmann and Martin Schulz 0001 and Tim Süß},
  year = {2019},
  doi = {10.1109/HiPC.2019.00047},
  url = {https://doi.org/10.1109/HiPC.2019.00047},
  researchr = {https://researchr.org/publication/FrankYBSS19},
  cites = {0},
  citedby = {0},
  pages = {323-332},
  booktitle = {26th IEEE International Conference on High Performance Computing, Data, and Analytics, HiPC 2019, Hyderabad, India, December 17-20, 2019},
  publisher = {IEEE},
  isbn = {978-1-7281-4535-8},
}