Analyzing the Impact of System Reliability Events on Applications in the Titan Supercomputer

Rizwan A. Ashraf, Christian Engelmann. Analyzing the Impact of System Reliability Events on Applications in the Titan Supercomputer. In IEEE/ACM 8th Workshop on Fault Tolerance for HPC at eXtreme Scale, FTXS@SC 2018, Dallas, TX, USA, November 16, 2018. pages 39-48, IEEE, 2018.

@inproceedings{AshrafE18,
  title = {Analyzing the Impact of System Reliability Events on Applications in the Titan Supercomputer},
  author = {Rizwan A. Ashraf and Christian Engelmann},
  year = {2018},
  doi = {10.1109/FTXS.2018.00008},
  url = {http://doi.ieeecomputersociety.org/10.1109/FTXS.2018.00008},
  researchr = {https://researchr.org/publication/AshrafE18},
  pages = {39--48},
  booktitle = {IEEE/ACM 8th Workshop on Fault Tolerance for HPC at eXtreme Scale, FTXS@SC 2018, Dallas, TX, USA, November 16, 2018},
  publisher = {IEEE},
  isbn = {978-1-7281-0222-1},
}